Sense is a next-generation platform for data analysis, statistical modeling, and business analytics. We're building amazing technology and need help at all parts of the stack.
We're a tiny company of three. You will be a core team member building amazing technology in a fast-paced, drama-free, intellectually stimulating environment. Competitive salary and equity.
= Lead Full Stack Web Developer =
* Experience building highly interactive client-side web applications (Backbone/AngularJS/Ember/etc.).
* Deep knowledge of JavaScript / NodeJS.
* Experience building large systems on AWS.
* Highly productive and independent.
= Lead UI Designer =
* Fluent in Adobe Creative Suite.
* Pixel perfect design for print, mobile, and web UI.
* Ability to lead entire UI design and branding effort.
* Knowledge of JavaScript/HTML/CSS a plus but not required.
* Interest in data visualization a plus.
= Senior Technical Developer =
* Deep knowledge of numerical and statistical computing, and familiarity with existing tools (R/Matlab/SAS/SPSS/Stata).
* Experience building big data systems.
* Fluent in C++.
* Knowledge of JavaScript/V8/NodeJS a plus.
* Love of Bayesian statistics and MCMC samplers a plus.
Both are correct, but they target different things. The disagreement is about what the target should be, and about the advantages and disadvantages of each choice. Bayesians are interested in p(unknown|data), while frequentists are interested in p(data|unknown = H0). Inference can be framed either way, but it means different things.
Are there any situations where you want to use a frequentist procedure?
I've concluded that, given a perfect, infinite-power MCMC simulator, I would always do a Gelman-style Bayesian analysis (with model falsification and improvement); in practice, though, frequentist methods are computationally convenient.
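To make "MCMC simulator" concrete, here is a minimal Metropolis sampler for a toy problem of my own choosing (a coin's bias given 9 heads in 12 flips, flat prior); the numbers and step size are hypothetical, and this is a bare sketch rather than anything resembling a full Gelman-style workflow:

```python
import math
import random

random.seed(0)  # for reproducibility

# Log-posterior for coin bias p given 9 heads, 3 tails, uniform prior.
def log_post(p):
    if not 0 < p < 1:
        return float("-inf")
    return 9 * math.log(p) + 3 * math.log(1 - p)

samples, p = [], 0.5
for _ in range(50_000):
    # Propose a small random step, accept with the Metropolis rule.
    prop = p + random.gauss(0, 0.1)
    delta = log_post(prop) - log_post(p)
    if delta >= 0 or random.random() < math.exp(delta):
        p = prop
    samples.append(p)

burned = samples[10_000:]  # discard burn-in
print(f"posterior mean of p: {sum(burned) / len(burned):.3f}")
```

The exact posterior here is Beta(10, 4), whose mean is 10/14 ≈ 0.714, so the sampled mean should land close to that.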
Inference can be framed either way but means different things.
A Bayesian posterior P(H|D,M) is the probability that hypothesis H is true given data D and modelling assumptions M.
Sure, see my link above (http://stats.stackexchange.com/a/2287/1122). If you want to put an upper bound on the worst-case probability of making a mistake, you use a p-value. If you want to express the conditional probability of a particular hypothesis given the observation (and given a prior belief), you use a posterior probability. Bayesians can also do silly things (see the cookie example with the inept Bayesian robots). In the end, there is no free lunch.
The frequentist p-value is about H0, not (directly) the hypothesis you are testing. More specifically, it denotes the probability of observing data at least as extreme as what was actually observed, assuming H0 is true.
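The two quantities being contrasted above can be computed side by side for a made-up example (9 heads in 12 flips, testing H0: the coin is fair); this is an illustrative sketch, not from the thread:

```python
from math import comb

n, k = 12, 9  # hypothetical data: 9 heads in 12 flips

# Frequentist: one-sided p-value = P(X >= 9 | H0: p = 0.5).
p_value = sum(comb(n, x) for x in range(k, n + 1)) / 2**n

# Bayesian: uniform prior over a grid of coin biases, then the
# posterior probability that the coin favors heads, P(p > 0.5 | D).
grid = [i / 1000 for i in range(1, 1000)]
likelihood = [p**k * (1 - p) ** (n - k) for p in grid]
total = sum(likelihood)
posterior_gt_half = sum(l for p, l in zip(grid, likelihood) if p > 0.5) / total

print(f"p-value under H0:         {p_value:.3f}")
print(f"posterior P(p > 0.5 | D): {posterior_gt_half:.3f}")
```

Note the two numbers answer different questions: the p-value is a statement about the data under H0, while the posterior is a statement about the hypothesis given the data (and the prior).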
Given how fast it's working, I would be shocked if a chunk of it weren't being done locally. I think the refinement may be server-side (i.e., looking at the words in context against popular search strings to see if it may have misinterpreted a word in the search string).
They are creating decentralized mechanisms for free individuals to find individually and socially beneficial outcomes. The free market organized by the U.S. legal system is one such mechanism, but certainly not the only one.
My problem with books like this is that they have almost no connection to why Bayesian statistics is successful: Bayesian statistics provides a unified recipe to tackle complex data analysis problems. Arguably the only known unified recipe.
The Bayesian book I want should emphasize how Bayes is a recipe for studying complex problems and teach a broad range of model ingredients. Learning Bayesian statistics is about becoming fluent in describing scientific problems in probabilistic language. This requires knowing how to express and compose traditional models and build new ones based on first principles.
An unfortunate reality is that you still need to know computational methods too, but that should change soon enough.
Yes, that's exactly what the objective of this book is! I am not using computation out of necessity, but rather because I think it provides leverage for understanding the concepts, and learning to (as you say) compose traditional models and build new ones.
As the book comes along, I am finding that many ideas that are hard to explain and understand mathematically can be very easy to express computationally, especially using discrete approximations to continuous distributions.
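One hypothetical sketch of the discrete-approximation idea (my example, not from the book): put a continuous prior on a grid, and Bayesian updating reduces to multiply-and-normalize. Here the unknown is a mean mu with a Normal(0, 2) prior and unit-variance noise, all made-up numbers:

```python
import math

# Discretize the continuous prior Normal(0, sd=2) on a uniform grid.
grid = [i / 100 for i in range(-500, 501)]  # mu in [-5, 5]
weights = [math.exp(-(mu**2) / (2 * 2**2)) for mu in grid]

data = [1.2, 0.7, 1.5, 0.9]  # hypothetical observations, sigma = 1

# Bayes' rule on a grid: multiply by each likelihood, then normalize.
for x in data:
    weights = [w * math.exp(-((x - mu) ** 2) / 2) for w, mu in zip(weights, grid)]
z = sum(weights)
posterior = [w / z for w in weights]

post_mean = sum(mu * w for mu, w in zip(grid, posterior))
print(f"posterior mean of mu: {post_mean:.3f}")
```

Because this model is conjugate, the grid answer can be checked against the closed form: the exact posterior mean is (sum of data) / (n + sigma^2/tau^2) = 4.3 / 4.25 ≈ 1.012, and the grid recovers it without any calculus.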
I'd recommend using as many real examples as possible. Things like forecasting, product recommendations, topic modeling, etc. While you can conceptually explain how Bayesian statistics is a unified recipe, it's incredibly hard to have this sink in with toy problems. This is especially true since many people using traditional tools are actually using advanced methods to solve real problems, so when they start reading about urns or doors it all comes across as rather academic. That's sad because the benefit of Bayesian coherency is mostly that it leads to a highly productive mode of practical data analysis.
Definitely shoot me an email at tristan@senseplatform.com if you're interested in the computational side of this area. At Sense (http://www.senseplatform.com), we're working on making applied Bayesian analysis as amazing as it should be.
E.T. Jaynes's book, "Probability Theory: The Logic of Science", may come close to what you want. It emphasizes that there are rules of thought which lead to Bayesian statistics. As such, Bayesian statistics isn't just a recipe, but the law.
Now, I can only personally vouch for the first two chapters, as I haven't read the rest yet.
You could come work at Sense (http://www.senseplatform.com), or email me at tristan@senseplatform.com. Not Google scale, but the same technological challenges without the legacy baggage.
I'm really not sure this is true. I'd like to see evidence of it, because the people I know who buy Android certainly do it for the features. That's also how they evangelize, and it seems to work. Apple has a great marketing strategy, but that doesn't mean the best competitive response is to mimic it.