Hacker News new | past | comments | ask | show | jobs | submit login

Sure thing. Sorry for the previous terseness--I really really really hate this whole "Coursera --> Kaggle --> DS job at Facebook" meme when it (rarely) appears on HN, since it isn't even close to reality.

I'm not a data scientist, but I work with them very closely as an engineer and I've considered going down the same path. When I talk about data scientists, it's not a reference to any of the following:

> Engineers working with big data technology, like Hadoop, Storm, Kafka, who are essential but often uninvolved in model construction and evaluation.

> Analysts who develop models, then hand them off to engineers/IT to code them up (or keep them in Excel spreadsheets).

Instead, I'm thinking about someone with a specific background. They likely have a PhD, since that's an excellent way to experience the "ask-explore-code-test-present" workflow needed to answer an interesting question with real-world implications. The strong academic background is not necessary, but it greatly reduces friction during the research workflow (since you've spent 3-4 years in it). I'm getting a MS and working hard to make it as research-oriented as possible, fwiw.

This person also has a strong foundation in applied math. They might have worked on signal processing questions, applied algorithms for learning Bayesian network structure to proteins, or thought about the transition from Hopfield networks to RBNs or whatever awesome deep learning stuff is going on nowadays. A guy I respect described this quality as that of "a traveler," someone who can understand advanced work in a number of disciplines in addition to their specialty.

This person is an engineer. They learn languages easily, understand algorithmic complexity and think about the complexity of their models. They don't have to be Linus.

Finally, the person is forward-thinking. They understand that questions are motivated by business needs, and that answering these questions can have serious implications for the company or its partners. I should channel patio11 here!

Anyway I'm obviously very opinionated about this, but it's just one opinion. I'm happy to discuss this more with anyone who's interested, though--contact is in my profile.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: