Hacker News new | past | comments | ask | show | jobs | submit login
B is for Billion: Wordnik passes 9 billion record mark with MongoDB (wordnik.com)
32 points by johnny99 on July 12, 2010 | hide | past | favorite | 13 comments



Very sparse on details. Are they partitioning the data? If so how? Is it MongoDB shards or at the application level or...? What's their average record size? Do they see a long-tail kind of access such that MongoDB really does keep the most commonly used records in memory all the time? How many machines are they serving from? What are their specs (particularly RAM)?


This is pretty well covered in this presentation: http://www.slideshare.net/fehguy/migrating-from-mysql-to-mon...

A few unanswered questions (sharding? Looks like none to me), but a good bit of numbers and specifics on their current setup.


The 47.7 queries per second figure in those slides surprised me a bit. I'd like to know more details there...is that for a single node, or the entire cluster?


That's per api server, we have 4 api servers.


Video of this talk is available here: http://blip.tv/file/3621377

This article also has some details: http://www.royans.net/arch/mongodb-migration-from-mysql-at-w...


Some of these questions are answered in a separate set of slides, which the author links to in the comments of the post. Hardware looks like a single server, 2x4 core CPUs, 32 GB of RAM, and a FC SAN.


Single node, eh? I thought MongoDB was not supposed to be run in that configuration.


They're probably running 1+ Mongod process/node per core - not exactly ideal though.


We run one mongod master, one slave (soon to be two slaves). These are used across all the api servers.


Well I'd never heard of Wordnik but this is very cool and I'll be using it. This is like a respectable Dictionary.com+ Urban Dictionary. I don't see the category theory definition of a monad on it, though. Which is somewhat odd.


Hi Christopher, glad you like Wordnik. Do you mean you couldn't find this?

http://www.wordnik.com/words/monad


This is still a relatively small production database as far as large DB instances go.


As far as production websites go, you are completely wrong.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: