Any stats on how many documents or GB are in the index, how big the cluster is, and how long it took to build?

For those interested:

1,212,672,153 documents across 2,866,400 repositories, taking up 17 TB of disk space over 23 Elasticsearch storage nodes fronted by 8 Elasticsearch compute nodes.
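To put those numbers in perspective, here is a rough back-of-the-envelope calculation of the averages they imply. It is only a sketch: it assumes 1 TB means 10**12 bytes and that the 17 TB is spread evenly over the storage nodes, neither of which the figures above state. The on-disk size is the index footprint, so it is not the same thing as raw source size.

```python
# Figures quoted in the comment above.
DOCUMENTS = 1_212_672_153
REPOSITORIES = 2_866_400
DISK_TB = 17
STORAGE_NODES = 23

# Assumption: 1 TB = 10**12 bytes (the comment does not say TB vs TiB).
BYTES_PER_TB = 10**12

# Average on-disk index footprint per indexed document.
bytes_per_document = DISK_TB * BYTES_PER_TB / DOCUMENTS

# Average number of indexed documents per repository.
documents_per_repository = DOCUMENTS / REPOSITORIES

# Assumption: data is spread evenly across the storage nodes.
tb_per_storage_node = DISK_TB / STORAGE_NODES

print(f"{bytes_per_document / 1000:.1f} kB of index per document")
print(f"{documents_per_repository:.0f} documents per repository")
print(f"{tb_per_storage_node:.2f} TB per storage node")
```

So, very roughly, that works out to about 14 kB of index per document, a little over 400 documents per repository, and under 1 TB per storage node.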

It took about a month to iterate over all the repositories stored on the file servers and index the source code.
