Hacker News new | past | comments | ask | show | jobs | submit login

> 14,736 versus 21,231 at 95%

So about 50% more words for French. The very extreme tail is less interesting because it captures the size of the dictionary, and words very rarely used.

I think the exponent of the tail would be the most relevant metric, but I can't open those pdf. Can someone plot the inverse CDF and make a log-log plot?




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: