Hacker Newsnew | past | comments | ask | show | jobs | submit | hypernewbie's commentslogin

Yeah if you're looking at sv.txt, completely agree. That's the source depot I'm aggregating from before filtering. The actual filtered output is in vbw.csv.

A lot of the zh.txt and en.txt have the exact same problem. That's exactly why I made this project, to collect thesr and filter out.

Maybe I can push the filtering more.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: