You can do this without sorting: awk '!x[$0]++' | Hacker News

sheetjs on Aug 27, 2014 | on: Useful Unix commands for exploring data

You can do this without sorting:

    awk '!x[$0]++'
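
For anyone puzzling over the one-liner, here is a rough expansion. `x` is an associative array keyed by the whole line (`$0`); `x[$0]++` yields the count before incrementing, so `!x[$0]++` is true only the first time a line is seen, and awk's default action prints it. The function name `dedupe` and the array name `seen` below are just for illustration:

```shell
# dedupe: print each distinct input line once, in first-seen order.
# Hypothetical wrapper; the awk program is the one-liner written out.
dedupe() {
    awk '{
        if (seen[$0] == 0) {   # first time this exact line appears
            print
        }
        seen[$0]++             # remember the line and count it
    }' "$@"
}
```

With no arguments it reads standard input, e.g. `printf 'b\na\nb\n' | dedupe`; with file names it reads those files instead.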

_delirium on Aug 27, 2014

That's usually faster where possible, but it may cause problems on large data sets, since it loads the entire set of unique strings (and their counts) into an in-memory hash table.
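
If the set of unique lines may not fit in memory, a sort-based fallback is one option, at the cost of losing the original line order. This is only a sketch: the function name `dedupe_sorted` is made up, and `LC_ALL=C` is set on the assumption that you want byte-wise comparison so that only byte-identical lines count as duplicates:

```shell
# dedupe_sorted: print each distinct input line once, in sorted order.
# Hypothetical wrapper around sort -u; unlike the awk version it does
# not keep first-seen order.
dedupe_sorted() {
    LC_ALL=C sort -u "$@"
}
```

For example, `printf 'b\na\nb\n' | dedupe_sorted` prints `a` and then `b`.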

jingo on Aug 27, 2014

I use something like this every day:

    awk '!($0 in a) {a[$0]; print}'

I rarely if ever use uniq to remove duplicates. Sorting is expensive.
