https://github.com/basho/riak_ql
Specifically:
https://github.com/basho/riak_ql/blob/develop/src/riak_ql_le...
https://github.com/basho/riak_ql/blob/develop/src/riak_ql_pa...
It is a very concise and well written piece of software.
With its help I've created scrapers and crawlers that digest even the most disgusting HTML.
Definitely interesting though.
https://github.com/hansihe/ex_html5ever
Excellent article about creating parsers though even if html parsing is a particularly difficult problem.
https://github.com/basho/riak_ql
Specifically:
https://github.com/basho/riak_ql/blob/develop/src/riak_ql_le...
https://github.com/basho/riak_ql/blob/develop/src/riak_ql_pa...
It is a very concise and well written piece of software.