What if each submission caused the HN engine to download the title from the referenced page. It's exceedingly unlikely that there'll be two different useful pages with the same HTML title, but whenever there's a duplicate submission, the title should be the same.
It's previously been suggested that duplications could/should be detected by looking at the page source, but surely it would be enough 99.9% of the time just to check for duplication of the <title> contents.
Thoughts? Problems? Counter-suggestions?