On the HN thread on Troy's last post about this, I said [1]: *"Big sites can get...

chias · on May 23, 2019

> literally everyone who'd want to visit Facebook knows Facebook's URL

Never forget when ReadWriteWeb posted a blog post entitled "Facebook Wants to Be Your One True Login", which ended up ranking very highly for the search query "facebook login". It was subsequently innundated with comments from angry people who hated the new redesign and couldn't figure out how to log in... believing that they were on facebook, because they googled "facebook login" and clicked the first result.

https://web.archive.org/web/20100213061037/https://www.readw...

Scroll down for the comments. It's pages and pages of people frothing at the bit looking for anywhere to enter their facebook password.

viraptor · on May 23, 2019

I forgot about that one. It's interesting how people will hold on to their method of doing things, even if the workaround is harder than doing the direct thing: (one of the comments)

> for those of you that want to get in face book now just go to Bing..put in face book and search (or it will pop up) hit on face book login and it takes you to your password page...i did it.... if this ever gets back to normal I will use the address bar from now on.....

danpalmer · on May 23, 2019

> literally everyone who'd want to visit Facebook knows Facebook's URL

I'm not sure I'd agree with this. I don't think most people look at or really understand URLs or domain names. I think people assume they are a lot "fuzzier" than they are, so "facebookapp.com" or "facebook.foobar.com" or anything else would be assumed to be "Facebook" by most people.

As technical people it's easy for us to assume things that feel basic to us are at least understood but I don't think it's the case. My parents do most of their shopping online, and when I was growing up we always had computers around, but I don't think either one of them really understands URLs or things like file hierarchies, or windowing systems on desktops for example.

cwyers · on May 23, 2019

I have seen multiple people, mostly elderly, who will go to whatever search engine that loads by default in their browser, search for Facebook, and click on the link. Nothing close to everybody has Facebook's URL memorized.

danpalmer · on May 23, 2019

Not everyone knows what a URL is, or understands the difference between a search box and a URL text box.

chias · on May 23, 2019

I remember a few years ago when someone wrote a blog post about Facebook that somehow ranked higher than Facebook itself in Google search results for "facebook login", and was innundated by people who thought they were on facebook and left angry comments about how the redesign was terrible and they couldn't figure out how to log in.

A very non-trivial amount of people type "facebook" into Google and then click the first result.

Edit: found it! Scroll to the bottom for the comments. https://web.archive.org/web/20100213061037/https://www.readw...

brobdingnagians · on May 23, 2019

I found the linked page from ian.sh is extremely instructive. [1] Especially the part about how some browsers hide the domain name when the EV certificate is in use.

I'm with a company that uses EV certificate for our site, mostly just to "look even more trustworthy", even though it functionally servers very little purpose for us, and in an industry where I doubt very few clients know what it really is. For large companies the cost of an EV certificate is negligible. If someone wanted to impersonate us, they would only need to change a letter or two and get a similar `.com` domain, which would be easy to do since we have a somewhat unusual name.

The linked article also talks about the possibility of people getting a shortened domain name like `g.uk` for phishing. If a company has a wide portfolio of sites, that becomes even easier since it weakens peoples association between a canonical domain and the company.

I guess the main point is that the DNS records themselves are one of the most effective preventions of phishing since you need a mapping from a user intent to the site they arrived at. Domain names provide a memorable and easily "mentally verifiable" method of mapping from "facebook.com" -> Social Media, "gmail.com" -> Email, etc. for most people. But strings of characters are also vulnerable since they can be easily changed by a transposition, or other operation, to create a new domain name that looks _very_ similar. Since our minds are much better at recognizing words by the beginning and end, they are vulnerable to changes in letters in the middle.

It makes one wonder if there is a system that would allow much easier mental verifiability of that mapping, but which would still allow a large number of possibilities. Or whitelisting sites that you regularly visit and alerting an individual on sites not in the whitelist or which _resemble_ sites in the whitelist.

[1] https://www.typewritten.net/writer/ev-phishing/

EDIT: just to make clear, I think strings and domain checking mentally is a terrible model for verifying trust. But at least it kinda works some of the time, even though there are tons of vulnerabilities and problems. But probably better than typing in raw IP addresses, most of the time.

JohnFen · on May 23, 2019

> Especially the part about how some browsers hide the domain name when the EV certificate is in use.

This drives me nuts. Browsers should never hide the domain name (or any other part of the URL).

judge2020 · on May 23, 2019

Thankfully even iOS (I believe starting with 12.2) no longer hides the full domain name when EV is present, but EV will still make the domain's characters turn green.

tialaramex · on May 23, 2019

Security Keys fix the problem of credentials getting phished by making them unphishable (a Security Key will let you give evilguy.example credentials, but only for evilguy.example, they are utterly useless for facebook.com or goodguy.example or any other site, it dumbly refuses to help you give Evil Guy your real goodguy.example credentials)

More subtle manipulations from phishing remain possible, but they're trickier to pull off than just stealing people's passwords.

IIRC somebody has a proof like with Arrow's theorem - that the perfect desirable set of features for names can't be achieved, but I can't find it. Certainly you can see a different trade in Tor hidden services for example, bad guys can't get a name arbitrarily similar to yours, but your name is also mostly gibberish, so your users probably don't remember what it is anyway. Did they buy the last bag from greatweedxstrm4aqp.onion ? Or was it weedxgreatstre6g2q.onion ? And unlike conventional Phishing the plan is probably to break your door down and arrest you if you pick the wrong one which sucks worse than card fraud...

linsomniac · on May 23, 2019

True, but using visual inspection of the URL is problematic too: Do you mean to be at apple.com or аpple.com ?

www.xudongz.com/blog/2017/idn-phishing/

madeofpalk · on May 23, 2019

Which, isn't really a problem any more since browsers don't display Punycode any more. https://imgur.com/a/COCrJGy

half-kh-hacker · on May 23, 2019

So, if you're a non-English speaker, doesn't this render distinguishability for real IDNs completely useless?

Compare http://xn--j1ail.xn--p1ai and https://xn--d1aqf.xn--p1ai - If rendered to punycode, they look like this:

kto.rf: xn--j1ail.xn--p1ai dom.rf: xn--d1aqf.xn--p1ai

To note: Firefox (Nightly) on desktop for me rendered the false Apple domain near-identically to the real Apple, and for the two RF domains, the Cyrillic was used.

Macha · on May 23, 2019

Iirc, the rule being used is the IDN gets rendered unless it crosses character sets. So your domain entirely in Cyrillic or Kanji is fine, but mixing them with Latin characters should be a no-no.

TazeTSchnitzel · on May 23, 2019

It should be. Unfortunately even really obviously fine cases like Swedish government agencies on the Sweden TLD (försäkringskassan.se), which by the way supposedly has a proper IDN policy that browsers should respect, don't get respected by all modern browsers :(

wolf550e · on May 23, 2019

That Swedish domain doesn't have a TLS certificate for their fancy name.

Compare:

https://www.ssllabs.com/ssltest/analyze.html?d=xn--frskrings...

https://www.ssllabs.com/ssltest/analyze.html?d=www.forsakrin...

TazeTSchnitzel · on May 24, 2019

Why should they need one to have the IDN respected?

wolf550e · on May 24, 2019

I don't know what försäkringskassan.se looks like in modern browsers because it redirects (302) to https://www.forsakringskassan.se/

EDIT: woah HN did a funky thing with a URL in a post to fight against misleading domains! http://xn--frskringskassan-2kb71a.se

neogodless · on May 23, 2019

Looks like this is fixed in Chrome and Edge by default. In Firefox (release 67.0) you have to dig into about:config and flip the switch on network.IDN_show_punycode (this is discussed in the above demonstration article.)

madeofpalk · on May 23, 2019

Defined 'fixed'

airstrike · on May 23, 2019

huh... Apparently appIe.com takes you to the Amazon page for Apple products through a referral link...

nailer · on May 23, 2019

That approach works for the very well known domains, but it doesn't even work for tech giants

Who is https://withgoogle.com?

Who is https://www.microsoftedgeinsider.com?