For the people not understanding why this discussion is a problem, in synopsis: ...

tptacek · on Aug 28, 2014

This summary created more questions than it eliminated.

Nobody has explained this better than Coda Hale:

http://codahale.com/how-to-safely-store-a-password/

(in place of "bcrypt", you can substitute "PBKDF2" [weaker] or "scrypt" [stronger], but all those options are just fine).

windowshopping · on Aug 28, 2014

I'm a bit confused. This article says SHA-256 is a general purpose hash algorithm, but wikipedia says it's a cryptographic hash function. Is the wiki article wrong?

owenmarshall · on Aug 28, 2014

Both are correct. A cryptographic hash function is a hash function that is not invertible, resists tampering and collisions, and has other properties. Using cryptographic hash functions in certain cryptographic settings is appropriate. SHA256("My Super Secret Password") is not one of those settings.

SHA256 can be the basis of a scheme to securely store passwords. There are many different names for that:

* key derivation function

* secure password storage scheme

* password scrambler (PHK's term)

* (...)

A secure password storage scheme uses a cryptographic primitive like a cryptographic hash or a cipher to increase the time it takes to decrypt passwords. That's a good thing, because when it takes your user 0.5 seconds to go from "right password -> on disk hash", it'll take a prohibitively long time for an attacker to guess through passwords.

windowshopping · on Aug 28, 2014

Okay, that makes sense. SHA-256 for 50 iterations, with salting: is that equivalent to bcrypt in the matter of slowing things down....is it good enough?

owenmarshall · on Aug 28, 2014

> SHA-256 for 50 iterations (...) is it good enough?

If you mean they call SHA256(SHA256(...48 more calls("password" + salt))?

That's not remotely good enough.

http://hashcat.net/oclhashcat/ -- look at the hardware and see for yourself; those values for SHA256 are million hashes per second.

So 50 rounds of SHA256 can be computed in less than a microsecond.

https://twitter.com/hashcat/status/349192539443699713 -- now, Bcrypt.

I'm not sure what their bcrypt difficulty is, but whatever it is: bcrypt makes it five or six orders of magnitude harder. And remember, bcrypt (like good KDFs) is tunable, so you could make that as hard as you want.

EDIT: Some further consideration: http://arstechnica.com/security/2012/12/25-gpu-cluster-crack...

> As a result, the new cluster, even with its four-fold increase in speed, can make only 71,000 guesses against Bcrypt and 364,000 guesses against SHA512crypt.

https://github.com/freedomofpress/securedrop/issues/180 has some more numbers as well.

wglb · on Aug 28, 2014

bcrypt makes it five or six orders of magnitude harder.

And its work factor is adjustable at run-time for the time that hardware speed increases.

windowshopping · on Aug 28, 2014

Not quite-- 50 iterations of `hash = SHA256(hash + salt)`

Any better? (EDIT: Still in process of reading links.)

owenmarshall · on Aug 28, 2014

>(EDIT: Still in process of reading links.)

Sorry, I'll try to stop incessantly editing ;-)

> Any better?

No.

General purpose cryptographic hash functions are designed to be incredibly fast.

http://www.intel.com/content/dam/www/public/us/en/documents/...

But convince yourself. Do you have Ruby? http://pastebin.com/zjLiYt2j

If I bump up your password scheme by 4 orders of magnitude it's still under <.06 seconds on my machine. And that's with a ridiculously inefficient password generator I banged up in ten seconds.

Use bcrypt. Use scrypt. Use PBKDF2. Don't roll your own password stretching function, it's a recipe for disaster.

windowshopping · on Aug 28, 2014

Thanks very much, all very clear now. I'll try to persuade the people who have a say in the matter...it's tricky when you're not really an expert yourself so you can't fully explain the matter to a perfect extent, and you're stuck saying "but people on the internet said..."

owenmarshall · on Aug 28, 2014

Half the problem is that quite a few standards/compliance groups don't impose the right requirements. PCI audits used to just check for "encrypted passwords."

OWASP does make a decent recommendation, maybe that'll help give you a leg to stand on: https://www.owasp.org/index.php/Password_Storage_Cheat_Sheet...

Alupis · on Aug 28, 2014

> PCI audits used to just check for "encrypted passwords.

PCI is a joke. The self-fill audit: "Are you secure?" -- Uh... sure?

The various scanning companies are of differing quality too, adding to the difficulty in getting any real security improvements from the standard across all ecommerce shops.

sarciszewski · on Aug 29, 2014

AFAIK, PCI is an unfunny joke. This is why my pentester friends have told me about it:

> Release v1.0 of super cool app.

> v1.0 gets PCI certified

> Someone discloses a vuln in v1.0

> Release v1.1

> Companies are stuck with v1.0 for at least 90 days until v1.1 can be certified

sarciszewski · on Aug 28, 2014

If you mention Thomas Ptacek of Matasano Security (that's who tptacek is), they might be more inclined to listen. Sadly, my name is of little value and if they had heard of it, they'd probably assume malicious intent.

sarciszewski · on Aug 28, 2014

It's a cryptographic hash function for general cryptographic usage (e.g. HMAC), but it's not a password hash function.

ddevault · on Aug 28, 2014

To expand upon that...

The problem with md5 and sha and such is that they are fast. bcrypt is useful because it takes a fixed amount of complexity which can be specified by the programmer (and later increased). It also uses salts to thwart rainbow tables. The complexity bit is important, though, because as computers get better, you can make it harder to reverse your passwords.

Also, the biggest reason any of this is a problem is because password reuse is rampant. You are being entrusted with the password to your user's life - their email, their bank account, their facebook account... treat it with respect.

pinkyand · on Aug 28, 2014

True, that's the tech behind it.

Is there any reason most programmers should know any of this ?

The code should look more like this(and should use a standard library):

import Password_protector

P = Password_protector(date_of_first_use) #library upgrades algorithms every once in a while, only a single algorithm available per date. If there's a need for automatically upgraded hashing library should handle covertly.

protected_password = P.protect_password(password)

if P.compare_password(password, protected_password) == True ...

Or something similar. That at least should cover the basics - safely.

mikeash · on Aug 28, 2014

Quite simply, the reason programmers should know about this (if they're implementing a password system) is because the library you propose often doesn't exist, and when it does exist it's often implemented incorrectly.

In an ideal world, programmers shouldn't have to worry about this stuff. But this is not an ideal world.

sarciszewski · on Aug 28, 2014

I suppose that's Python? The whitespace got chewed up by the comment system; please pastebin/gist it instead.

windowshopping · on Aug 28, 2014

I was under the impression SHA256 is valid...it's a cryptographic hash function that hasn't had weaknesses identified yet. 40 iterations of that + salting would suffice, no?

sarciszewski · on Aug 28, 2014

If you're going to use SHA256 for hashing, why not use it in PBKDF2-SHA256?

https://defuse.ca/php-pbkdf2.htm

Alupis · on Aug 28, 2014

the SHA algorithms were designed to digest data as fast as possible -- which things like bcrypt purposefully slow it down plus can perform multiple iterations with the output of one iteration feeding as the input to another.

Here is some more info: http://forums.udacity.com/questions/6016855/hashing-password...

windowshopping · on Aug 28, 2014

I do realize bcrypt is arguably the best option, but why is SHA not valid also?

tptacek · on Aug 28, 2014

Because it has no work factor. It's uniformly fast, which makes it very easy to brute force. The general-purpose strong cryptographic hashes --- your SHA2s, SHA3s, and BLAKE2s --- are all amenable to fast implementation on GPU.

What you need is a password hash, not merely a cryptographic hash.

windowshopping · on Aug 28, 2014

As noted above, this discussion prompted me to investigate my company's methods, a fairly major US website, and I found they're using SHA-256 for 50 iterations with salting. Does that slow things down sufficiently, or is that not comparable to the internal mechanism of bcrypt?

sarciszewski · on Aug 28, 2014

No. Currently, PBKDF2 (which iterates HMAC-SHA1, HMAC-SHA256, etc.) typically is used with a count in the thousands.

I'd recommend upgrading to follow best standards. If you use PHP, scrypt isn't that hard to set up.

https://scott.arciszewski.me/blog/2013/10/php-scrypt-setup

windowshopping · on Aug 28, 2014

Company is stuck with ASP.

danielweber · on Aug 28, 2014

If you change 50 to (say) 1000 you are getting most of the benefits.

The big problem you are trying to solve is "our password database just leaked; how fast can attackers figure out the passwords?" Say you can run SHA in 1 microsecond, and 1000 iterations in 1 millisecond. Even though it's 1000x slower, that's probably an unnoticeable difference as far as your performance is concerned. (YMMV)

But, anyone trying to brute-force the passwords out of your database will now have to do 1000x the work.

BTW, none of this will help you much if your users choose passwords like "abc" or "password".

(IIRC scrypt lets you blow up in size as well as CPU, which makes GPUs impractical for attacking.)

sarciszewski · on Aug 29, 2014

Yep. http://blog.ircmaxell.com/2014/03/why-i-dont-recommend-scryp...

sarciszewski · on Aug 28, 2014

http://www.dotnetnoob.com/2012/05/towards-more-secure-passwo...

windowshopping · on Aug 28, 2014

Thank you!

mikeash · on Aug 28, 2014

Because a lot of passwords aren't particularly strong, and thus are feasibly crackable when you use something like SHA-256. You can test millions or billions of candidates per second on commodity hardware, and you'll be able to recover a lot of passwords.

The purpose of systems like PBKDF2, bcrypt, scrypt, etc. is to intentionally slow the whole thing down so that it takes on the order of one second to test a candidate. This is a minor burden for normal use, since you don't check passwords very often, but it makes it vastly harder for an attacker to crack a password.

Edit: just to put some numbers on this, it looks like high-end GPUs can brute-force about 2 billion SHA-256 hashes per second. That means that an eight-character alphanumeric password could be recovered from a SHA-256 hash in less than a day, on average. Add 20 non-alphanumeric symbols to your character set and you're still only looking at about six days. Eight alpha+num+symbol characters isn't a terribly great password anymore but it still shouldn't be recoverable if your password database leaks.

sarciszewski · on Aug 28, 2014

Thank you for posting this. +1