In this specific case it's not exactly the same as running a generic load test against some website, given the advertised feature of this specific website.
You're intentionally making it too generic to portray the guy as an asshole, when that's not how things went down. Very politician-like. Yuck.
Well the best thing would be to try it yourself. I have found:
* ab has more results variation between runs
* ab will almost always report lower performance than wrk
* If you have two implementations being benchmarked, A and B and B is always faster than A. wrk will report a greater degree of performance separation between A and B.
These results are less noticeable the lower performance the site being benchmarked is.
Probably best not to try this on any systems you care about if the command is completely unknown. killall on Solaris might have some unintended consequences.