You do not even know the memory load, what else was running on the server at the same time, and so on. It is like throwing one blue die and one yellow die: you get a six on the first and a three on the second, and conclude from your not terribly scientific benchmark that blue dice roll higher.
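To make the dice point concrete, here is a minimal sketch in Python. It assumes two fair six-sided dice; the helper name `roll_many`, the seeds, and the sample size are made up for illustration. A single roll of each die tells you nothing, while the averages over many rolls both settle near 3.5.

```python
import random
import statistics


def roll_many(n, seed):
    """Roll a fair six-sided die n times (hypothetical helper, fixed seed for repeatability)."""
    rng = random.Random(seed)
    return [rng.randint(1, 6) for _ in range(n)]


# Two identical dice; only the "colour" (here, the seed) differs.
blue = roll_many(10_000, seed=1)
yellow = roll_many(10_000, seed=2)

# One roll each: whatever gap you see is pure chance.
print("single roll: blue =", blue[0], "yellow =", yellow[0])

# Many rolls: both means approach the expected value of 3.5.
blue_mean = statistics.mean(blue)
yellow_mean = statistics.mean(yellow)
print("mean of 10000 rolls: blue =", round(blue_mean, 3), "yellow =", round(yellow_mean, 3))

# The spread of individual rolls shows how noisy a single sample is.
print("standard deviation of blue rolls:", round(statistics.stdev(blue), 3))
```

The same applies to a benchmark: one timed run per variant is a single roll, so repeat the runs and compare the distributions before drawing a conclusion.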
Recommended reading for the author:
Not a joke, this is a really good book about the testing side of statistics.