It sounds like you haven't actually understood that argument. "The guy in the room" has always been a minor detail. Furthermore, machine translation has existed since before convolutional neural nets, so your whole point falls under the "not even wrong" category.