Cynical TLDR; We have plateaued and it has become obvious that fancy autocomplete is not and can never be close to reasoning, regardless of how many hacks and tweaks we are making.
No, I see the answers from GPT-5 are sometimes marginally better, and sometimes arguably marginally worse than GPT-4, which suggests to me that we have plateaued and even regressed.