But you know what? It could have been solved with general intelligence. But by defining it as some kind of major breakthrough a bunch of money got spent on solving just that one problem and then declaring victory by the players. I predict that this will happen for anything that gets pointed at as 'the' example of what a general AI would solve.
A better benchmark would be that a single piece of software that is not specialized is able to solve two of these problems.
This is trivial to do right now: Build a software program that recognizes the problem it is being set(limited list of problems it can handle at first, ie. extract text from an image, tell me if an image is a cat or a dog, play a chess game, play a go game) and once it has identified the problem get it to call into a specialized program for that problem(chess playing, OCR whatever).
As you want to support more use cases, you just need to plug in a new subprogram and modify the top level problem recognizer.
A chat bot sitting on top of a bunch of specialized systems, basically.
This is clearly still not general intelligence and probably not super useful, but hey.
The true benchmark for intelligence is a piece of software that is presented with some pattern and told to turn it into some other pattern and then figures out how to do it. This probably requires general and deep language comprehension(ie. get me a list of emails for environmental managers at US public water utilities. system needs to have some idea about google, some idea about the EPA and their web resources as a starting point for getting that dataset, some ability to google, read and understand web page data etc).
Obviously you can cheat by just having a giant database of information, but at some point you have to be able to answer questions that are not in the database.
A better benchmark would be that a single piece of software that is not specialized is able to solve two of these problems.