The actual complaint is about using their copyrighted works to train the LLM without a license. OpenAI is claiming it's fair use; the authors disagree. It's going to take a ruling from a judge to get clarity on the issue, and no matter what, it'll be appealed until it hits the Supreme Court.
That's what discovery will be for. The complaint alleges that the likely source was libgen. Most of these authors haven't released DRM-free ebooks, and it seems unlikely that OpenAI has a large-scale book-scanning effort (and even if they did, the authors would likely claim that to be infringement itself).
What if it never accessed the book, but read everything relevant, like episode summaries, fan wikis, and forum discussions? It would still be just as conversant. Is it still infringement?