There are two broad approaches to on-chain inference: consensus-based schemes, where several parties run the inference and each submits a claimed result, and zkML, where one party runs the inference and proves cryptographically that the result is correct.
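To make the consensus-based idea concrete, here is a minimal Python sketch of how claimed results might be settled off-chain before anything is written on-chain. It is only an illustration under stated assumptions: the function names (`result_digest`, `settle`) and the 2/3 quorum are hypothetical, it assumes honest parties produce byte-identical outputs (i.e. deterministic inference), and it leaves out things a real protocol would need, such as staking, slashing, or dispute resolution.

```python
import hashlib
from collections import Counter
from typing import Dict, Optional


def result_digest(output: bytes) -> str:
    """Hash a claimed inference output so claims can be compared cheaply."""
    return hashlib.sha256(output).hexdigest()


def settle(
    claims: Dict[str, bytes],
    quorum_num: int = 2,   # hypothetical quorum of 2/3, chosen for illustration
    quorum_den: int = 3,
) -> Optional[bytes]:
    """Return the output that at least quorum_num/quorum_den of parties agree on.

    `claims` maps a party id to the output that party claims the model produced.
    Returns None if there are no claims or no output reaches the quorum.
    """
    if not claims:
        return None

    digests = {party: result_digest(out) for party, out in claims.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]

    # Integer comparison avoids floating-point rounding at the quorum boundary.
    if votes * quorum_den < quorum_num * len(claims):
        return None

    # Return the actual output bytes behind the winning digest.
    for party, digest in digests.items():
        if digest == winner:
            return claims[party]
    return None


if __name__ == "__main__":
    claims = {"node-a": b"cat", "node-b": b"cat", "node-c": b"dog"}
    print(settle(claims))
```

With zkML this voting step goes away: a single party submits the result along with a proof, and the chain checks the proof instead of comparing claims.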
zkML can be done with general-purpose ZK libraries, since they support arbitrary computations, or with specialized tools for proving ML inference, such as https://github.com/ddkang/zkml. Proving very large models such as LLMs is currently quite expensive, but there's a lot of ongoing work to make it more practical.