NVIDIA has LLMRouter https://build.nvidia.com/nvidia/llm-router
LlamaIndex also supports routing https://docs.llamaindex.ai/en/stable/examples/low_level/rout...
semantic-router seems interesting https://github.com/aurelio-labs/semantic-router/
you could also just use LangChain to route https://jimmy-wang-gen-ai.medium.com/llm-router-in-langchain...
interesting paper: "PickLLM: Context-Aware RL-Assisted Large Language Model Routing"
https://arxiv.org/abs/2412.12170
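
for anyone wondering what similarity-based routing boils down to, here's a rough toy sketch. To be clear, this is NOT the API of any of the libraries linked above: the route names, example utterances, threshold, and the `route` function are all made up for illustration, and it uses plain word-count cosine similarity where a real router would use embeddings.

```python
import math
import re
from collections import Counter

# Hypothetical routes: the model names and example utterances are
# invented for this sketch and do not come from any linked project.
ROUTES = {
    "code-model": [
        "write a python function",
        "fix this bug in my code",
        "explain this stack trace",
    ],
    "chat-model": [
        "tell me a joke",
        "how is the weather today",
        "recommend a good movie",
    ],
}


def bag_of_words(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def route(query, threshold=0.2, default="chat-model"):
    """Return the route whose closest example utterance best matches the
    query, or `default` when nothing scores at least `threshold`."""
    q = bag_of_words(query)
    best_name, best_score = default, 0.0
    for name, utterances in ROUTES.items():
        score = max(cosine(q, bag_of_words(u)) for u in utterances)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else default


if __name__ == "__main__":
    print(route("please fix the bug in this code"))
    print(route("tell me a funny joke"))
```

swapping `bag_of_words` for a sentence-embedding call is the obvious next step; the threshold and the fallback route are the two knobs you'd tune first.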