Best LLM router: comparison

Posted by GrandMoo1@reddit | LocalLLaMA | View on Reddit | 17 comments

I was recently tasked to look into LLM routers as the company I'm working for wants to start working more with AI orchestration and LLM routing. With the growing AI infrastructure solutions, I started looking more in depth into these platforms.

The task is definitely not easy and I was looking into different services with the main key capabilities that impact ease of use, cost and performance. However, I created this cheat sheet where I was trying to compare a range of different features that make the platforms effective when it comes to managing and deploying large language models.

https://docs.google.com/spreadsheets/d/1Xx7vE2rV1UoknzDnYcwxm1Hsof3ZPDtjt4z_E2AQGN4/edit?gid=0#gid=0

My main considerations:

All the current tools in this table are for sure different and have different features as well as capabilities but I wanted to gather everything in one place and make them somewhat comparable, as you can summarize certain aspects of said features.

It has really made it easier for me and while it's not perfect and some things are difficult to compare due to different criteria, I hope it will be useful to at least some of you, as this is the best I've got.

Currently, I've reviewed these LLM routers: Portkey, TrueFoundry, Martian, Pruna AI and Unify, but I will constantly be adding new ones.

Any kind of suggestions or feedback from you are welcome!