Setting up a local LLM system and charging tokens back to the company

Posted by Wa1ker1@reddit | LocalLLaMA | 7 comments

With all the recent issues I'm having with Claude and Codex, it's more and more clear to me that I need a comparable large local LLM I can rely on for work assistance.

I have my own company but also work with another company that refuses to hire more staff for me. For two weeks I've been arguing that I need more people, and I keep getting pushback even though they keep increasing the workload. They would rather outsource the work or pay more for AI services. As an example: when I told them to give me 10-12k plus 6k a month for an employee, they instead signed a 1-year contract at 25k a month that we can't even work with.

After speaking with our CFO, the best solution is to build out what I need out of pocket, cancel the current services, and bill them monthly for token usage at fair market value prices, rather than buying equipment a little at a time.

This would give me an immediate deduction for the equipment and a way to recover the cost and become profitable in a couple of years. It would also let me charge other clients I work with directly for token usage, and monitor the extra electricity usage so I can charge that back too.

I'll be heavily reliant on new models coming out from Kimi and MiniMax, but possibly without the issues I currently have with downtime and the models seeming to get dumber by the day. The point is to have a reliable system in place locally.

I'm not talking about building a system for 50 users, just for myself and maybe one or two more on the team.

Has anyone done this, or have thoughts on whether it's worth it? I also have 2 companies I may contract with in the next couple of months that have agreed to a 10-12k equipment expense budget as well.