llama.cpp Kimi Linear llama-server bug fix

Posted by Ok_Warning2146@reddit | LocalLLaMA | View on Reddit | 0 comments

Thanks u/Lord_Pazzu for reporting Kimi Linear sometimes generates bad responses when running "llama-server --parallel 8" Now it should be fixed: [https://github.com/ggml-org/llama.cpp/pull/19531](https://github.com/ggml-org/llama.cpp/pull/19531) While waiting for this PR to merge, you can still give it a try by: git clone [https://github.com/ymcki/llama.cpp](https://github.com/ymcki/llama.cpp) \--branch Kimi-Linear Please let me know if you find any bugs.