llama.cpp Kimi Linear llama-server bug fix
Posted by Ok_Warning2146@reddit | LocalLLaMA | View on Reddit | 0 comments
Thanks u/Lord_Pazzu for reporting Kimi Linear sometimes generates bad responses when running "llama-server --parallel 8"
Now it should be fixed:
[https://github.com/ggml-org/llama.cpp/pull/19531](https://github.com/ggml-org/llama.cpp/pull/19531)
While waiting for this PR to merge, you can still give it a try by:
git clone [https://github.com/ymcki/llama.cpp](https://github.com/ymcki/llama.cpp) \--branch Kimi-Linear
Please let me know if you find any bugs.
0 Comments