PSA: Phi-4 GGUFs (from official release) don't work in Ollama (yet)
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 1 comments
I assumed no changes were needed (since it's the same arch) but there were changes made here in llama.cpp that haven't been picked up by them yet (they are aware):
https://github.com/ggerganov/llama.cpp/pull/10817/files
In the meantime LM Studio is confirmed working, and Koboldcpp seems to have picked up the changes as well
I can't tell if text-gen-webui would work, it's using llama-cpp-python 0.3.5 which is 4 weeks old where the llama.cpp changes were made 3 weeks ago, so I assume that it wouldn't work (unless it has its own workaround)
1 Comments
noneabove1182@reddit (OP)