pseudonerv

mistral released weights for Voxtral Mini 4B Realtime 2602

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 4 comments
abliterated lora gguf works with r1-distill-qwen-32b

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 0 comments
control vectors added to llama.cpp

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 5 comments
any study on llama.cpp low bpw quants?

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 3 comments
llama github README says 3.1

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 0 comments
A simple way to "Extending Context to 8K"?!

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 1 comments
How can I get the same response from gemini-1.5-pro as lmsys?

Posted by pseudonerv@reddit | LocalLLaMA | View on Reddit | 5 comments