pseudonerv
-
Mistral released weights for Voxtral Mini 4B Realtime 2602
Posted by pseudonerv@reddit | LocalLLaMA | 4 comments
-
abliterated lora gguf works with r1-distill-qwen-32b
Posted by pseudonerv@reddit | LocalLLaMA | 0 comments
-
control vectors added to llama.cpp
Posted by pseudonerv@reddit | LocalLLaMA | 5 comments
-
any study on llama.cpp low bpw quants?
Posted by pseudonerv@reddit | LocalLLaMA | 3 comments
-
llama github README says 3.1
Posted by pseudonerv@reddit | LocalLLaMA | 0 comments
-
A simple way to "Extending Context to 8K"?!
Posted by pseudonerv@reddit | LocalLLaMA | 1 comment
-
How can I get the same response from gemini-1.5-pro as lmsys?
Posted by pseudonerv@reddit | LocalLLaMA | 5 comments