indrasmirror
-
Got MTP + TurboQuant running — Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 76 comments
-
Finetune LLama3 - Dataset format?
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 4 comments