BetEvening
-
DeepSeek-V2-Lite Finetuning w/4x 3090s?
Posted by BetEvening@reddit | LocalLLaMA | View on Reddit | 4 comments
-
Just letting models talk to themselves for a bit makes them so much more accurate in math. No other model in the LMSYS Arena ever gets close.
Posted by BetEvening@reddit | LocalLLaMA | View on Reddit | 12 comments