dubesor86

Token impact by long-Chain-of-Thought Reasoning Models

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 21 comments
LLM Chess tournament - Single-elimination (includes DeepSeek & Llama models)

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 12 comments
Compared DeepSeek-R1 to DeepSeek-R1-Zero: surprising results

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 23 comments
I ran o1-preview through my small-scale benchmark, and it scored nearly identically to Llama 3.1 405B

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 66 comments
Dubesor LLM Benchmark table

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 7 comments
Llama 3.1 405B Instruct - Top 5 Overall in my own testing

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 18 comments
Small-scale personal benchmark results (28 models tested)

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 31 comments
Llama-3-70b: performs very similarly to Mistral Medium in my testing

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 7 comments
Completely novel to me: im-also-a-good-gpt2-chatbot on LMSYS Arena using code blocks to draw diagrams to supplement its explanations

Posted by dubesor86@reddit | LocalLLaMA | View on Reddit | 10 comments