Math don't check out.

Posted by Ok-Internal9317@reddit | LocalLLaMA | View on Reddit | 13 comments

Say I own a blackwell, where prompt ingestion speed is 2500tok/s; which is very fast. My context window in opencode usually is 120K, 120000/2500=48seconds 48 second I must wait for each code edit, which could've been opening a file or a simple 20 token linux command. This is unuseably slow no?