Is there any use case for large models with very slow token output for batch processing?

Posted by Last_Bad_2687@reddit | LocalLLaMA

Maybe I'm influenced by the sci-fi story "The Last Answer" by Isaac Asimov, but I've always gotten a kick out of imagining a huge model like Kimi running from, say, disk. Even if it only manages 0.001 tok/sec, you could ask it a complex question and get an answer in a week.
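For a rough sense of scale, here is a quick back-of-the-envelope sketch in Python. The helper names are made up for illustration, and it assumes a constant generation rate and ignores prompt processing time, which would probably be far from negligible on a setup like this.

```python
SECONDS_PER_DAY = 24 * 60 * 60
SECONDS_PER_WEEK = 7 * SECONDS_PER_DAY


def tokens_generated(tok_per_sec: float, seconds: float) -> float:
    """Tokens produced in `seconds` at a constant rate (assumption: no prefill cost)."""
    return tok_per_sec * seconds


def days_to_generate(num_tokens: int, tok_per_sec: float) -> float:
    """Days needed to emit `num_tokens` at a constant rate."""
    return num_tokens / tok_per_sec / SECONDS_PER_DAY


if __name__ == "__main__":
    rate = 0.001  # tok/sec, the figure from the post
    print(f"Tokens in one week: {tokens_generated(rate, SECONDS_PER_WEEK):.1f}")
    print(f"Days for a 500-token answer: {days_to_generate(500, rate):.2f}")
```

So a week at that rate buys roughly 600 tokens, which is a few paragraphs of answer with not much room for long reasoning.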

Is there any use for this, or a community focused on it?