Curb Your Inference: AICI for rewriting context in real time, constrained generation, backtracking KV-cache

Posted by tucnak@reddit | LocalLLaMA | View on Reddit | 14 comments