Drummer's Precog 24B and 123B v1 - AI that writes a short draft before responding

Posted by TheLocalDrummer@reddit | LocalLLaMA

Hey guys! I wanted to explore a different way of thinking where the AI uses the `<think>` block to plan ahead and create a short draft so that its *actual* response has a **basis**. It seems like a good way to have the AI plan out its start, middle, and end before writing the entire thing, kind of like a synopsis or abstract.

I'm hoping it could strengthen consistency and flow, since the AI doesn't have to *wing it* and write a thousand tokens from the get-go. It's a cheaper, more effective alternative to reasoning, especially when it comes to story / RP. You can also make adjustments to the draft to steer it a certain way. Testers have been happy with it.

24B: [https://huggingface.co/TheDrummer/Precog-24B-v1](https://huggingface.co/TheDrummer/Precog-24B-v1)

123B: [https://huggingface.co/TheDrummer/Precog-123B-v1](https://huggingface.co/TheDrummer/Precog-123B-v1)

Examples:

https://preview.redd.it/1li2viecf91g1.png?width=2264&format=png&auto=webp&s=af225606b23751beaf3076b1a58140b1c77b1a4f

https://preview.redd.it/7iu4m7zcf91g1.png?width=887&format=png&auto=webp&s=4de7655654340ec91216d8a61c93c474571b1dc0

https://preview.redd.it/3qo833ndf91g1.png?width=1010&format=png&auto=webp&s=0cac98a5e93dd87baa885bda58574385b8e73c11
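If you want to script the "adjust the draft to steer it" part, here is a rough Python sketch. It is not the model's official format or any frontend's API: it assumes the output looks like `<think>draft</think>` followed by the response (the closing `</think>` tag is a guess on my part), and `split_draft` and `build_prefill` are made-up helper names. The idea is to pull the draft out, edit it, and send it back as the start of the assistant turn so the model continues from your version of the plan.

```python
import re

# ASSUMPTION: output is "<think>draft</think>response". The post only mentions
# the <think> block; the closing tag and layout are guesses.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_draft(output: str) -> tuple[str, str]:
    """Return (draft, response). The draft is empty if no think block is found."""
    match = THINK_RE.search(output)
    if match is None:
        return "", output.strip()
    draft = match.group(1).strip()
    response = output[match.end():].strip()
    return draft, response


def build_prefill(edited_draft: str) -> str:
    """Wrap an edited draft so it can be used as the start of the assistant turn."""
    return f"<think>\n{edited_draft.strip()}\n</think>\n"


# Hypothetical model output, invented for illustration.
sample = (
    "<think>\nStart: storm hits. Middle: ship founders. End: rescue.\n</think>\n"
    "The wind rose before dawn..."
)

draft, response = split_draft(sample)

# Steer the ending by editing the draft, then reuse it as a prefill.
steered = build_prefill(draft.replace("rescue", "castaway on an island"))
print(steered)
```

How you actually feed `steered` back in depends on your frontend or backend (most have some kind of "continue" or assistant-prefill option), so treat that step as left to the reader.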