Yi-34B-200K model update: Needle-in-a-Haystack improved from 89.3% to 99.8%

Posted by rerri@reddit | LocalLLaMA | View on Reddit | 10 comments

Copy-pasta from model card: ​ >The long text capability of the Yi-34B-200K has been enhanced. > >In the "Needle-in-a-Haystack" test, the Yi-34B-200K's performance is improved by 10.5%, rising from 89.3% to an impressive 99.8%. We continue to pretrain the model on 5B tokens long-context data mixture and demonstrate a near-all-green performance. [https://huggingface.co/01-ai/Yi-34B-200K](https://huggingface.co/01-ai/Yi-34B-200K)