Yi-34B-200K model update: Needle-in-a-Haystack improved from 89.3% to 99.8%
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 10 comments
Copy-pasta from model card:
​
>The long text capability of the Yi-34B-200K has been enhanced.
>
>In the "Needle-in-a-Haystack" test, the Yi-34B-200K's performance is improved by 10.5%, rising from 89.3% to an impressive 99.8%. We continue to pretrain the model on 5B tokens long-context data mixture and demonstrate a near-all-green performance.
[https://huggingface.co/01-ai/Yi-34B-200K](https://huggingface.co/01-ai/Yi-34B-200K)
10 Comments
bassgojoe@reddit
Beginning_Category64@reddit
VicboyV@reddit
Mescallan@reddit
Vehnum@reddit
Mescallan@reddit
aggracc@reddit
SeymourBits@reddit
aggracc@reddit
Paradigmind@reddit