Llama 3 8B instruct with fixed BPE tokenizer uploaded
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 39 comments
https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF
I know it was just a week ago when I posted claiming "full support for Llama 3 in GGUF", but as I'm sure you all know there was a BPE tokenizer bug
This is with the fix now, and running it with the latest llama.cpp ./main, we can see that even the Q2_K model gets the simple addition correct:
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are a helpful, smart, kind, and efficient AI assistant. You always fulfill the user's requests to the best of your ability.<|eot_id|><|start_header_id|>user<|end_header_id|>
What is 7777 + 3333?<|eot_id|><|start_header_id|>assistant<|end_header_id|>
The answer is: 11110<|eot_id|> [end of text]
These models will also work if you haven't updated to latest llama.cpp, but will still have the old broken tokenizer until you get your tool updated.
So feel free to download now in anticipation for support! I hear LM Studio should be updated by tomorrow
39 Comments
_Zibri_@reddit
noneabove1182@reddit (OP)
SomeOddCodeGuy@reddit
Deathcrow@reddit
mikael110@reddit
Calcidiol@reddit
mikael110@reddit
0x9e3779b1@reddit
mikael110@reddit
0x9e3779b1@reddit
Acceptable_Total_937@reddit
LocoLanguageModel@reddit
noneabove1182@reddit (OP)
Calcidiol@reddit
noneabove1182@reddit (OP)
mikael110@reddit
noneabove1182@reddit (OP)
Tall-Entrepreneur686@reddit
noneabove1182@reddit (OP)
mikael110@reddit
aseichter2007@reddit
nsfw_throwitaway69@reddit
noneabove1182@reddit (OP)
DNskfKrH8Ekl@reddit
Healthy-Nebula-3603@reddit
noneabove1182@reddit (OP)
jayFurious@reddit
noneabove1182@reddit (OP)
Healthy-Nebula-3603@reddit
Some_Endian_FP17@reddit
noneabove1182@reddit (OP)
OpusLatericium@reddit
noneabove1182@reddit (OP)
OpusLatericium@reddit
daHaus@reddit
noneabove1182@reddit (OP)
rngesius@reddit
noneabove1182@reddit (OP)
rngesius@reddit