Fimbulvetr 11b v2.1 16k Released

[-]

delveccio@reddit

Can someone possibly help me? This works no problem in Oobabooga, but when I use SillyTavern, it spits gibberish. If I use a GGUF, it works in either. Even if I use the same settings I use with the GGUF, it goes weird if I use the non-GGUF version. Why?

Reply

[-]

s101c@reddit

As a long-time user of Fimbulvetr v2, I will be checking it asap, thanks a lot Sao10K! One question: Solar (which is the basis of this model) has a maximum context window of 4096 tokens. Going beyond that most likely degrades the reasoning capabilities of the model – are there any benchmark comparisons with the regular v2 model?

Reply

[-]

Jim__my@reddit

"I recommend sticking up to 12K context, but loading the model at 16K for inference. It has a really accurate context up to 10K from multiple different extended long context tests. 16K works fine for roleplays, but not for more detailed tasks."

Reply

[-]

delveccio@reddit

I'm sorry if this isn't the right place to ask, but how do I set that in Oobabooga? If I chose llama.cpp I have these options but I have to choose "Transformers" and that only lets me set a ratio for Rope and context rather than a specific value.

Reply

[-]

Jim__my@reddit

Below a comment under a 20 day old post isn't a great place to ask. I don't use Oobabooga, but I'd recommend checking the GitHub page. Or download LM studio for a seamless UI.

Reply

[-]

delveccio@reddit

I don't know what I'm doing wrong but this only generates gibberish for me. I've tried my old Fimbulvetr settings (which worked great with the 4k model), I've tried Alpaca, I've tried Default, I've tried Universal Light settings.

Reply

[-]

mangkook@reddit

Just tried. Same template as v2. Directly using ollama terminal, it seems broken. Gibberish, cut off halfway etc. . But when I tried with enchanted or chatbox on ipad. it works. The output is not as good as usual. Disregard and forgetting instruction. Tried different temp. Not helping. As of now v2 still reign. Even other merge like holodeck is better than this v2.1. Fimbulvetr holodeck merge also has 32k context but it's nowhere near the advertised value, in my test.

Reply

[-]

What_Do_It@reddit

Why does no one on hugging face explain the intended use case of their model?

Reply

[-]

_chuck1z@reddit

It's explained on its first release - Fimbulvetr 10.7B v1. It's also a pretty popular model for that certain use case

Reply

[-]

myfairx@reddit

Nice. I’m writing a very extended steered story using v2. V2.1 might fix what’s wrong with extended context before need summarization

Reply

[-]

wakigatameth@reddit

This model is broken. Previous Fimbuletr v2 iMat allowed the NPCs some thoughts. Including self-preservation. . This one, at Q8_0... I can hand an NPC a gun and ask them to shoot themselves, and they will just do it. No amount of prompting about "not obeying" will fix this. This LLM was taught on a deep level that NPCs should mindlessly comply with user's requests, even if that behavior is overridden in the prompt.

Reply

[-]

toothpastespiders@reddit

That's really cool news. I pretty much passed on both solar and fimbulvetr until recently due to the combination of small size and low context. But on finally giving it a chance I was surprised at how solid it is. I'd been using standard rope with llama.cpp to get it to 12k, but a more tweaked solution is intriguing.

Reply

[-]

Languages_Learner@reddit

A few hours ago Sao10K also uploaded a model for group roleplay - [Sao10K/L3-8B-Chara-v1-Alpha · Hugging Face](https://huggingface.co/Sao10K/L3-8B-Chara-v1-Alpha). So i made q8 gguf for it: [NikolayKozloff/L3-8B-Chara-v1-Alpha-Q8\_0-GGUF · Hugging Face](https://huggingface.co/NikolayKozloff/L3-8B-Chara-v1-Alpha-Q8_0-GGUF)

Reply

[-]

drgreenair@reddit

Reply

[-]

cynerva@reddit

I'm guessing you'll get better results with Alpaca or Vicuna formatting. There are examples in the Fimbulvetr v2 model card: https://huggingface.co/Sao10K/Fimbulvetr-11B-v2

Reply

[-]

drgreenair@reddit

Saving this post for later gonna try it out on HF’s inferences!

Reply

[-]

PopularPrivacyPeople@reddit

quants are up [https://huggingface.co/mradermacher/Fimbulvetr-11B-v2.1-16K-i1-GGUF](https://huggingface.co/mradermacher/Fimbulvetr-11B-v2.1-16K-i1-GGUF)

Reply

Fimbulvetr 11b v2.1 16k Released

Reply to Post

17 Comments

delveccio@reddit

s101c@reddit

Jim__my@reddit

delveccio@reddit

Jim__my@reddit

delveccio@reddit

mangkook@reddit

What_Do_It@reddit

_chuck1z@reddit

myfairx@reddit

wakigatameth@reddit

toothpastespiders@reddit

Languages_Learner@reddit

drgreenair@reddit

cynerva@reddit

drgreenair@reddit

PopularPrivacyPeople@reddit