Is it weird I'm already feeling nostalgia being Dolphin was the very first local model I played with? And this was ***just a few months ago?!?!***
Genuinely hype for this hahaha
Frankly I was much less interested in language models back than, so I did not paid much attention.
Like I was interested in NLP, but from engineering perspective encoder and encoder-decoder models was more useful, so I paid attention to them instead.
I just sideloaded the Android version of Dolphin on my Quest 3S this week and I can confirm that Dolphin still slaps. Super Mario Sunshine in VR with an Xbox One controller is definitely not the intended way to play, but it was able to maintain 30FPS or above on everything outside of a loading screen.
Uncensored while not explicitly made for erp and generally fixing weird prompt formats to be ChatML is very, very welcome in my book. It's actually super sad how many big releases just need their shitty prompt format fixed.
I'd written off Qwen2.5 based on their original instruct tune. I didn't get how powerful that model really was until someone came back and tuned it off of the base.
I wouldn't because every time you make a jump in parameter size you go up or down a level of general intelligence and reasoning. That said, there's a clear different in smarts and in both the quality and style of writing between Qwen 72B and Largestral 123B. Both have finetunes from Drummer (one of the spicier datasets out there) and you see the underlying model quality, at that level even at tiny quants of the big one. Behemoth 1.2 at IQ2_M is a stronger model than Anubis Q4, and Behemoth at Q4 is stronger still (I usually run that on Runpod with TabbyAPI and anywhere between 4-5bpw).
I don't consider any model under 70B for what I do, because smaller models just don't handle multiple characters and tracking their thoughts, dialogue, and actions separately very well. You can have a good time in a 1:1 eRP scenario with the smaller models though and no doubt a lot of people do.
Going one further, the issue popped up that when the new llama 3's came out, they were such a leap from the old mistral models that what he was training llama on to unalign it actually made it dumber. That's when people started going the abliterated route.
I will be messaging you in 2 days on [**2025-01-02 03:37:31 UTC**](http://www.wolframalpha.com/input/?i=2025-01-02%2003:37:31%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/LocalLLaMA/comments/1hpqcgg/dolphin_30/m4mzfjd/?context=3)
[**CLICK THIS LINK**](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5Bhttps%3A%2F%2Fwww.reddit.com%2Fr%2FLocalLLaMA%2Fcomments%2F1hpqcgg%2Fdolphin_30%2Fm4mzfjd%2F%5D%0A%0ARemindMe%21%202025-01-02%2003%3A37%3A31%20UTC) to send a PM to also be reminded and to reduce spam.
^(Parent commenter can ) [^(delete this message to hide from others.)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Delete%20Comment&message=Delete%21%201hpqcgg)
*****
|[^(Info)](https://www.reddit.com/r/RemindMeBot/comments/e1bko7/remindmebot_info_v21/)|[^(Custom)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5BLink%20or%20message%20inside%20square%20brackets%5D%0A%0ARemindMe%21%20Time%20period%20here)|[^(Your Reminders)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=List%20Of%20Reminders&message=MyReminders%21)|[^(Feedback)](https://www.reddit.com/message/compose/?to=Watchful1&subject=RemindMeBot%20Feedback)|
|-|-|-|-|
In the creative writing and RP/ERP world, I'd say there's more people who do their own thing, even FFTs, than there are groups collaborating more than loosely on what goes in their datasets.
I guess so, I used to be excited for the original dolphin's and other popular finetunes at the time, but I kind of stopped caring after a while.
I think part of that is because if we look at these general finetunes where the goal is to achieve higher scores or better output in general, I feel like it's a bit of a lost cause. It's also really hard for me to take it serious when a single person claims to achieve a 5-10 point increase in benchmarks that somehow a billion(s) dollar company couldn't do.
In the beginning there was *some* hope that maybe by training it on some better OpenAI data that we could improve it. And it might have worked a bit to some extend, but at this point these companies have invested so much money that if a single person who rents a few A100's could make that much of a difference, they would have done that by themselves. Think about how big Meta's datacenter is for example. They could literally run the same experiment in 2 minutes over and over and over. Not just that, but, they have a lot more advanced techniques then "just show it some different data" at this point.
As far as uncensored models go, we now also have uncensored models in pretty much every niche and then in every niche of those niches. Not to mention, generally uncensored models in most areas (finetuned on external data) and then also abliterated models for those who prefer that.
So personally, I'm just not interested in these anymore, but to each their own.
Dolphin are finetunes of the original models. As with every finetune, it can be a hit or miss. I usually check the base models first and go for finetunes only if I need something specific.
54 Comments
shockwaverc13@reddit
RemarkableRow6860@reddit
Affectionate-Cap-600@reddit
Tight_Range_5690@reddit
CtrlAltDelve@reddit
isr_431@reddit
aiwtl@reddit
Healthy-Nebula-3603@reddit
clduab11@reddit
denyicz@reddit
KallistiTMP@reddit
MorallyDeplorable@reddit
Thick-Protection-458@reddit
Affectionate-Cap-600@reddit
No-Trifle315@reddit
denyicz@reddit
Thick-Protection-458@reddit
denyicz@reddit
MorallyDeplorable@reddit
royalsail321@reddit
royalsail321@reddit
luquoo@reddit
MorallyDeplorable@reddit
sorehamstring@reddit
Optimistic_Futures@reddit
Rude-Proposal-9600@reddit
_stevencasteel_@reddit
TacticalBacon00@reddit
drooolingidiot@reddit
CtrlAltDelve@reddit
cobbleplox@reddit
skrshawk@reddit
MorallyDeplorable@reddit
skrshawk@reddit
ThatsALovelyShirt@reddit
skrshawk@reddit
Hoodfu@reddit
a_beautiful_rhind@reddit
sassydodo@reddit
Competitive_Ad_5515@reddit
RemindMeBot@reddit
TroyDoesAI@reddit
Nandakishor_ml@reddit
Conscious_Nobody9571@reddit
MustBeSomethingThere@reddit
skrshawk@reddit
Many_SuchCases@reddit
NoLifeGamer2@reddit
smooshie@reddit
TheLogiqueViper@reddit
s-kostyaev@reddit
Sky_Linx@reddit
martinerous@reddit
Zemanyak@reddit