TheaterFire

Dolphin 3.0 !

Posted by Evening_Action6217@reddit | LocalLLaMA | View on Reddit | 54 comments

Dolphin 3.0 !

Reply to Post

54 Comments

shockwaverc13@reddit

"I'm eval'ing and stuff" \-WizardLM
View on Reddit #44338080

RemarkableRow6860@reddit

https://preview.redd.it/ob6x9a3x3ucg1.png?width=1024&format=png&auto=webp&s=af38b2146b2b98f025875a2cc7f073397de76945 me
View on Reddit #75649382

Affectionate-Cap-600@reddit

F
View on Reddit #44406825

Tight_Range_5690@reddit

RIP 🕊️🪦
View on Reddit #44355444

CtrlAltDelve@reddit

😭
View on Reddit #44340067

isr_431@reddit

Dolphin Mixtral is still a beast
View on Reddit #44350719

aiwtl@reddit

True, not happy with dolphin3
View on Reddit #45205238

Healthy-Nebula-3603@reddit

Where? In the dinosaur's era?
View on Reddit #44357368

clduab11@reddit

Is it weird I'm already feeling nostalgia being Dolphin was the very first local model I played with? And this was ***just a few months ago?!?!*** Genuinely hype for this hahaha
View on Reddit #44337505

denyicz@reddit

My first model was llama. I feel like vietnam veteran compared to you.
View on Reddit #44364589

KallistiTMP@reddit

Remember GPT-2? Pepperidge farm remembers.
View on Reddit #44495943

MorallyDeplorable@reddit

I remember feeding years of IRC logs to a markov chain and making an IRC bot to pretend to be a user. Am I literally dirt at this point?
View on Reddit #44369435

Thick-Protection-458@reddit

And I started experimenting with LLMs at times of early GPT-3 (not GPT-3.5 aka ChatGPT). I guess I am a dynosaur than.
View on Reddit #44380493

Affectionate-Cap-600@reddit

the g(old) text-davinci-003
View on Reddit #44406890

No-Trifle315@reddit

I remember when tried to finetune GPT-2, seens like another whole life since then.
View on Reddit #44487421

denyicz@reddit

Remember openai claimed gpt2 was too dangerous lol
View on Reddit #44395997

Thick-Protection-458@reddit

Frankly I was much less interested in language models back than, so I did not paid much attention. Like I was interested in NLP, but from engineering perspective encoder and encoder-decoder models was more useful, so I paid attention to them instead.
View on Reddit #44447725

denyicz@reddit

Damn. Can you tell me about big bang old man?
View on Reddit #44395978

MorallyDeplorable@reddit

I believe it was the result of DALNet collecting all the densest people in one place and creating a cosmic event.
View on Reddit #44418580

royalsail321@reddit

And I was chatting with cleverbot checkmate
View on Reddit #44403777

royalsail321@reddit

And I was chatting with cleverbot checkmate
View on Reddit #44403757

luquoo@reddit

I think I'm getting nostalgia from reading your post!
View on Reddit #44343242

MorallyDeplorable@reddit

Remember when he italicized and bolded that part? So timeless.
View on Reddit #44363656

sorehamstring@reddit

that's my type of nostalgia
View on Reddit #44387570

Optimistic_Futures@reddit

So excited to play Super Monkey Ball on it
View on Reddit #44339380

Rude-Proposal-9600@reddit

I thought it was referring to an emulator too
View on Reddit #44349928

_stevencasteel_@reddit

Not "an" emulator. "THE" emulator. Totes in the top three or five.
View on Reddit #44354260

TacticalBacon00@reddit

I just sideloaded the Android version of Dolphin on my Quest 3S this week and I can confirm that Dolphin still slaps. Super Mario Sunshine in VR with an Xbox One controller is definitely not the intended way to play, but it was able to maintain 30FPS or above on everything outside of a loading screen.
View on Reddit #44440623

drooolingidiot@reddit

Is Dolphin a finetune of llama or something?
View on Reddit #44339831

CtrlAltDelve@reddit

It's intended to be an extremely uncensored model, but I don't really know how relevant that is anymore. It used to be hugely popular.
View on Reddit #44340093

cobbleplox@reddit

Uncensored while not explicitly made for erp and generally fixing weird prompt formats to be ChatML is very, very welcome in my book. It's actually super sad how many big releases just need their shitty prompt format fixed.
View on Reddit #44353349

skrshawk@reddit

I'd written off Qwen2.5 based on their original instruct tune. I didn't get how powerful that model really was until someone came back and tuned it off of the base.
View on Reddit #44360668

MorallyDeplorable@reddit

What tunes are you having luck with?
View on Reddit #44363721

skrshawk@reddit

Consider my use-case is primarily creative writing, so tunes like EVA and Mistoria (Euryale dataset for Qwen) are my daily drivers.
View on Reddit #44378362

ThatsALovelyShirt@reddit

How would you compare those to 22B MistralSmall models?
View on Reddit #44385274

skrshawk@reddit

I wouldn't because every time you make a jump in parameter size you go up or down a level of general intelligence and reasoning. That said, there's a clear different in smarts and in both the quality and style of writing between Qwen 72B and Largestral 123B. Both have finetunes from Drummer (one of the spicier datasets out there) and you see the underlying model quality, at that level even at tiny quants of the big one. Behemoth 1.2 at IQ2_M is a stronger model than Anubis Q4, and Behemoth at Q4 is stronger still (I usually run that on Runpod with TabbyAPI and anywhere between 4-5bpw). I don't consider any model under 70B for what I do, because smaller models just don't handle multiple characters and tracking their thoughts, dialogue, and actions separately very well. You can have a good time in a 1:1 eRP scenario with the smaller models though and no doubt a lot of people do.
View on Reddit #44402806

Hoodfu@reddit

Going one further, the issue popped up that when the new llama 3's came out, they were such a leap from the old mistral models that what he was training llama on to unalign it actually made it dumber. That's when people started going the abliterated route. 
View on Reddit #44356931

a_beautiful_rhind@reddit

QvQ 72b dolphin.
View on Reddit #44337118

sassydodo@reddit

honestly I'd love to see 8b reasoning model
View on Reddit #44399651

Competitive_Ad_5515@reddit

!remindme 2 days
View on Reddit #44380896

RemindMeBot@reddit

I will be messaging you in 2 days on [**2025-01-02 03:37:31 UTC**](http://www.wolframalpha.com/input/?i=2025-01-02%2003:37:31%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/LocalLLaMA/comments/1hpqcgg/dolphin_30/m4mzfjd/?context=3) [**CLICK THIS LINK**](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5Bhttps%3A%2F%2Fwww.reddit.com%2Fr%2FLocalLLaMA%2Fcomments%2F1hpqcgg%2Fdolphin_30%2Fm4mzfjd%2F%5D%0A%0ARemindMe%21%202025-01-02%2003%3A37%3A31%20UTC) to send a PM to also be reminded and to reduce spam. ^(Parent commenter can ) [^(delete this message to hide from others.)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Delete%20Comment&message=Delete%21%201hpqcgg) ***** |[^(Info)](https://www.reddit.com/r/RemindMeBot/comments/e1bko7/remindmebot_info_v21/)|[^(Custom)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5BLink%20or%20message%20inside%20square%20brackets%5D%0A%0ARemindMe%21%20Time%20period%20here)|[^(Your Reminders)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=List%20Of%20Reminders&message=MyReminders%21)|[^(Feedback)](https://www.reddit.com/message/compose/?to=Watchful1&subject=RemindMeBot%20Feedback)| |-|-|-|-|
View on Reddit #44380953

TroyDoesAI@reddit

Eric Hartfords Dolphin is the main reason I went so heavy into LLMs and generative ai. I am excited to see the any updates.
View on Reddit #44380730

Nandakishor_ml@reddit

where are you dolphin
View on Reddit #44380442

Conscious_Nobody9571@reddit

We're not ready... 😭
View on Reddit #44361469

MustBeSomethingThere@reddit

Are fine-tunings by a single person still a thing?
View on Reddit #44335732

skrshawk@reddit

In the creative writing and RP/ERP world, I'd say there's more people who do their own thing, even FFTs, than there are groups collaborating more than loosely on what goes in their datasets.
View on Reddit #44360101

Many_SuchCases@reddit

I guess so, I used to be excited for the original dolphin's and other popular finetunes at the time, but I kind of stopped caring after a while. I think part of that is because if we look at these general finetunes where the goal is to achieve higher scores or better output in general, I feel like it's a bit of a lost cause. It's also really hard for me to take it serious when a single person claims to achieve a 5-10 point increase in benchmarks that somehow a billion(s) dollar company couldn't do. In the beginning there was *some* hope that maybe by training it on some better OpenAI data that we could improve it. And it might have worked a bit to some extend, but at this point these companies have invested so much money that if a single person who rents a few A100's could make that much of a difference, they would have done that by themselves. Think about how big Meta's datacenter is for example. They could literally run the same experiment in 2 minutes over and over and over. Not just that, but, they have a lot more advanced techniques then "just show it some different data" at this point. As far as uncensored models go, we now also have uncensored models in pretty much every niche and then in every niche of those niches. Not to mention, generally uncensored models in most areas (finetuned on external data) and then also abliterated models for those who prefer that. So personally, I'm just not interested in these anymore, but to each their own.
View on Reddit #44348831

NoLifeGamer2@reddit

She eval on my benchmark until I release
View on Reddit #44341152

smooshie@reddit

[EXTREMELY LOUD INCORRECT BUZZER]
View on Reddit #44344746

TheLogiqueViper@reddit

Test time inference? After launch of test time inference i want open source models to implement it so badly
View on Reddit #44335550

s-kostyaev@reddit

Try QwQ
View on Reddit #44344178

Sky_Linx@reddit

I’m not familiar with Dolphin as I only recently started to run local models. Is it good and how does it compare to Qwen 2.5 for example?
View on Reddit #44335324

martinerous@reddit

Dolphin are finetunes of the original models. As with every finetune, it can be a hit or miss. I usually check the base models first and go for finetunes only if I need something specific.
View on Reddit #44343053

Zemanyak@reddit

Brings up good memories. I hope Dolphin 3.0 will be as good as 2.0 when it was released.
View on Reddit #44333293