visionsmemories's Comments

[-]

visionsmemories@reddit

is the code open source anywhere?

Reply

[-]

sources [https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora) [https://x.com/legit\_rumors/status/1861431113408794898/photo/1](https://x.com/legit_rumors/status/1861431113408794898/photo/1) note that this is sora turbo which means likely not the biggest model they have

Reply

[-]

visionsmemories@reddit

thanks mate :D

Reply

[-]

visionsmemories@reddit

theyre kinda slow at releasing models. for example in fortnite theres a new season every 2 months, if alibaba research were as efficient we would have had qwen 6 by now

Reply

[-]

visionsmemories@reddit

youre correct about their benchmarks being slightly missleading, but cmon man, you get a sota open weights coder model for precisely 0.0$ and the first thing you do is complain? i mean you do you, whatever makes you happy

Reply

[-]

visionsmemories@reddit

your situation is unfortunate https://preview.redd.it/735qqvti9c0e1.png?width=286&format=png&auto=webp&s=be5227dfec0b75475536a3a81ebdf6584069a771 probably just use the 7b q4, or experiment with running 14b or even low quant 32b, though speeds will be quite low due to ram speed bottleneck

Reply

[-]

visionsmemories@reddit

problem is this seems really good on paper, but as for actual applications - there isnt an immediate advantage you get from it; so you just decide not to make it yourself and so do almost everyone else

Reply

[-]

visionsmemories@reddit

great, now benchmark them

Reply

[-]

visionsmemories@reddit

yup and pretty sure [https://github.com/exo-explore/exo](https://github.com/exo-explore/exo) can split moe models too

Reply

[-]

visionsmemories@reddit

i like that theres gpt5 and gpt 6

Reply

[-]

visionsmemories@reddit

This would be perfect for running on a 256gb m4 max wouldnt it? since its a moe with only 50b active params

Reply

[-]

visionsmemories@reddit

https://preview.redd.it/gdumlyfoy0zd1.png?width=1460&format=png&auto=webp&s=885e5d47595997e086fff715f552f940ddf4b5c3

Reply

[-]

visionsmemories@reddit

this is some fat ass model holy shit. that thing is massive. it is huge. it is very very big massive model

Reply

[-]

visionsmemories@reddit

sorry my bad forgot to press release button

Reply

[-]

visionsmemories@reddit

this is absolutely fucking amazing. decision tree X ai has so much potential its actually mindblowing! please share any projects related to this

Reply

[-]

visionsmemories@reddit

this is a really good one

Reply

[-]

visionsmemories@reddit

XERAKC - xkcd explained released after knowldege cutoff. Strong vouch!

Reply

[-]

visionsmemories@reddit (OP)

approved by some guy ™ https://preview.redd.it/q3pwyn8pl6yd1.png?width=1608&format=png&auto=webp&s=cb47a66090721d2b9e02ce2df1f1c2e13c1f7764

Reply

[-]

visionsmemories@reddit (OP)

YEAH imagine [this](https://www.youtube.com/watch?v=14XjtN6pBAE) but the game hes playing is a dreamlike hysterical aigen

Reply

[-]

visionsmemories@reddit (OP)

screw youtube i wanna see a horror game thats based on training data from idk liveleak. or a racing game made out of ironic tiktoks. or orr the possibilities are wild indeed

Reply

[-]

visionsmemories@reddit (OP)

💀💀💀💀 the comments on fire today

Reply

[-]

visionsmemories@reddit (OP)

nah just a frontend. you can also use [nitter.poast.org](http://nitter.poast.org)

Reply

[-]

visionsmemories@reddit (OP)

exactly. i have no intention to cancel twitter or whatever as i really enjoy using it, but the xcancel service is just so convenient

Reply

[-]

visionsmemories@reddit (OP)

bro no way im getting downvoted on localllama for the xcancel link 💀💀💀

Reply

[-]

visionsmemories@reddit (OP)

https://preview.redd.it/4cqgma6qd6yd1.png?width=1450&format=png&auto=webp&s=7613b3c667d4d6b43bf456c452563195b7f43c8f \> Gameplay is just the start. Soon, most of the internet will be AI-generated i mean, in many ways it aleady is, but i don't think anybody is truly ready for whats already happening source: [xcancel.com/Etched/status/1852089772329869436](http://xcancel.com/Etched/status/1852089772329869436)

Reply

[-]

visionsmemories@reddit

i wish they were using some of the finetuned modesl instead of just the mainstream ones haha

Reply

[-]

visionsmemories@reddit

also interested

Reply

[-]

visionsmemories@reddit (OP)

bro just test it dont look for the perfect solution because youll never know if its gonna be actually perfect for what youre trying to do

Reply

[-]

visionsmemories@reddit (OP)

yes and yes. go for 32b instruct in about q5

Reply

[-]

visionsmemories@reddit (OP)

idk what youre talking about this works really fucking well

Reply

[-]

visionsmemories@reddit (OP)

source: [https://www.ibm.com/new/ibm-granite-3-0-open-state-of-the-art-enterprise-models](https://www.ibm.com/new/ibm-granite-3-0-open-state-of-the-art-enterprise-models) nobody benchmarks against qwen2.5

Reply