The Scariest Thing In LLMs/AI Isn't the Models or the Math... It's the Names.

Normal-Ad-7114@reddit

I have to try and paste these into Midjourney and see what it comes up with

ortegaalfredo@reddit

NEMOTRON-45B-ABLITERATED sounds like something that wants to fight Optimus Prime.

Marksta@reddit

So uhh, REMATEPIALIZATION is definitely a made up word right? 😂

XMasterrrr@reddit (OP)

[Dynamic Tensor Rematerialization](https://arxiv.org/abs/2006.09616). No idea what it's about, and too scared to look 😂

wektor420@reddit

Instead of saving all n layer activations for backpropagation, you save only every sqrt(n)-th layer and recompute the missing ones as you need them. TL;DR: memory drops from n to about 2 sqrt(n), at the cost of roughly one extra forward pass of compute.
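
For anyone who wants to see that trade-off in code, here is a minimal sketch of the fixed sqrt(n) checkpointing scheme described above. It is not the dynamic, runtime eviction heuristic from the linked paper. The toy scalar `tanh` layers and the names `layer_forward`, `layer_grad`, `backward_store_all`, and `backward_checkpointed` are assumptions made up for illustration, not any library's API.

```python
import math


def layer_forward(w, x):
    # Hypothetical toy layer: y = tanh(w * x) with a fixed scalar weight w.
    return math.tanh(w * x)


def layer_grad(w, x):
    # dy/dx for the toy layer: w * (1 - tanh(w * x)^2).
    return w * (1.0 - math.tanh(w * x) ** 2)


def backward_store_all(weights, x0):
    """Baseline: keep the input of every layer, n activations in total."""
    inputs = [x0]
    for w in weights[:-1]:
        inputs.append(layer_forward(w, inputs[-1]))
    grad = 1.0
    for w, x in zip(reversed(weights), reversed(inputs)):
        grad *= layer_grad(w, x)  # chain rule, last layer first
    return grad, len(inputs)


def backward_checkpointed(weights, x0):
    """Keep every k-th layer input (k ~ sqrt(n)); recompute the rest per segment."""
    k = max(1, math.isqrt(len(weights)))
    checkpoints = {}
    x = x0
    for i, w in enumerate(weights):
        if i % k == 0:
            checkpoints[i] = x  # saved during the forward pass
        x = layer_forward(w, x)

    grad, peak = 1.0, 0
    for start in sorted(checkpoints, reverse=True):
        segment = weights[start:start + k]
        inputs = [checkpoints[start]]
        for w in segment[:-1]:
            inputs.append(layer_forward(w, inputs[-1]))  # recomputation
        # Live activations: all checkpoints plus this segment's recomputed inputs.
        peak = max(peak, len(checkpoints) + len(inputs) - 1)
        for w, xi in zip(reversed(segment), reversed(inputs)):
            grad *= layer_grad(w, xi)
    return grad, peak
```

With 16 layers, both functions return the same gradient, but the baseline holds 16 activations while the checkpointed version peaks at 7 (4 checkpoints plus 3 recomputed inputs). The price is that each segment's forward pass runs a second time.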

ChomsGP@reddit

haha no joke, between the ominous names and the excessive use of emojis the machine uprising is going to be interesting 🤣

segmond@reddit

don't care, we don't go to X. if you want to be a part of local LLM, post things relevant to local LLMs. This is not it.

Limp_Classroom_2645@reddit

it has pretty cool visuals tho, i'll upvote it. mind telling us how you generated the visuals, OP?

Dr_Ambiorix@reddit

The yellow/brownish/orange tint of it all reeks of ChatGPT's 4o image generator. I don't mind AI-generated images, but I hate it when they have a quirk that I can't unsee once I've seen it...

Inaeipathy@reddit

Mind absolutely broken.

Evening_Ad6637@reddit

Geez, Reddit is the same centralized shit just fyi

Inevitable_Ad3676@reddit

The funny tag exists, and the jokes are about stuff that local LLM enthusiasts can interact with. I'd say it's relevant.

XMasterrrr@reddit (OP)

???

Euchale@reddit

The scariest one is "Download Torch"

dacevnim@reddit

and then there is BLT (dynamic byte latent transformer)

Lissanro@reddit

Perhaps for the last panel, `Oobabooga` would be a good fit, to control them all.

yukiarimo@reddit

RUUUUUN

Sidran@reddit

FOR THE CHOPPER???!?!?!

Linkpharm2@reddit

Smegmma 2 27b

Ardalok@reddit

https://preview.redd.it/frsft9m7gl0f1.png?width=471&format=png&auto=webp&s=4f0f424726d2587a32db5474cc292f42ce9645ad

XMasterrrr@reddit (OP)

Hey guys, I haven't posted here in a while, I've been a lot more active over on X, especially since the LLM research scene is much more alive there. Just wanted to cross-post this here as well. I’m the original author of this on [X](https://x.com/TheAhmadOsman/status/1922336545719107759).

> the scariest thing in llms/ai isn't the models or the math
>
> it's the names
>
> > kv cache prefill strategy
> >
> > multi-head attention with rotary position embeddings
> >
> > fused CUDA kernel for dynamic tensor rematerialization
> >
> > nucleus sampling with temperature scaling and repetition penalty
> >
> > flash attention v2 with block-sparse operations, causal masking, and warp-level primitives
> >
> > bro they sound like boss fights frfr

XMasterrrr@reddit (OP)

Guys, I don't understand the downvotes. I literally copied the entire tweet over here so nobody has to click on anything 😅 I'm also sharing my finding that there is an active research community over there, so that people know to keep their eyes open. I'm not advocating for a platform, just sharing something I think is genuinely helpful to the collective knowledge of this community.
