ambient_temp_xeno

i dedicate this meme to you r/LocalLLaMA

Posted by LPFchan@reddit | LocalLLaMA | View on Reddit | 44 comments

MiniMax M3 is dope

Posted by FBIFreezeNow@reddit | LocalLLaMA | View on Reddit | 20 comments

Add EXAONE 4.5 implementations by nuxlear · Pull Request #21733 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 2 comments

MiniMax M3 is dope

Posted by FBIFreezeNow@reddit | LocalLLaMA | View on Reddit | 20 comments

(YT) PewDiePie released his harness/webui

Posted by Dany0@reddit | LocalLLaMA | View on Reddit | 450 comments

DIY Local 2x DGX Spark cluster cooler with automatic temperature controlled fan.

Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 6 comments

My home data center

Posted by alecKarfonta@reddit | LocalLLaMA | View on Reddit | 86 comments

Anti-AI people will hate you for keeping AI open.

Posted by Equal_Giraffe8866@reddit | LocalLLaMA | View on Reddit | 92 comments

ambient_temp_xeno@reddit

Nobody takes any notice of the anti-ai people anyway. Just hope that the *Days of their Lives level* drama between the open source inference projects doesn't eventually collapse it all.

Are there more easy techniques than --tensor-split to fill VRAM in llama.cpp?

Posted by GregoryfromtheHood@reddit | LocalLLaMA | View on Reddit | 19 comments

Is he crazy to say that?

Posted by pmv143@reddit | LocalLLaMA | View on Reddit | 203 comments

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS

Posted by MironV@reddit | LocalLLaMA | View on Reddit | 18 comments

Behold! Probably the most ghetto local AI server:

Posted by MackThax@reddit | LocalLLaMA | View on Reddit | 301 comments

Why are the AI Companies spreading F.U.D. about AI?

Posted by supracode@reddit | LocalLLaMA | View on Reddit | 57 comments

Went to the monthly AI dev meetup

Posted by nathandreamfast@reddit | LocalLLaMA | View on Reddit | 38 comments

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month

Posted by prselzh@reddit | LocalLLaMA | View on Reddit | 43 comments

Server build for local inference. 128 gb 3200 or 256 gb 2133mhz RAM?

Posted by PreparationTrue9138@reddit | LocalLLaMA | View on Reddit | 31 comments

The Financial Times has published an article about Heretic

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 218 comments

The Financial Times has published an article about Heretic

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 218 comments

ambient_temp_xeno@reddit

One way of looking at that is you've already gone wrong by releasing abliterated models and/or the tools to do it with your name attached. Obviously there are ways to make it sound worse, they were probably hoping for some comment on what people might do with them. Dzzzzt no.

The Financial Times has published an article about Heretic

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 218 comments

ambient_temp_xeno@reddit

I think it's just about worth observing that the FT is from England, where you can easily fall afoul of the law by badly drawing something obscene with a pencil or writing scary things in your own diary.

The Financial Times has published an article about Heretic

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 218 comments

OSCAR RotationZoo - Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 8 comments

ambient_temp_xeno@reddit

I don't believe so: *All OSCAR parameters are estimated once from a small MMLU-style calibration set. For each model, we run one calibration pass and dump per-layer Q, K, V activations (8878 tokens × number of layers), from which we compute the key/value rotations and per-layer clipping thresholds, then reuse the same parameters for all benchmarks. No task-specific calibration is used.*

Next year we're getting 0.5T model from Grok

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 200 comments

Next year we're getting 0.5T model from Grok

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 200 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

ambient_temp_xeno@reddit

I just think it's funny when people turn up and then go away again because they watched something on youtube. Don't take everything so personally and get mad about things.

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

ambient_temp_xeno@reddit

I see. As a non-coder I'm just glad I can get them to make anything at all instead of hoping there's some abandoned github project. Needless to say this is only for non-internet facing stuff.

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

What would 2x RTX 3060 12GB get me?

Posted by ObjectiveActuator8@reddit | LocalLLaMA | View on Reddit | 64 comments

ambient_temp_xeno@reddit

>I mention wanting 2 cards instead of one for the experience of running multiple GPUs. There's not much to it. If you needed to run multiple cards in future it wouldn't take you long to get it running.

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Have we passed the peak of inflated expectations?

Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 158 comments

Gemma is so much better than Qwen, prove me wrong

Posted by Mountain_Patience231@reddit | LocalLLaMA | View on Reddit | 62 comments

DRAM relief calendar

Posted by Terminator857@reddit | LocalLLaMA | View on Reddit | 51 comments

DRAM relief calendar

Posted by Terminator857@reddit | LocalLLaMA | View on Reddit | 51 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

Posted by Ok-Awareness9993@reddit | LocalLLaMA | View on Reddit | 144 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

Heretic has been served a legal notice by Meta, Inc.

Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 349 comments

Gemma 4 thinks I'm gaslighting it when I talk about Gemma 4 line of models

Posted by Jorlen@reddit | LocalLLaMA | View on Reddit | 14 comments

ambient_temp_xeno@reddit

My own tinfoil theory for this is that they did a lot of distilling from Gemini during training but because they have access to the system prompt of Gemini they gave it specific instructions to not say it was Gemini in any outputs.

HF flagged safetensors as unsafe? wtf?

Posted by No_Afternoon_4260@reddit | LocalLLaMA | View on Reddit | 5 comments

Re. what ever happened to Cohere’s Command-A series of models?

Posted by nick_frosst@reddit | LocalLLaMA | View on Reddit | 102 comments

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

Posted by Ok-Awareness9993@reddit | LocalLLaMA | View on Reddit | 144 comments

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

Posted by Ok-Awareness9993@reddit | LocalLLaMA | View on Reddit | 144 comments

What happens to local LLM if/when LLMs are no longer released for free?

Posted by JohnBooty@reddit | LocalLLaMA | View on Reddit | 238 comments

ambient_temp_xeno@reddit

The knowledge cut-off will not be that big of problem compared to the models just being outdated in terms of brains. Just look it as a glass half full: we could've ended up with just the Llama 1 leaks in another timeline.