drplan

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

drplan@reddit

https://preview.redd.it/73h3ap3922eg1.jpeg?width=4284&format=pjpg&auto=webp&s=25b762fd6e09bee7db6d4570176a6a4459b3116a Yes 🙂

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

drplan@reddit

I would strongly doubt AI can deal with disassembled ASM on full application scale. Info loss is to be expected with decompilation, however the overall logic should be recognisable. And what AI is fairly OK at, is explaining existing code bases. So create a readable codebase with decompiler, open the directory in cursor, ans ask questions. As far as i remember many decompilers have the option to reference disassembly in the decompiled code.

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

drplan@reddit

I think my approach would be to use a decompiler first and align the dissassembly with it. Identify the relevant pieces, then analyze where/what you need to change with AI help.

You can now do 500K context length fine-tuning - 6.4x longer

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 54 comments

20,000 Epstein Files in a single text file available to download (~100 MB)

Posted by tensonaut@reddit | LocalLLaMA | View on Reddit | 326 comments

Is it normal to hear weird noises when running an LLM on 4× Pro 6000 Max-Q cards?

Posted by PlusProfession9245@reddit | LocalLLaMA | View on Reddit | 233 comments

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

drplan@reddit

https://preview.redd.it/teur16fnbi0g1.jpeg?width=3840&format=pjpg&auto=webp&s=a5a0653b72d2c4984cf16627f8906f4c96be076e Here it is

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[MEGATHREAD] Local AI Hardware - November 2025

Posted by eck72@reddit | LocalLLaMA | View on Reddit | 84 comments

Strix Halo vs DGX Spark - Initial Impressions (long post with TL;DR at the end)

Posted by Eugr@reddit | LocalLLaMA | View on Reddit | 52 comments

Strix Halo vs DGX Spark - Initial Impressions (long post with TL;DR at the end)

Posted by Eugr@reddit | LocalLLaMA | View on Reddit | 52 comments

drplan@reddit

Thanks mate, this is awesome. Can you share a script on how you exactly compiled the Strix Halo optimal version of llama.cpp? That would be super-useful.

AMD Ryzen AI Max+ 395 --EVO-X2 128GB RAM...or...Minisforum MS-S1 Max

Posted by Excellent_Koala769@reddit | LocalLLaMA | View on Reddit | 37 comments

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

drplan@reddit

Wondering, how they are doing it. I am currently importing a Sixunited STHT1 board. 1581 € /which would include 19% taxes in Germany so it is 1328 € net) leaves no margin. They must have gotten an impressive deal with Six United.

I built a tiny fully local AI agent for a Raspberry Pi

Posted by syxa@reddit | LocalLLaMA | View on Reddit | 92 comments

Moxie goes local

Posted by Over-Mix7071@reddit | LocalLLaMA | View on Reddit | 54 comments

Elon Musk says that xAI will make Grok 2 open source next week

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 214 comments

gpt-oss-120b blazing fast on M4 Max MBP

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 38 comments

drplan@reddit

Well the benchmarks do not seem very good at least, from what I am reading. My first test are OKish, however capabilities on languages other than English seem limited. Do not get me wrong, there is lots of potential. Benchmarks will tell us where these models will find their place.

gpt-oss-120b blazing fast on M4 Max MBP

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 38 comments

drplan@reddit

Jep, can we all agree on that the models are not very good, but that the architecture choices have the potential to move the needle performance-wise?

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

drplan@reddit

OPs prompt implied alignment with the ideology. The models responds if you ask in a neutral way. https://preview.redd.it/4zqh2x570dhf1.png?width=1936&format=png&auto=webp&s=b83ef734816b82ab6ea70fb4a6ae192acd199a7c

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

Run gpt-oss locally with Unsloth GGUFs + Fixes!

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 88 comments

drplan@reddit

Performance on AMD AI Max 395 using llama.cpp on gpt-oss-20b is pretty decent. ./llama-bench -m /home/denkbox/models/gpt-oss-20b-F16.gguf --n-gpu-layers 100 warning: asserts enabled, performance may be affected warning: debug build, performance may be affected ggml\_vulkan: Found 1 Vulkan devices: ggml\_vulkan: 0 = Radeon 8060S Graphics (RADV GFX1151) (radv) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 65536 | int dot: 1 | matrix cores: KHR\_coopmat register\_backend: registered backend Vulkan (1 devices) register\_device: registered device Vulkan0 (Radeon 8060S Graphics (RADV GFX1151)) register\_backend: registered backend CPU (1 devices) register\_device: registered device CPU (AMD RYZEN AI MAX+ 395 w/ Radeon 8060S) load\_backend: failed to find ggml\_backend\_init in /home/denkbox/software/llama.cpp/build/bin/libggml-vulkan.so load\_backend: failed to find ggml\_backend\_init in /home/denkbox/software/llama.cpp/build/bin/libggml-cpu.so | model                          |       size |     params | backend    | ngl |            test |                  t/s | | ------------------------------ | ---------: | ---------: | ---------- | --: | --------------: | -------------------: | | gpt-oss ?B F16                 |  12.83 GiB |    20.91 B | Vulkan     | 100 |           pp512 |        485.92 ± 4.69 | | gpt-oss ?B F16                 |  12.83 GiB |    20.91 B | Vulkan     | 100 |           tg128 |         44.02 ± 0.31 |

Help! How to access the full 96GB VRAM on AMD Strix Halo (Ryzen AI Max+ 395) with PyTorch in Ubuntu 24.04?

Posted by ashwin3005@reddit | LocalLLaMA | View on Reddit | 6 comments

China no. 1!

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 61 comments

The Holy Grail

Posted by No-Search9350@reddit | LocalLLaMA | View on Reddit | 23 comments

drplan@reddit

Aside from the benchmarks, I see quantification in the light of biological neural processing also relying on approximate "calculations". Using full precision is simply bad use of resources.

Ryzen AI Max+ 395 + a gpu?

Posted by Alarming-Ad8154@reddit | LocalLLaMA | View on Reddit | 52 comments

4x 4090 48GB inference box (I may have overdone it)

Posted by 101m4n@reddit | LocalLLaMA | View on Reddit | 178 comments

I'm using a local Llama model for my game's dialogue system!

Posted by LandoRingel@reddit | LocalLLaMA | View on Reddit | 158 comments

drplan@reddit

Um. This might be a stupid idea.. but couldn't one try to "enhance" older adventure games (e.g. Monkey Island, Indiana Jones, etc.) by parsing the available SCUMM files? I know, i know sacrilege... but this could be a fun experiment?

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

Are these real prices? Seems low. Never used e-bay I'm from Europe (sorry).

Posted by Sufficient_Bit_8636@reddit | LocalLLaMA | View on Reddit | 56 comments

What OS are you ladies and gent running?

Posted by No-Report-1805@reddit | LocalLLaMA | View on Reddit | 74 comments

Grok is cheaper & better than DeepSeek

Posted by BidHot8598@reddit | LocalLLaMA | View on Reddit | 19 comments

LLM coding prompt obfuscator

Posted by robertpiosik@reddit | LocalLLaMA | View on Reddit | 3 comments

drplan@reddit

You want to pseudonymize relevant identifiers (like names, credit numbers, etc.)? There are projects/services out there doing this. [https://www.nymiz.com/ai-and-data-anonymization/](https://www.nymiz.com/ai-and-data-anonymization/)

the budget rig goes bigger, 5060tis bought! test results incoming tonight

Posted by gaspoweredcat@reddit | LocalLLaMA | View on Reddit | 31 comments

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

drplan@reddit

If a model is cheaper and better, it "dominates" the models which are more expensive and worse. However if a model is only cheaper but not better, or better but more expensive, it cannot be really compared, because it is up to individual priorities to rank both properties. If it is wins in both aspects, there is no discussion (given that these aspects are the only variables looked at for deciding).

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

drplan@reddit

A model is on the pareto front if no other model is both cheaper and better at the same time. https://preview.redd.it/jarpfkzui7ve1.png?width=3126&format=png&auto=webp&s=3ee644770935ae27331e56acdddba95a2e7ed270

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

Persistent Memory simulation using Local AI on 4090

Posted by Evening-Active1768@reddit | LocalLLaMA | View on Reddit | 66 comments

drplan@reddit

Well, the go-to nowadays to publish code would be github. \- Go to [https://github.com](https://github.com) , Click + → New repository, name it, Click Create repository \- On the new repo page, click the button "Add file" → "Upload files", Drag & drop the .py file, click "Commit changes"

"You are the product" | Google as usual | Grok likes anonymity

Posted by BidHot8598@reddit | LocalLLaMA | View on Reddit | 112 comments

Ideal setup for local LLM Coding Assistant.

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 11 comments

drplan@reddit (OP)

You realize you are on r/LocalLLaMA right ;) ? Also I do not agree with the ideology thing, it's also an IP / confidentiality issues that matters in some professional environments. [continue.dev](http://continue.dev) is OK, but not as great. It's not just the models with Cursor, it's also the preprocessing / prompting. .

Ideal setup for local LLM Coding Assistant.

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 11 comments

Open source, when?

Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 127 comments

The new king? M3 Ultra, 80 Core GPU, 512GB Memory

Posted by Hanthunius@reddit | LocalLLaMA | View on Reddit | 295 comments

drplan@reddit

I like the machine, a step in the right direction, however the price is out of range for the 99% of all hobbyists. We have to accept that it will remain so for some years to come. And when it has come down in price eventually the required specs for SOTA models will have gone up. Running SOTA models locally will always be expensive and/or require creative efforts. Which is part of its charm.

I used Kokoro-82M, Llama 3.2, and Whisper Small to build a real-time speech-to-speech chatbot that runs locally on my MacBook!

Posted by tycho_brahes_nose_@reddit | LocalLLaMA | View on Reddit | 82 comments

drplan@reddit

I really like that your project is a compact python script :) Finally an implementation that is easy to follow. Wonderful achievement!

Open-source, local RAG to index files from a shared SMB folder?

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 4 comments