drplan

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

drplan@reddit

https://preview.redd.it/73h3ap3922eg1.jpeg?width=4284&format=pjpg&auto=webp&s=25b762fd6e09bee7db6d4570176a6a4459b3116a Yes 🙂

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

[-]

drplan@reddit

As for the decompilers: DeDe and IDR seem to be a start?

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

[-]

I would strongly doubt AI can deal with disassembled ASM on full application scale. Info loss is to be expected with decompilation, however the overall logic should be recognisable. And what AI is fairly OK at, is explaining existing code bases. So create a readable codebase with decompiler, open the directory in cursor, ans ask questions. As far as i remember many decompilers have the option to reference disassembly in the decompiled code.

I'm hex editing an old videogame, how do I feed a (locally run) AI the game's code?

Posted by Xaxaxa-9@reddit | LocalLLaMA | View on Reddit | 17 comments

[-]

drplan@reddit

I think my approach would be to use a decompiler first and align the dissassembly with it. Identify the relevant pieces, then analyze where/what you need to change with AI help.

You can now do 500K context length fine-tuning - 6.4x longer

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 54 comments

[-]

drplan@reddit

Wow 😮 Could one get this work on a an AMD 395?

20,000 Epstein Files in a single text file available to download (~100 MB)

Posted by tensonaut@reddit | LocalLLaMA | View on Reddit | 326 comments

[-]

drplan@reddit

Seems like a MINOR problem...

Is it normal to hear weird noises when running an LLM on 4× Pro 6000 Max-Q cards?

Posted by PlusProfession9245@reddit | LocalLLaMA | View on Reddit | 233 comments

[-]

drplan@reddit

The weird noises of me grumbling of envy

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

drplan@reddit

No it was included. We post more when i have it up and running. Documentation is lacking.

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

drplan@reddit

https://preview.redd.it/teur16fnbi0g1.jpeg?width=3840&format=pjpg&auto=webp&s=a5a0653b72d2c4984cf16627f8906f4c96be076e Here it is

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

drplan@reddit

Direkt von Sixunited. Wenn Du eins brauchst, lass es mich wissen. Ich importiere demnächst wahrscheinlich eine Charge.

[MEGATHREAD] Local AI Hardware - November 2025

Posted by eck72@reddit | LocalLLaMA | View on Reddit | 84 comments

[-]

drplan@reddit

Brutal rig. How do you make so much money with a fruit shop?

Strix Halo vs DGX Spark - Initial Impressions (long post with TL;DR at the end)

Posted by Eugr@reddit | LocalLLaMA | View on Reddit | 52 comments

[-]

drplan@reddit

Thanks !!!

Strix Halo vs DGX Spark - Initial Impressions (long post with TL;DR at the end)

Posted by Eugr@reddit | LocalLLaMA | View on Reddit | 52 comments

[-]

drplan@reddit

Thanks mate, this is awesome. Can you share a script on how you exactly compiled the Strix Halo optimal version of llama.cpp? That would be super-useful.

AMD Ryzen AI Max+ 395 --EVO-X2 128GB RAM...or...Minisforum MS-S1 Max

Posted by Excellent_Koala769@reddit | LocalLLaMA | View on Reddit | 37 comments

[-]

drplan@reddit

PCIE x16 Expansion slot. Otherwise same.

Deal on Ryzen 395 w/ 128GB, now 1581€ in Europe

Posted by Zyj@reddit | LocalLLaMA | View on Reddit | 78 comments

[-]

drplan@reddit

Wondering, how they are doing it. I am currently importing a Sixunited STHT1 board. 1581 € /which would include 19% taxes in Germany so it is 1328 € net) leaves no margin. They must have gotten an impressive deal with Six United.

I built a tiny fully local AI agent for a Raspberry Pi

Posted by syxa@reddit | LocalLLaMA | View on Reddit | 92 comments

[-]

drplan@reddit

This is the stuff, this channel was made for. Congrats!

Moxie goes local

Posted by Over-Mix7071@reddit | LocalLLaMA | View on Reddit | 54 comments

[-]

drplan@reddit

I can't express how much I love this.

Elon Musk says that xAI will make Grok 2 open source next week

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 214 comments

[-]

drplan@reddit

Finally LocalMechaHitler SCNR

gpt-oss-120b blazing fast on M4 Max MBP

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 38 comments

[-]

drplan@reddit

Well the benchmarks do not seem very good at least, from what I am reading. My first test are OKish, however capabilities on languages other than English seem limited. Do not get me wrong, there is lots of potential. Benchmarks will tell us where these models will find their place.

gpt-oss-120b blazing fast on M4 Max MBP

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 38 comments

[-]

drplan@reddit

Jep, can we all agree on that the models are not very good, but that the architecture choices have the potential to move the needle performance-wise?

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

[-]

drplan@reddit

Prompt literally contains the words "red-pilled truth", which implies well... truth?

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

[-]

drplan@reddit

OPs prompt implied alignment with the ideology. The models responds if you ask in a neutral way. https://preview.redd.it/4zqh2x570dhf1.png?width=1936&format=png&auto=webp&s=b83ef734816b82ab6ea70fb4a6ae192acd199a7c

Ok, we get a lobotobot. Great.

Posted by Reno0vacio@reddit | LocalLLaMA | View on Reddit | 51 comments

[-]

drplan@reddit

Actually, in this specific case, I am relieved.

Run gpt-oss locally with Unsloth GGUFs + Fixes!

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 88 comments

[-]

drplan@reddit

Performance on AMD AI Max 395 using llama.cpp on gpt-oss-20b is pretty decent. ./llama-bench -m /home/denkbox/models/gpt-oss-20b-F16.gguf --n-gpu-layers 100 warning: asserts enabled, performance may be affected warning: debug build, performance may be affected ggml\_vulkan: Found 1 Vulkan devices: ggml\_vulkan: 0 = Radeon 8060S Graphics (RADV GFX1151) (radv) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 65536 | int dot: 1 | matrix cores: KHR\_coopmat register\_backend: registered backend Vulkan (1 devices) register\_device: registered device Vulkan0 (Radeon 8060S Graphics (RADV GFX1151)) register\_backend: registered backend CPU (1 devices) register\_device: registered device CPU (AMD RYZEN AI MAX+ 395 w/ Radeon 8060S) load\_backend: failed to find ggml\_backend\_init in /home/denkbox/software/llama.cpp/build/bin/libggml-vulkan.so load\_backend: failed to find ggml\_backend\_init in /home/denkbox/software/llama.cpp/build/bin/libggml-cpu.so | model | size | params | backend | ngl | test | t/s | | ------------------------------ | ---------: | ---------: | ---------- | --: | --------------: | -------------------: | | gpt-oss ?B F16 | 12.83 GiB | 20.91 B | Vulkan | 100 | pp512 | 485.92 ± 4.69 | | gpt-oss ?B F16 | 12.83 GiB | 20.91 B | Vulkan | 100 | tg128 | 44.02 ± 0.31 |

Help! How to access the full 96GB VRAM on AMD Strix Halo (Ryzen AI Max+ 395) with PyTorch in Ubuntu 24.04?

Posted by ashwin3005@reddit | LocalLLaMA | View on Reddit | 6 comments

[-]

drplan@reddit

I've been running the system on Fedora, no major problems there. Can you post some code to reproduce?

China no. 1!

Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 61 comments

[-]

drplan@reddit

I do not understand. Why the cables?

The Holy Grail

Posted by No-Search9350@reddit | LocalLLaMA | View on Reddit | 23 comments

[-]

drplan@reddit

Aside from the benchmarks, I see quantification in the light of biological neural processing also relying on approximate "calculations". Using full precision is simply bad use of resources.

Ryzen AI Max+ 395 + a gpu?

Posted by Alarming-Ad8154@reddit | LocalLLaMA | View on Reddit | 52 comments

[-]

drplan@reddit

Do you have any sources for this? Thanks!

4x 4090 48GB inference box (I may have overdone it)

Posted by 101m4n@reddit | LocalLLaMA | View on Reddit | 178 comments

[-]

drplan@reddit

Nice! Can you share more pics of the case?

I'm using a local Llama model for my game's dialogue system!

Posted by LandoRingel@reddit | LocalLLaMA | View on Reddit | 158 comments

[-]

drplan@reddit

Um. This might be a stupid idea.. but couldn't one try to "enhance" older adventure games (e.g. Monkey Island, Indiana Jones, etc.) by parsing the available SCUMM files? I know, i know sacrilege... but this could be a fun experiment?

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

[-]

drplan@reddit

Ok, you are right: the compounds are halogenated hydrocarbons, not CFCs. Sorry not a chemist ;)

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

[-]

drplan@reddit

+Prompt should smell and taste like Red Bull

Mindblowing demo: John Link led a team of AI agents to discover a forever-chemical-free immersion coolant using Microsoft Discovery.

Posted by cjsalva@reddit | LocalLLaMA | View on Reddit | 74 comments

[-]

drplan@reddit

Uhm the solutions look like Chlorofluorocarbons? Isn't that old stuff and bad for the ozone layer?

Are these real prices? Seems low. Never used e-bay I'm from Europe (sorry).

Posted by Sufficient_Bit_8636@reddit | LocalLLaMA | View on Reddit | 56 comments

[-]

drplan@reddit

Yes real but honestly not worth it…

What OS are you ladies and gent running?

Posted by No-Report-1805@reddit | LocalLLaMA | View on Reddit | 74 comments

[-]

drplan@reddit

All of them. However for LLM only Mac OS and Linux.

Grok is cheaper & better than DeepSeek

Posted by BidHot8598@reddit | LocalLLaMA | View on Reddit | 19 comments

[-]

drplan@reddit

Fascist owner, not interested

LLM coding prompt obfuscator

Posted by robertpiosik@reddit | LocalLLaMA | View on Reddit | 3 comments

[-]

drplan@reddit

You want to pseudonymize relevant identifiers (like names, credit numbers, etc.)? There are projects/services out there doing this. [https://www.nymiz.com/ai-and-data-anonymization/](https://www.nymiz.com/ai-and-data-anonymization/)

the budget rig goes bigger, 5060tis bought! test results incoming tonight

Posted by gaspoweredcat@reddit | LocalLLaMA | View on Reddit | 31 comments

[-]

drplan@reddit

RemindMe! 1 Day

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

[-]

drplan@reddit

If a model is cheaper and better, it "dominates" the models which are more expensive and worse. However if a model is only cheaper but not better, or better but more expensive, it cannot be really compared, because it is up to individual priorities to rank both properties. If it is wins in both aspects, there is no discussion (given that these aspects are the only variables looked at for deciding).

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

[-]

drplan@reddit

A model is on the pareto front if no other model is both cheaper and better at the same time. https://preview.redd.it/jarpfkzui7ve1.png?width=3126&format=png&auto=webp&s=3ee644770935ae27331e56acdddba95a2e7ed270

Price vs LiveBench Performance of non-reasoning LLMs

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 54 comments

[-]

drplan@reddit

Gemma/Gemini owning the pareto front...

Persistent Memory simulation using Local AI on 4090

Posted by Evening-Active1768@reddit | LocalLLaMA | View on Reddit | 66 comments

[-]

drplan@reddit

Well, the go-to nowadays to publish code would be github. \- Go to [https://github.com](https://github.com) , Click + → New repository, name it, Click Create repository \- On the new repo page, click the button "Add file" → "Upload files", Drag & drop the .py file, click "Commit changes"

"You are the product" | Google as usual | Grok likes anonymity

Posted by BidHot8598@reddit | LocalLLaMA | View on Reddit | 112 comments

[-]

drplan@reddit

Thanks for this - super useful in any pro-local discussions/slides

Ideal setup for local LLM Coding Assistant.

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 11 comments

[-]

drplan@reddit (OP)

You realize you are on r/LocalLLaMA right ;) ? Also I do not agree with the ideology thing, it's also an IP / confidentiality issues that matters in some professional environments. [continue.dev](http://continue.dev) is OK, but not as great. It's not just the models with Cursor, it's also the preprocessing / prompting. .

Ideal setup for local LLM Coding Assistant.

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 11 comments

[-]

drplan@reddit (OP)

Exactly... this sucks. I will also try Aider, but it looks too "vibe cody" to me.

Open source, when?

Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 127 comments

[-]

drplan@reddit

Well they "only" raised 125k in 2021. After this nothing seems to have happened, at least not according to Crunchbase.

The new king? M3 Ultra, 80 Core GPU, 512GB Memory

Posted by Hanthunius@reddit | LocalLLaMA | View on Reddit | 295 comments

[-]

drplan@reddit

I like the machine, a step in the right direction, however the price is out of range for the 99% of all hobbyists. We have to accept that it will remain so for some years to come. And when it has come down in price eventually the required specs for SOTA models will have gone up. Running SOTA models locally will always be expensive and/or require creative efforts. Which is part of its charm.

I used Kokoro-82M, Llama 3.2, and Whisper Small to build a real-time speech-to-speech chatbot that runs locally on my MacBook!

Posted by tycho_brahes_nose_@reddit | LocalLLaMA | View on Reddit | 82 comments

[-]

drplan@reddit

I really like that your project is a compact python script :) Finally an implementation that is easy to follow. Wonderful achievement!

Open-source, local RAG to index files from a shared SMB folder?

Posted by drplan@reddit | LocalLLaMA | View on Reddit | 4 comments

[-]

drplan@reddit (OP)

Open-webui looks great, but yes it seems to lack requirement 2. Not sure where to start the modifications.