ninjasaid13

>Gemini 3.1 Pro, Claude 4.7 Opus have the capacity to reason at a PhD level of a given field given the person doing the prompts is also highly skilled in the field to be able to give clear instructions and maybe provide grounding sources.

They know everything but understand nothing.

<thinking></thinking>

Posted by Comfortable-Rock-498@reddit | LocalLLaMA | View on Reddit | 89 comments

ninjasaid13@reddit

>Aren't LLMs already statistically smarter than a majority of humans?

At answering new questions, they are knowledgeable; at creating new questions, they are not.

<thinking></thinking>

Posted by Comfortable-Rock-498@reddit | LocalLLaMA | View on Reddit | 89 comments

ninjasaid13@reddit

AI mistakes and human mistakes are different in nature.

Best Local LLMs - Apr 2026

Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 365 comments

ninjasaid13@reddit

It's so censored, which is a bad quality for a creative writing / RP model to have.

Embracing the noise: How to build an agent that is both neuro-symbolic and probabilistic.

Posted by DepthOk4115@reddit | LocalLLaMA | View on Reddit | 10 comments

ninjasaid13@reddit

Maybe the neuro-symbolic part is part of the hardware and can't be turned into software.

Decreased Intelligence Density in DeepSeek V4 Pro

Posted by Mindless_Pain1860@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

DeepSeek V4.1, probably.

r/LocalLLaMa Rule Updates

Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 121 comments

ninjasaid13@reddit

Karma amount that can be gained within 5 minutes.

I made a tiny world model game that runs locally on iPad

Posted by howthefrondsfold@reddit | LocalLLaMA | View on Reddit | 27 comments

ninjasaid13@reddit

r/aigamedev

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering

Posted by RIPT1D3_Z@reddit | LocalLLaMA | View on Reddit | 120 comments

ninjasaid13@reddit

Beaten them at what?

Meta to open source versions of its next AI models

Posted by abkibaarnsit@reddit | LocalLLaMA | View on Reddit | 62 comments

ninjasaid13@reddit

I don't think Alexandr Wang has the experience.

Mistral AI to release Voxtral TTS, a 3-billion-parameter text-to-speech model with open weights that the company says outperformed ElevenLabs Flash v2.5 in human preference tests. The model runs on about 3 GB of RAM, achieves 90-millisecond time-to-first-audio, supports nine languages.

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 186 comments

ninjasaid13@reddit

Is it fine-tunable?

Introducing ARC-AGI-3

Posted by Complete-Sea6655@reddit | LocalLLaMA | View on Reddit | 100 comments

ninjasaid13@reddit

I think this will be harder as this takes learning into account.

Introducing ARC-AGI-3

Posted by Complete-Sea6655@reddit | LocalLLaMA | View on Reddit | 100 comments

ninjasaid13@reddit

>What I find interesting about AGI-3 is that it shifts the evaluation unit from 'can it solve this task' to 'how efficiently does it acquire the skill.' That's a much harder thing to fake. You can brute force a benchmark. You can't brute force a learning curve.

Exactly. I've had this idea for a benchmark for a while.

High school student seeking advice: Found an architectural breakthrough that scales a 17.6B model down to 417M?

Posted by Appropriate-Scar3116@reddit | LocalLLaMA | View on Reddit | 210 comments

ninjasaid13@reddit

Do you have any sources or evidence to support that claim? (Translated from English using Gemini)

Qwen3.5B VS the SOTA same size models from 2 years ago.

Posted by Uncle___Marty@reddit | LocalLLaMA | View on Reddit | 59 comments

ninjasaid13@reddit

0.777777778\* (these scores).

Qwen 2.5 -> 3 -> 3.5, smallest models. Incredible improvement over the generations.

Posted by airbus_a360_when@reddit | LocalLLaMA | View on Reddit | 136 comments

ninjasaid13@reddit

Qwen 3.5 lacks brevity.

Qwen3.5-397B-A17B-UD-TQ1 bench results FW Desktop Strix Halo 128GB

Posted by dabiggmoe2@reddit | LocalLLaMA | View on Reddit | 58 comments

ninjasaid13@reddit

>Qwen3.5-397B-A17B-UD-TQ1

https://preview.redd.it/2vjkv17ufglg1.png?width=1633&format=png&auto=webp&s=ba23ec946d34a5ea70be82399adc606d4c872ab8

meanwhile in China

Posted by Tiny_Judge_2119@reddit | LocalLLaMA | View on Reddit | 33 comments

ninjasaid13@reddit

>Good, and a new version which should match or exceed Seedance 2.0's capabilities should release within a month or two.

If it were within a month or two of release, it would have been demoed already.

How I mapped every High Court of Australia case and their citations (1901-2025)

Posted by Neon0asis@reddit | LocalLLaMA | View on Reddit | 6 comments

ninjasaid13@reddit

r/dataisbeautiful

Pack it up guys, open weight AI models running offline locally on PCs aren't real. 😞

Posted by CesarOverlorde@reddit | LocalLLaMA | View on Reddit | 294 comments

ninjasaid13@reddit

Anything to prevent poor people from touching it.

Anthropic is deploying 20M$ to support AI regulation in sight of 2026 elections

Posted by 1998marcom@reddit | LocalLLaMA | View on Reddit | 81 comments

ninjasaid13@reddit

An LLM isn't walking a non-technical user through anything if they lack the basic underlying technical knowledge; they wouldn't even know if the AI was "hallucinating" a dangerous or impossible step in a protocol. I don't think this has changed at all in several years. Research consistently shows that AI is an assistive tool: it doesn't grant tacit knowledge to the end user, even as the AI starts to become more knowledgeable about virology than experts. The internet provides a what, an LLM might provide a better what, but neither provides a how.

Anthropic is deploying 20M$ to support AI regulation in sight of 2026 elections

Posted by 1998marcom@reddit | LocalLLaMA | View on Reddit | 81 comments

ninjasaid13@reddit

>Genuinely these technologies post serious risks to society.

Society will walk on. They pose no risk.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering

Posted by RIPT1D3_Z@reddit | LocalLLaMA | View on Reddit | 120 comments

ninjasaid13@reddit

>7B model, down from 20B in v1, which is great news for local runners

If you're counting the text encoder, it would be 13B.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering

Posted by RIPT1D3_Z@reddit | LocalLLaMA | View on Reddit | 120 comments

ninjasaid13@reddit

I mean, Chinese is written as only one language despite being spoken as several.

I'm playing telephone pictionary with LLMs, VLMs, SDs, and Kokoro on my Strix Halo

Posted by jfowers_amd@reddit | LocalLLaMA | View on Reddit | 9 comments

ninjasaid13@reddit

How about multi-turn editing/generation for comics?

Unsloth just unleashed Glm 5! GGUF NOW!

Posted by RickyRickC137@reddit | LocalLLaMA | View on Reddit | 82 comments

ninjasaid13@reddit

Can I run this on my rtx 4070 laptop? lmk when you manage it k.

New stealth model: Pony Alpha

Posted by sirjoaco@reddit | LocalLLaMA | View on Reddit | 30 comments

ninjasaid13@reddit

I mean why say concerned?

New stealth model: Pony Alpha

Posted by sirjoaco@reddit | LocalLLaMA | View on Reddit | 30 comments

ninjasaid13@reddit

>I'm concerned it's the next Sonnet tbh

Is that good or bad? Why bad?

We built an 8B world model that beats 402B Llama 4 by generating web code instead of pixels — open weights on HF

Posted by jshin49@reddit | LocalLLaMA | View on Reddit | 46 comments

ninjasaid13@reddit

LeCun means something way different by "world model".

Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane

Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

Record a video of Genie, train a splat on it, then boom, you've got the same thing.

Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane

Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

>This is what makes Reddit what it is.

What makes Reddit what it is is breaking the rules to advertise a paid product?

Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane

Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

>And the point is that this does NOT uses triangles or vertices.

Nor does my paper.

>Papers aren't products

So the only difference is that it's a product? Is being a product supposed to be what makes it revolutionary?

Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane

Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

>Text-to-3D generates isolated meshes (triangles, vertices).

https://arxiv.org/abs/2311.13384

Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane

Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments

ninjasaid13@reddit

I'm not seeing what's special here; it's just like those text-to-3D models.

GLM-Image is released!

Posted by foldl-li@reddit | LocalLLaMA | View on Reddit | 82 comments

ninjasaid13@reddit

I don't think people really care about text at all. That shit could be done easily with simple programs.

The current state of sparse-MoE's for agentic coding work (Opinion)

Posted by ForsookComparison@reddit | LocalLLaMA | View on Reddit | 80 comments

ninjasaid13@reddit

Is there any that is smart and long-task oriented but bad at code?

Apple introduces SHARP, a model that generates a photorealistic 3D Gaussian representation from a single image in seconds.

Posted by themixtergames@reddit | LocalLLaMA | View on Reddit | 140 comments

ninjasaid13@reddit

Which part?

Apple introduces SHARP, a model that generates a photorealistic 3D Gaussian representation from a single image in seconds.

Posted by themixtergames@reddit | LocalLLaMA | View on Reddit | 140 comments

ninjasaid13@reddit

r/gaussiansplatting

Basketball AI with RF-DETR, SAM2, and SmolVLM2

Posted by RandomForests92@reddit | LocalLLaMA | View on Reddit | 48 comments

ninjasaid13@reddit

He has this code in another comment: https://www.reddit.com/r/LocalLLaMA/s/xIT3yN4DtX

WTF! Is this real? Teenagers are building AGI Research Lab

Posted by Illustrious-Yak-9195@reddit | LocalLLaMA | View on Reddit | 16 comments

ninjasaid13@reddit

This looks like r/singularity tech bro slop.

When you figure out it’s all just math:

Posted by Current-Ticket4214@reddit | LocalLLaMA | View on Reddit | 381 comments

ninjasaid13@reddit

Your brain uses electric charge and a calculator uses electric charge; does that mean that your brain is not in contradiction to a calculator?

>And we do know what is AGI, and its criterias

We do not have any besides defining it in terms of human intelligence.

>Ok tell me if it is not based predictive processing and attention processing?

This doesn't mean LLMs have human-like thinking. An LLM predicts the most probable next token by learning the statistics of language, while human thinking is not based on tokens or language at all. What humans predict is the state of the world.

If anyone tells you that LLMs have a world model or that video generators have a world model, they're sorely mistaken about what a world model is. This is what a real world model requires: https://en.wikipedia.org/wiki/Schema_(psychology)