charmander_cha

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!

Posted by Mysterious_Finish543@reddit | LocalLLaMA | View on Reddit | 109 comments

What's the status of non-CUDA inference?

Posted by IngwiePhoenix@reddit | LocalLLaMA | View on Reddit | 30 comments

charmander_cha@reddit

For text I use Vulkan. With ComfyUI I've had no problems on ROCm using an RX 7600 XT. As for TTS, I just don't use the newer models because they don't support my native language.

StepFun 3.7 Flash

Posted by Everlier@reddit | LocalLLaMA | View on Reddit | 151 comments

Looks like Miminax-M3 is just around the corner

Posted by OnkelBB@reddit | LocalLLaMA | View on Reddit | 40 comments

charmander_cha@reddit

I hope they incorporate some of DeepSeek's pricing improvements.

[NEW] Supra-50M Released!

Posted by Dangerous_Try3619@reddit | LocalLLaMA | View on Reddit | 60 comments

Waiting on Qwen to drop those 3.7 models be like:

Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 47 comments

Carbon: Decoding the Language of Life

Posted by loubnabnl@reddit | LocalLLaMA | View on Reddit | 49 comments

I heard about this model a few days ago. I don't understand this technology; I just wanted to know whether it relates in some way to your technology, not in terms of what they do but in terms of how each approach understands an LLM and what it actually does with textual representations of information.

Rewrite Bun in Rust has been merged

Posted by gruenistblau@reddit | programming | View on Reddit | 412 comments

charmander_cha@reddit

It will probably be done someday. The models understand TypeScript and Python better; it's more a matter of being able to deliver quickly.

GrapheneOS: Google's Play Integrity API requires hardware attestation ... Apple already has it as a requirement. Over the long term, this will increasingly lock out hardware and OS competition.

Posted by TheTwelveYearOld@reddit | linux | View on Reddit | 245 comments

charmander_cha@reddit

They're going to force the world to use Chinese options, hahahaha. AMEN.

Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding ?

Posted by QuchchenEbrithin2day@reddit | LocalLLaMA | View on Reddit | 54 comments

charmander_cha@reddit

I want the 3.6 version at 9B. It would be incredible.

Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding ?

Posted by QuchchenEbrithin2day@reddit | LocalLLaMA | View on Reddit | 54 comments

charmander_cha@reddit

I use it, it's really solid.

You can now read Gemma 3's mind

Posted by DigiDecode_@reddit | LocalLLaMA | View on Reddit | 20 comments

charmander_cha@reddit

This research is cool. The problem is that I wanted to study the Chinese models; the American ones matter little to me, and besides being few, they're always the small ones. If there were a way to study the 9B or 27B models locally, that would be incredible. This was probably released to "compete" for visibility with Qwen Scope.

HOT TAKE: local models + agent harnesses are now capable enough to hand off junior-level IT professional tasks to [human written]

Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 67 comments

charmander_cha@reddit

I didn't know this was a controversial opinion, but that's not what will kill junior roles. They're already dead, because capitalism tends toward monopoly, and under monopoly there can't be jobs for everyone.

Introducing SubQ: The First Fully Subquadratic LLM

Posted by hltt@reddit | LocalLLaMA | View on Reddit | 6 comments

charmander_cha@reddit

If this isn't local, why was it posted here? These ads should just be deleted.

Bun is being rewritten to Rust

Posted by aabbdev@reddit | programming | View on Reddit | 152 comments

charmander_cha@reddit

Simple: it would be slow.

Open source models are going to be the future on Cursor, OpenCode etc.

Posted by _maverick98@reddit | LocalLLaMA | View on Reddit | 148 comments

charmander_cha@reddit

China is the future.

Qwen 3.6 seems to have a lot of trouble with tool calling

Posted by Perfect-Campaign9551@reddit | LocalLLaMA | View on Reddit | 58 comments

charmander_cha@reddit

But where's the info?

Qwen Models are such good models?

Posted by FeiX7@reddit | LocalLLaMA | View on Reddit | 27 comments

charmander_cha@reddit

If a 3.6 version of the 9B exists, we'll probably have the best and most democratic local model.

Meta’s $2 billion Manus acquisition blocked by China.

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 97 comments

charmander_cha@reddit

Big tech companies are literally involved in genocides. On one side we have ethnocentrism, which is common to humanity. On the other, a big tech company that has led to massacres and the fall of governments, with the aim of leaving the population socially worse off. Maybe it's time for you to look into Meta's crimes in Africa and assess whether your comparison, first of all, makes sense or not.

Meta’s $2 billion Manus acquisition blocked by China.

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 97 comments

charmander_cha@reddit

Rightly so. You don't allow Nazis to own property of any kind in your country. Simple as that.

Opencode-power-pack – Claude Code skills ported to OpenCode

Posted by waybarrios@reddit | LocalLLaMA | View on Reddit | 8 comments

charmander_cha@reddit

It's time we started investing money in the right APIs so that, in an organized way, we can act against big tech and implement this mountain of papers. Could the community standardize an efficient method for implementing a paper (as far as possible)?

Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6

Posted by dionysio211@reddit | LocalLLaMA | View on Reddit | 177 comments

charmander_cha@reddit

I hope so. I have friends poorer than me who want to join in the fun.

Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried

Posted by fagenorn@reddit | LocalLLaMA | View on Reddit | 117 comments

charmander_cha@reddit

Sad. People need to understand that local AI only exists if it's truly accessible.

US gov memo on “adversarial distillation” - are we heading toward tighter controls on open models?

Posted by MLExpert000@reddit | LocalLLaMA | View on Reddit | 399 comments

charmander_cha@reddit

The imperialist Nazis don't like being challenged. Fuck the empire.

Qwen-3.6-27B, llamacpp, speculative decoding - appreciation post

Posted by Then-Topic8766@reddit | LocalLLaMA | View on Reddit | 95 comments

charmander_cha@reddit

I think you should title your post something like: increasing tok/s using ngram.

Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried

Posted by fagenorn@reddit | LocalLLaMA | View on Reddit | 117 comments

charmander_cha@reddit

Does it work with Vulkan or ROCm? Would anyone know?

I don’t believe this benchmark 27b size model next opus 4.5! Anyone can confirm testing with real agentic workflow?

Posted by Wonderful-Ad-5952@reddit | LocalLLaMA | View on Reddit | 50 comments

charmander_cha@reddit

Start asking better questions. What matters is: how much does it degrade at 2- or 3-bit quantization?

MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions. MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades

Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 5 comments

charmander_cha@reddit

Incredible.

Qwen 3.6 27B is out

Posted by NoConcert8847@reddit | LocalLLaMA | View on Reddit | 609 comments

charmander_cha@reddit

We urgently need the 9B version.

llama.cpp is the linux of llm

Posted by DevelopmentBorn3978@reddit | LocalLLaMA | View on Reddit | 96 comments

charmander_cha@reddit

Anything compared to Windows sounds like an insult.

Why doesn't any OSS tool treat llama.cpp as a first class citizen?

Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 120 comments

charmander_cha@reddit

If you do that, you'll strengthen the open source/free software narrative, and they favor the tools that work closer to being products, with prospects of profit. It's a commercial matter, which demands from us an ethical stance and also promotion.

When did Github stop being about Git?

Posted by dgkimpton@reddit | programming | View on Reddit | 134 comments

charmander_cha@reddit

Since the Microsoft acquisition.

Qwen3.6 (35B-A3B) with OpenCode. Running locally with llama.cpp

Posted by curiousily_@reddit | LocalLLaMA | View on Reddit | 5 comments

charmander_cha@reddit

Which quantization did you use?

Are you guys actually using local tool calling or is it a collective prank?

Posted by Mayion@reddit | LocalLLaMA | View on Reddit | 197 comments

charmander_cha@reddit

Look, I use OpenCode with local models and it definitely does things for me.

When is Qwen 3.6 27B dropping? Didn’t it win the vote?

Posted by GrungeWerX@reddit | LocalLLaMA | View on Reddit | 72 comments

charmander_cha@reddit

I want the 9B version.

Don't ask Qwen 3.6 35b to give you aski image of Yoshi :)

Posted by anzzax@reddit | LocalLLaMA | View on Reddit | 19 comments

charmander_cha@reddit

Scary.

Qwen3.6. This is it.

Posted by Local-Cardiologist-5@reddit | LocalLLaMA | View on Reddit | 420 comments

charmander_cha@reddit

Try doing that with a more aggressive quantization.

Bonsai models are pure hype: Bonsai-8B is MUCH dumber than Gemma-4-E2B

Posted by WeGoToMars7@reddit | LocalLLaMA | View on Reddit | 71 comments

charmander_cha@reddit

Comparing it with other models doesn't seem like something that really does justice to understanding the model and its limitations.

Ternary Bonsai: Top intelligence at 1.58 bits

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments

charmander_cha@reddit

But isn't this the original paper? Did it mention ternary quantization? I'll look again, but I don't remember seeing anything about it.

Ternary Bonsai: Top intelligence at 1.58 bits

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments

charmander_cha@reddit

Does this article have to do with that bit distillation paper? If so, it said the technique didn't seem viable for large models.

Ternary Bonsai: Top intelligence at 1.58 bits

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments

charmander_cha@reddit

Yes, but a 35B or 27B one would be much more democratic.

Comparison Qwen 3.6 35B MoE vs Qwen 3.5 35B MoE on Research Paper to WebApp

Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 29 comments

charmander_cha@reddit

Nobody goes on the internet anymore anyway.

Its just a new Qwen model

Posted by StandardLovers@reddit | LocalLLaMA | View on Reddit | 21 comments

charmander_cha@reddit

A sign that they'll keep up the mystery, haha.

VAD issues - takes too much time to understand when the user has stopped talking

Posted by Male_Cat_@reddit | LocalLLaMA | View on Reddit | 9 comments

charmander_cha@reddit

Local? Are you sure it isn't latency due to the hardware? I use LiveKit; maybe their documentation has a tip for you.

(llama.cpp) Possible to disable reasoning for some requests (while leaving reasoning on by default)?

Posted by regunakyle@reddit | LocalLLaMA | View on Reddit | 21 comments

charmander_cha@reddit

Would that be possible with llama.cpp's default router system?

(llama.cpp) Possible to disable reasoning for some requests (while leaving reasoning on by default)?

Posted by regunakyle@reddit | LocalLLaMA | View on Reddit | 21 comments

charmander_cha@reddit

Is there a way to do that in OpenCode?

BlueTTS is basically supertonic look at the paper and the code

Posted by Elegant-Condition206@reddit | LocalLLaMA | View on Reddit | 6 comments

charmander_cha@reddit

Simple: with posts like this one that look AI-written. If you want success, I first need to know what it's about without having to click the link.

huge improvement after moving from ollama to llama.cpp

Posted by leonardosalvatore@reddit | LocalLLaMA | View on Reddit | 76 comments

charmander_cha@reddit

Sorry, I didn't have time to thank you! I'll try to use it later today, thank you!

It was fun while it lasted...

Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 15 comments

charmander_cha@reddit

Do you really think this has nothing to do with the energy crisis???

It was fun while it lasted...

Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 15 comments

charmander_cha@reddit

And because the fucking United States bombed a nation that was literally minding its own business (contrary to what Nazi-American propaganda says), and because these sons of bitches make a hobby of killing children, especially girls, we'll now have to foot the oil bill. A nation governed by pedophiles who govern stupid racists.