AryanEmbered
-
Is OpenAI's new Memory feature just vector search?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 41 comments
-
KBLaM by Microsoft: this looks interesting
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 56 comments
-
Is slower inference and non-realtime cheaper?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 5 comments
-
No benchmarks or details on the performance of the 0.6B Qwen? 🧐
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Dolphin translator incoming (eventually)
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 6 comments
-
A host of rumours
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 19 comments
-
This video model is like 5-8B params only? wtf
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 14 comments
-
Llama 4 is not omnimodal
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 30 comments
-
Google released Gemma 3 QAT. Is this going to be better than Bartowski's stuff?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 32 comments
-
Multi-threaded LLM?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Is a multimodal-focused release from OpenAI the best for us?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 20 comments
-
Do you think this will catch on? Amazon's Nova models are not very good.
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Why don't we have LLMs that truly learn?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 27 comments
-
How much VRAM do you need to run a 32B with 32k context?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 42 comments
-
What is the current open SOTA for Text2SQL?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 2 comments
-
What application should I use to run Florence2 or SAM2 locally?
Posted by AryanEmbered@reddit | LocalLLaMA | View on Reddit | 10 comments