Qwen 3 coder next ud-q8-xl F16 filling up the two orin rpc mesh!
Posted by braydon125@reddit | LocalLLaMA | View on Reddit | 10 comments
running great and as you can see here llama.cpp -fit is doing a great job at splitting this evenly . the largest piece of traffic between these two during initial tensor transfer was <5Gbps
10 Comments
Artistic_Okra7288@reddit
braydon125@reddit (OP)
braydon125@reddit (OP)
ManufacturerWeird161@reddit
braydon125@reddit (OP)
braydon125@reddit (OP)
ClimateBoss@reddit
braydon125@reddit (OP)
ClimateBoss@reddit
braydon125@reddit (OP)