dual spark with llama.cpp

Posted by koibKop4@reddit | LocalLLaMA | View on Reddit | 6 comments

I'm daily driving dual Asus GX10 (spark) with vllm and it's fantastic.
But I want to try model that is GGUF only and won't fit into single spark.

I couldn't find any howtos about running llama cpp with dual sparks.

Did anyone tried it? Any suggestions how to run it?
I want to run uncensored minimax