Running DeepSeek locally using ONNX Runtime

Posted by DangerousGood4561@reddit | LocalLLaMA | View on Reddit | 7 comments

Just wanted to drop this here for anyone interested in running models locally using ONNX Runtime. The focus here is on using the NPU in Snapdragon X Elite, but can be extended to other systems as well!