Training on 8x v100 32GB with NVLink or 2x RTX Pro 6000?

Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 1 comments

Does anyone have experience fine tuning models QLoRA, LoRa and full training on 8x v100 32gb? * Is **Volta** still a viable option? Pytorch support looks deprecated * What models fit? * Training speed? * Thoughts on 8x v100 32GB compared to 2x RTX Pro 6000 96gb?