5090 is crashing randomly, nvlddmkm errors.
Posted by NikiGavia@reddit | buildapc | 8 comments
I finished building my PC at the end of last year and haven't had any issues until now. I am getting black screens, freezes with weird artifacts on the screen, and reboots. All the while, Event Viewer is logging Event ID 153 and 13 errors.
System:
Ryzen 9 9950X3D
TUF Gaming B850M-Plus Wifi7
Teamgroup T-Create Expert 64GB CL34 DDR5-6000MHz
ASUS Astral 5090
ASUS Loki 1200W
4K@144Hz TCL TV
I have tried a negative PBO offset, an undervolt, and stock settings for my CPU. The worst part is that there is no rhyme or reason to the crashes. They are always random, and I cannot reproduce them in any capacity. Sometimes it crashes while watching YouTube videos or sitting idle; other times it crashes while playing a game (Warframe, for example). Worse yet, there are times when I have long gaming sessions or go days with everything being fine.
I have used DDU to install older drivers and have also installed the latest drivers. I have reseated my RAM, GPU, and cables. My temps have ranged between 40 and 60°C from idle to load.
What am I missing? Any help would be welcomed.
NikiGavia@reddit (OP)
Update: The nvlddmkm errors came back on the troublesome GPU, but worse this time: over 400 errors within milliseconds. I'm looking into an RMA. Any tips on dealing with the ASUS RMA process?
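For anyone who wants to put a number on bursts like this before filing an RMA, here is a minimal Python sketch. It assumes you have already pulled the timestamps of the nvlddmkm events out of Event Viewer by whatever export method you prefer. The function name `find_bursts`, the one-second gap, and the ten-event threshold are illustrative choices of mine, not anything defined by NVIDIA, ASUS, or Windows, and the demo data is synthetic.

```python
from datetime import datetime, timedelta


def find_bursts(timestamps, max_gap=timedelta(seconds=1), min_events=10):
    """Group event timestamps into bursts.

    A burst is a run of events in which each event follows the previous
    one by no more than `max_gap`. Runs with fewer than `min_events`
    events are ignored. Returns a list of (first, last, count) tuples.
    """
    bursts = []
    current = []
    for ts in sorted(timestamps):
        # A gap larger than max_gap closes the current run.
        if current and ts - current[-1] > max_gap:
            if len(current) >= min_events:
                bursts.append((current[0], current[-1], len(current)))
            current = []
        current.append(ts)
    # Flush the final run.
    if len(current) >= min_events:
        bursts.append((current[0], current[-1], len(current)))
    return bursts


if __name__ == "__main__":
    # Synthetic example: 400 events spaced 1 ms apart, plus two stray events.
    base = datetime(2026, 1, 1, 12, 0, 0)
    events = [base + timedelta(milliseconds=i) for i in range(400)]
    events += [base + timedelta(hours=1), base + timedelta(hours=5)]
    for first, last, count in find_bursts(events):
        print(f"{count} events between {first} and {last}")
```

A summary like "N events between time A and time B" is easier to paste into a support ticket than a raw log dump, and it makes it obvious whether the errors are isolated or arrive in floods.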
NikiGavia@reddit (OP)
UPDATE: Made some progress. I moved the GPU to a different PC and, after some initial issues, neither PC has had any problems, which leaves me even more stumped. Both PCs picked up some driver updates after I swapped GPUs, and everything has now been working for multiple days. One confusing development, though: the GPU from the troubled PC shows a steady blue light bar on startup, then goes back to my normal ARGB color scheme once Windows fully loads. I've read in some places that this could indicate a voltage imbalance, but I'm having difficulty confirming that. I can't seem to find a manual either. Should I be concerned about this blue light on startup? Where can I find more info about it?
Virtual-Okra4152@reddit
Any updates on this? Experiencing the same thing running X-Plane 12.
System:
Ryzen 9850X3D
ASUS Strix B850i
KLEVV CRAS V RGB 48GB CL28 DDR5-6000MHz
MSI Ventus 3X 5090 OC
Corsair SF1000 ATX 3.1
Pyr0blad3@reddit
same here for 2 months now...
PenFar9334@reddit
Do you have another rig, or know someone with a rig capable of handling it? Might be worth testing, since I have a hunch it's the board.
NikiGavia@reddit (OP)
I'll try testing the GPU in another computer later today. Why do you think it's the motherboard?
PenFar9334@reddit
Came across a video from a PC repair shop. Dunno if I can find it again, but it showed the same problem. The system had Nvidia driver errors, and the errors still occurred when he tried installing the driver. Only when he swapped the GPU onto a different board did it work.
lichtspieler@reddit
It's most likely the GPU driver. Did you try older versions?
PBO-CO is the last thing you should touch on a system that has stability issues. Testing for UV/CO stability is difficult enough, and then you have to test for clock stretching. Neither of those can be tested with all-core benchmarks, since the problematic high-frequency issues happen during idle / light gaming load.
=> Keep the CPU at stock until you figure out whether it's the GPU driver or not.
AFAIK the last stable NVIDIA driver is still 591.86 from 27.01.2026, at least for 4000 series. Try it, maybe it helps.