My graphics card seemingly stops outputting a signal and I'm forced to restart the computer (replugging the HDMI cable does nothing). I'll be watching Netflix (can't overheat off Netflix!) or playing a game and then my monitor randomly loses signal and goes to sleep. I can still talk to my friends on Discord (voice chat), so the computer is still functioning. I take a screenshot every time it does this and I haven't seen any error messages except once when I was running 'Windows Experience Index' and it said: "The NVIDIA GeForce GTX 670 device is not removable and cannot be ejected or unplugged".
I should mention that at my old house it worked flawlessly. So I figured it was an electrical grid issue and I bought an Uninterruptible Power Supply, which all my components and LAN cable connect to. It did not fix the issue.
Windows Event Viewer was saying "Display driver nvlddmkm stopped responding and has successfully recovered," so I set TdrDelay to 8 seconds (from the default of 2) and underclocked my memory and core by -100. Now I don't see that error in Event Viewer, but the issue still occurs. Should I consider disabling TDR entirely?
Here's what I'm getting in Event Viewer now after the monitor stops receiving a signal (Administrative Events):
As I understand it, something is causing the GPU to stop working and then it's unable to recover. Any ideas about how to fix this, or at least how to get the GPU to recover successfully within a couple of seconds? That would be much better than a hard reset of the computer.
I've checked wiring, temperatures (GPU max 85C in hardcore game), ran memtest, GPU/CPU benchmarks for 15+ minutes and like 500 other things. Don't know what to do!
Here's my PC specs:
Operating System: Windows 7, 64-bit (Desktop)
System Model: Alienware X51 R2
BIOS: 07/13/15 11:13:58; Ver: 04.06.05
Processor: Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz (8 CPUs), ~3.4GHz
RAM: 16GB
GPU: NVIDIA GeForce GTX 670
Best Answer
If you're able to take and store screenshots after the monitor goes to sleep, then the GPU isn't crashing. Windows 7 can't render frames without the GPU working.
It looks like you used to have a problem with heat or partial failure of the GPU causing reliability issues, thus the TDRs. But since you're no longer getting TDRs after underclocking and increasing the TDR timeout, that tells me that your GPU is now working well enough to be stable.
If you disabled the TDR timeout, your PC would lock up completely (no ability to take screenshots, hear audio, etc.) every time the GPU crashed, and it would stay locked up indefinitely or BSOD, and you'd have to reboot. Similarly, if you're not getting BSODs when you do get TDRs, then your GPU is recovering from the crash successfully.
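For reference, the TDR timeout is the `TdrDelay` DWORD under `HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\GraphicsDrivers`. Here is a minimal Python sketch that only builds the text of a `.reg` file for a given delay, so you can read it over before importing anything. The helper name `tdr_reg_fragment` and the 1 to 60 second sanity range are my own assumptions, not anything Windows enforces.

```python
# Sketch only: builds .reg file text for the TDR timeout; it does not touch the registry.
# tdr_reg_fragment and the 1..60 second range are illustrative assumptions.

GRAPHICS_DRIVERS_KEY = (
    r"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\GraphicsDrivers"
)


def tdr_reg_fragment(delay_seconds: int) -> str:
    """Return .reg file text that sets TdrDelay to delay_seconds."""
    if not 1 <= delay_seconds <= 60:
        raise ValueError("delay_seconds outside the assumed sane range 1..60")
    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        f"[{GRAPHICS_DRIVERS_KEY}]",
        # DWORD values in .reg files are written as 8 hex digits.
        f'"TdrDelay"=dword:{delay_seconds:08x}',
        "",
    ]
    return "\r\n".join(lines)


if __name__ == "__main__":
    print(tdr_reg_fragment(8))
```

A reboot is needed before a changed value takes effect.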
So this seems to be strictly limited to the display output path and has nothing to do with the "processing" part of the GPU. It could be:
Things you should try:
If swapping the Nvidia card or running off of the Intel HD graphics solves the problem, it's very likely to be a failing Nvidia card (some part of it; but it doesn't really matter which part because it's not like you're going to break out a $10k+ microsoldering kit and start physically repairing the traces, are you?).
If the problem persists with the Intel HD graphics only (with the Nvidia card playing no part in the display output path), it's very likely either the monitor cable, or the monitor itself going bad.
If the only thing that fixes it is running Ubuntu, it's very likely to be a software issue -- either a bad driver, bad configuration (of display timeout, etc.), or even a virus/malware in your Windows installation.
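The three outcomes above boil down to a small decision procedure. Here it is as a rough Python sketch. The function name, parameter names, and the order of the checks are my own way of writing it down, and "inconclusive" is a catch-all I added for combinations the rules above don't cover.

```python
# Sketch of the troubleshooting logic above; names are illustrative, not from any tool.

def likely_cause(fixed_by_swap_or_intel: bool,
                 fixed_only_by_ubuntu: bool,
                 persists_on_intel_only: bool) -> str:
    """Map test results to the most likely culprit.

    fixed_by_swap_or_intel: problem disappears with another Nvidia card
        or when running off the Intel HD graphics.
    fixed_only_by_ubuntu: nothing helps in Windows, but Ubuntu runs fine.
    persists_on_intel_only: problem still occurs with the Nvidia card
        out of the display output path.
    """
    if fixed_by_swap_or_intel:
        return "failing Nvidia card"
    if fixed_only_by_ubuntu:
        # Same hardware works under another OS, so hardware is unlikely.
        return "software (driver, configuration, or malware)"
    if persists_on_intel_only:
        return "monitor cable or monitor"
    return "inconclusive"


if __name__ == "__main__":
    print(likely_cause(False, False, True))
```

The Ubuntu check comes before the Intel-only check on purpose: if the problem persists on Intel graphics in Windows but disappears under Ubuntu on the same cable and monitor, software is the better suspect.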