cancel
Showing results for 
Search instead for 
Did you mean: 

ROG Strix z690A Gaming WiFi WHEa 17 Errors

Grendel602
Level 7
Swapped my fully functioning 3090RTX, 9900K, Z390 Gigabyte Designare setup for a brand new Z690 and 12900K rig.

After swapping all components over to the Gaming Wifi MOBO and 12900k combo, install WIN 11 and I get thousands of WHEA errors for ven_8086&dev_ 460d&SUBSYS_86941043&REV_02

I also get random hangup, BSODS, and straight crashes.

I'm a day of frustration away from RMA. I swapped back to old system and all works perfectly with 9900k and Z390 mobo, so the only possible hardware issue is the mobo or CPU....I guessing this is a mobo hardware component issue.

PCI ID Repository suggests it is a 12th Gen Core Processor PCI Express x16 Controller #1.

Device manager showed no issues with any hardware.
80,626 Views
333 REPLIES 333

STARRAIN_ROG
Customer Service Agent
Hi Drayco31,
Not sure if you also told the retailer the issue only occurs when install the GPU in when you RMA'd the board back in January.
If you don't have GPU to cross check, we would suggest also share the information to the retailer.
Thank you.

Drayco31
Level 11
@ STARRAIN@ROG

Back then I hadn't figured out the issue is only when a dedicated GPU was installed. Only since the GPU got RMA'd for 5 weeks the system was stable and became unstable again <24 hours after the GPU was reinstalled. Unfortunately I don't have another GPU to test . Tbh this is the 3rd RTX 3070 I've had since September so it wouldn't suprise me if it's the GPU. They did get the fault when testing previous GPUs though. Retailer ignored the fact I said it only occurs low load and tested it using intensive tests for 24 hours. That's why I demanded they return it to Gigabyte because the idiots tested it precisely how I said no faults occured so them not finding a fault is not surprising. Possible Gigabyte also found no faults if the retailler didn't make it clear the issue is low load only.

Since I forgot to turn Multicore enhancments off after Bios update/load optimised defaults I have done that again and while I had intended on calling the retailers today I feel I should rule that out. Was no issue enabled for 5 weeks without the GPU so I doubt that'd be a solution but the more I can rule out the better. Always the problem with such intermittent faults. Sometimes it's multiple times a day and then 5/6 days without issue. If no crashes I'll have more testing but if crashes occur before the end of Thursday I'll be calling both retailers on Friday morning for options.

EDIT: Happened again so called GPU Retailer. Spent half hour on phone and they processed an RMA again, fnally. 3rd GPU in a row with intermittent crashes involving Display driver nvlddmkm stopped responding and has successfully recovered. Not proven yet but I always suspect it was the GPU. If it turns out it is the GPU, again, I must be sooo unlucky that the 3rd RTX 3070 GPU since September 2021 has been faulty.

Purchased a used 2080 from elsewhere that I should be able to use whilst the 3070 gets tested. Confident and hopeful the crashes don't return with the 2080

Drayco31
Level 11
Got a used 2080 Super to try out. Hopefully everything runs fine so I can have proof that it is the GPU they supplied thjat is causing all the issues so they need to keep testing it.

Model I got was ROG-STRIX-RTX2080S-O8G-GAMING and everything is running fine so far. I do seem to have PCI Device under the Other device heading in Device Manager. Anyone know what that is and where I can find a driver for it?

I've contacted Asus support about this and they sent me a form to fill out to send to their technicians so they can try and repeat the problem.
Has anyone else here followed the same procedure and found a fix?

Obviously, the workaround of disabling PCIEX power management is a workaround, but we don't want to have to keep an essential feature disabled.

For the record I am using a RTX 2080, not a PCIE4 graphics card, so it has nothing to do with PCIE4 as far as I can see.

It also has nothing to do with ASPM, but the entire power management featureset.

I own the ROG STRIX Z690-I GAMING WIFI

93605

STARRAIN_ROG
Customer Service Agent
Hi Dodgexander,
Do you have any issue when get WHEA 17 warning?
Do you also get WHEA 17 warning on bios 1505 with bios all default settings?
Thank you.

STARRAIN@ROG wrote:
Hi Dodgexander,
Do you have any issue when get WHEA 17 warning?
Do you also get WHEA 17 warning on bios 1505 with bios all default settings?
Thank you.

Yes, when I keep Native Power Management enabled and I get the WHEA17 spam the system will slow down to a halt sometimes and there will be a BSOD or full system lockup. Despite this 90% of the time the system seems stable despite these messages.

I have the latest 1505 Bios and reset to default settings, and I still get the error.

I've been in touch with Asus customer support and would recommend everyone do the same thing. The more cases they get, the easier they solve the problem.

Thankfully, they have recognized the issue with their second level support and said they are working on a fix.
Here's my last email back from them:

My name is xxxx. and it's my pleasure to help you with your problem.

I'm writing this email as an update regarding the escalation case. In coordination with our service center, Please disable PCIE native power management to stop WHEA error for temporarily solution. We are working on final solution and we will send you an update once we have it.

Anyone else with the same issue disabling PCIE Native Power Management does stop the stability issues. I expect if you are still having problems after disabling this, you have a problem elsewhere.

For now, we have to keep this disabled and hope Asus can fix it.

Drayco31
Level 11
Update

Thought I'd say problem solved. RMA'd the GPU again and finally the morons at the retailer ran it low load/idle and managed to re-produce the fault versus stressing it hard for 24 hours. Keep telling them I've had it crash with absolutely nothing loaded and me going afk and returning to a crashed system. Kinda a Yay accept now thats upto 4 weeks with no GPU. 3rd RTX 3070 in a row since September 2021. I only upgraded from my 7700K build because they returned the 3rd GPU as no fault found and I've been trouble shooting and RMA'ing other components.

Their tech support even suggested it could be a short in my case before I bit his head off at such a stupid suggestion when he's been dealing with me for 9 months. How can it only be a short when the GPU goes in yet no short running on iGPU or a GT1030? Or the fact it was fine on the GTX 1070, had since release, before I upgraded to a 3070.

Sorry for wasting peoples time and I appreciate the assistance. Enjoy your weekend

I had the same WHEA Errors.

I set PCI Express Native power management > Disabled as read in a post before some days ago.
Afterwards i had 2-3x BSOD with an error of the dll of nvida. I did a complete reinstall of the nvidia driver with nvidia experience.
Now the system seem so be very stable

STARRAIN_ROG
Customer Service Agent
Hi Dodgexander,
Thank you for your reply
If our local support has got back to you, please wait for an update from them.
Thank you.

STARRAIN@ROG wrote:
Hi Dodgexander,
Thank you for your reply
If our local support has got back to you, please wait for an update from them.
Thank you.

Here's the latest update from support:

When will WHEA Error 17 appear? There are no known problems. Please help confirm what you are using environment and operation methods, thank you.


Very discouraging to say the least after being told a fix would be worked on.

This is a widespread issue with all Z690 motherboards, with Gigabyte they even had to replace customer boards.

What's Asus official stance on this please?