cancel
Showing results for 
Search instead for 
Did you mean: 

ROG-STRIX Z690-E WHEA-Logger ID.17 whith cold Board

twobnt
Level 7
If the mainboard is cold (room temperature >21°C < approx. 30°C), i.e. after a longer standstill (power supply unit is permanently switched on), the WHEA logger throws warning messages with the ID.17 for the Intel for a certain time (R) PCI Express Root Port #2 - 7AB9 : A corrected hardware error has occurred.
PCI Slot 5 (PCI Bus 7, Device 0, Function 0)


  • It only does this when the system is cool and longer if it's colder in the room.
  • If the warm computer is switched off briefly (also in combination with a completely dead power supply unit), or if a reboot is only carried out, no warning messages appear.
  • I have not connected any devices to the SATA bus in question.
  • After that, the computer runs super stable, no freezes or blue screens, not even under full load and all possible simultaneous tests



Can anyone confirm this phenomenon?

Here my hardware:


  • Board: Asus ROG-STRIX Z690-E GAMING WIFI
  • Processor: Intel I9 12900KS
  • Cooling: Corsair iCUE H170i ELITE liquid CPU cooler with LCD display
  • RAM: 64GB from 2 x G.Skill Trident Z5 RGB - DDR5 - Kit - 32GB: 2 x 16GB - DIMM 288-PIN - 6400 MHz / PC5-51200 - CL32 - non-ECC - (F5-6400J3239G16GX2-TZ5RS)
  • Graphics Card ASUS ROG Strix GeForce RTX 3080 10GB V2 OC Version Gaming Graphics Card
  • NVMe M2: 2 x Samsung 980 PRO 1TB PCIe 4.0 (up to 7,000MB/s) NVMe M.2 (2280) Internal Solid State Drive (SSD) (MZ-V8P1T0BW)
  • Additional network card Asus PCE-C2500 2.5G network card (RJ45 port, IEEE 802.3an 2.5G Base-T)
  • PSU: Corsair RM1000x 80 PLUS Gold fully modular 1000 watt ATX power supply



What have I already tried:


  • Bios version 1720 and 1505 tested, makes no difference
  • Made multiple CMOS resets.
  • OC is always switched off, I always work with the default values from the bios.
  • Windows 21H2 reinstalled with all updates.
  • All driver updates installed
  • Tried different start scenarios. Always with the same result.
  • MEMtest86 runs without errors.
  • All cable connections checked.
  • Absolutely no problems with Samsung's NVMe M.2
  • Waiting 5 Minuts in Bios before bootimg into Windows if the systrem is cold -> No Warnings




Here is the always the same error message:

Log Name: System
Source: Microsoft-Windows-WHEA-Logger
Date: 07/01/2022 12:56:29
Event ID: 17
Task Category:None
Level: Warning
Keywords:
User: Local Service
Computer: I9
Description:
A corrected hardware error has occurred.

Component: PCI Express Root Port
Error source: Advanced Error Reporting (PCI Express)

Primary Bus:Device:Function: 0x0:0x1C:0x1
Secondary bus:device:function: 0x0:0x0:0x0
Primary device name: PCI\VEN_8086&DEV_7AB9&SUBSYS_86941043&REV_11
Secondary device name:
Event XML:



17
1
3
0
0
0x8000000000000000

4152


System
I9



4
{00000000-0000-0000-0000-000000000000}


0xdf
4
0x101
0x10
0x407
0x0
0x1c
0x1
0x0
0x0
0x0
0x0
0x8086
0x7ab9
0x30400
0x0
0x0
0x0
0x0
0x1
00000000000000000000000000000000
PCI\VEN_8086&DEV_7AB9&SUBSYS_86941043&REV_11



996 Views
10 REPLIES 10

doomkitten
Level 7
Our builds are relatively similar:

CPU: Intel Core i7-12700K
Motherboard: Asus ROG STRIX Z690-E GAMING WIFI
Memory: G.Skill Trident Z5 RGB 32 GB (2 x 16 GB) F5-6000U3636E16GX2-TZ5RS (almost the same model as yours it seems)
Storage: Samsung 980 Pro 1 TB M.2-2280 PCIe 4.0 X4 NVME Solid State Drive
Video Card: NVIDIA Founders Edition GeForce RTX 3080

I get the exact same error constantly, but I don't know if I've noticed it being related to temperature, so I'll try to watch for it. It does seem like it doesn't happen randomly, so that could be the factor I haven't considered. I've assumed it been related to my M2 drive, and you have the same one, so that's interesting. Which ports do you have your two m2 drives in?

Nate152
Moderator
Hello,

Welcome to the ROG forum.

It's suggested to disable PCIe Native Power Management to rid the WHEA errors.

Good morning,

The errors do not come from the ports where the m2 are plugged in, but from the empty SATA port. At least that's what my Windows says

And that only after the start for a certain time.
The colder the starting temperature of the mainboard, the longer and more of these warnings come up.
Also, the warnings come faster at the beginning and build up,...

Today it was just under 19°C in the room and there were around 765 warning messages over a period of 6 minutes.
Yesterday I parked the computer in the bios for 10 minutes to warm it up before I started Windows, not a single message came up.

I had already studied the solution provided by Nate152 in other threads for other problems, primarily in connection with m2 drives, but had not yet adapted it to my temperature problem.

I'll test the setting tomorrow - let's see what happens then.

get in touch

twobnt wrote:
The errors do not come from the ports where the m2 are plugged in, but from the empty SATA port. At least that's what my Windows says


You can do what SilentScone suggested, but that was also the case on my Z690 Extreme.

Disabling PCIe Native Power Management in the BIOS as previously suggested "fixed" the issue.

Silent_Scone
Super Moderator
Hello,

Reseat the CPU whilst checking for any bent pins, then reattach making sure for even mounting pressure

Check the motherboard stand-offs are not overtightened

If no change, remove the motherboard from the enclosure and retest
13900KS / 8000 CAS36 / ROG APEX Z790 / ROG TUF RTX 4090

Falkentyne
Level 12
That ID is from the ASMedia SATA controller, while 7AB4 is from the Standard NVM Express Controller.
Setting PCIE native power management to disabled fixed the NVM Express WHEA. Also setting the m2 drives on the m2 riser card (dimm_1 and dimm_2) to Gen 3 also removed these errors.

I could have sworn that a newer bios (at least on the Maximus Extreme....1302 or newer) completely fixed this though. I would have thought if that were the case, the same fix would work on the Strix. I do know that MSI and Gigabyte also had WHEA errors related to this. Seems to be something directly chipset related.

Hi

@Silent Scone@ROG

Thank you for your feedback signal.
I'll save your suggested solution for later.

On the one hand I was very careful with the initial assembly, on the one hand in relation to inserting the CPU itself and on the other hand with the adapter plate for the CPU cooling and the water block.
All screws were tightened crosswise in several moves until they were only slightly hand-tight.
On the other hand, the system is now running very well and without errors and without warnings.


@Nate152

After two cold starts with the deactivated PCIe Native Power Management and the new Bios 2004 I haven't received any new WHEA 17 errors so far.

Will turn on the PCIe Native Power Management again tomorrow and see what happens with the new Bios 2004 only.

I'll get back.


Until then, many thanks to all supporters.

Morning,

So, i have tried only the new BIOS 2004 with enabled PCIe Native Power Management - unfortunately the calculator threw WHEA 17 warnings again, although not as much. So again the PCIe Native Power Management switched off. And every thing is fine

VRT
Level 8
I take it no-one ever figured out this problem I think I may know where it's coming from. Shared resources, in my particular case it's SATA, I have 1 4TB SSD in SATA, my other drives are M.2, that's the one that keeps popping up all the time from a cold boot, I don't seem to get them after a few hours.
97577