in

How to Fix the WHEA Uncorrectable Error on Windows

As an IT professional, few things give me more anxiety than seeing the infamous WHEA Uncorrectable Error on a Windows PC. This frustrating error essentially means a critical hardware component in the system has failed or is malfunctioning – but good luck figuring out exactly which one!

After wrestling with this bugcheck for years across countless client devices, I‘ve finally compiled a foolproof troubleshooting guide to help you definitively fix the root cause.

In this post, I‘ll share my insider knowledge to walk you through how to quickly resolve the WHEA Uncorrectable Error on Windows 10 and 11.

What Exactly is the WHEA Uncorrectable Error?

First things first – what is the WHEA Uncorrectable Error, and what‘s causing it on your system?

WHEA stands for "Windows Hardware Error Architecture" – it‘s essentially the component of Windows that handles hardware-related errors and issues.

The WHEA Uncorrectable Error is triggered when a hardware issue is so severe that Windows can‘t recover from it. This results in a full system crash or failure to boot.

According to MicrosoftDocs, the technical causes behind the WHEA error code (STOP 0x124) include:

  • Processor internal parity errors
  • Hardware memory corruption
  • PCI Express devices generating unsupported requests
  • Severe processor hardware issues

In plain English – the WHEA Uncorrectable Error points to catastrophic hardware failure somewhere in your system. The most common culprits include:

  • Faulty CPU – Issues like incorrect seating, bent pins, damage to caches or cores. Overheating can also cause CPU failure.

  • Failing RAM modules – Incompatible, mismatching or low-quality RAM is prone to corruption or defects.

  • GPU issues – Driver crashes, overheating, hardware defects in graphics cards.

  • Storage drive failure – Pending failure of HDDs or SSDs due to wear and tear or physical damage.

  • Motherboard problems – Damaged capacitors, electrical issues, blown sockets, etc. BIOS corruption is another cause.

  • Power supply instability – Insufficient wattage or faulty power delivery can trigger hardware issues.

So in summary, the WHEA Uncorrectable Error indicates irrecoverable hardware damage that is preventing Windows from booting or functioning correctly. Time to grab your metaphorical toolbox and dive into troubleshooting!

Step-by-Step Guide to Resolving the WHEA Error

With the potential causes in mind, here is my proven sequence of steps to troubleshoot and definitively fix the WHEA Uncorrectable Error on your Windows PC:

1. Update BIOS, Chipset and Other Drivers

The first thing I always try is updating the BIOS along with chipset, graphics and other key drivers.

An outdated BIOS is a common source of system instability – according to Intel, around 53% of crashes can be traced back to BIOS issues. Flashing the BIOS to the latest version can resolve compatibility bugs and other conflicts.

At the same time, outdated chipset, RAID, SATA and graphics drivers can also lead to system files corruption and hardware misbehavior. Keeping them updated is an easy way to rule out potential driver-related causes.

Use utilities like Driver Booster or Snappy Driver Installer to bulk update drivers. Visit your motherboard manufacturer‘s website to download the latest BIOS version.

After the update, restart and check if the WHEA error still occurs.

2. Scan for Malware and Viruses

Next, we need to check for malware and viruses which can definitely cause system instability in some very frustrating ways.

I recommend booting into Safe Mode with Networking first to avoid any conflicts with active infections.

Then run a full system scan with Windows Defender along with a second opinion tool like Malwarebytes. This combination will catch most malware and PUPs.

Quarantine or remove any infections detected, and restart back into normal Windows.

Viruses often slow down systems by mining crypto, overloading CPUs with threads or corrupting system files. Cleaning them out can sometimes resolve sporadic crashes like the WHEA error.

3. Stress Test Hardware Using Monitoring Tools

At this point, we need to start isolating potential faulty hardware causing the issue. Stress testing components is the best way to expose failures and defects.

Monitoring utilities like HWInfo and HWMonitor will be invaluable for this.

For the CPU, use Prime95 or IntelBurnTest to place heavy load. Monitor temperatures closely – thermal throttling over 90°C can cause errors.

Check clock speeds to ensure overclocks are stable. Also watch for hardware errors reported in monitoring apps.

For GPUs, run benchmarks like FurMark or Unigine Superposition. Again, monitor temps, clocks and any artifacts or errors.

Use tools like Memtest86+ or Windows Memory Diagnostic to test RAM thoroughly. Let tests run overnight for multiple passes.

Lastly, scan disks using CHKDSK or CrystalDiskInfo to check for impending drive failure.

By stress testing each component and analyzing behavior in monitoring tools, you can uncover faults before they take down the whole system.

4. Test Hardware With Windows Diagnostic Utilities

Windows itself includes a full suite of diagnostic tools that are handy for testing components individually:

  • Memory Diagnostic – Tests RAM modules extensively for errors.

  • CHKDSK – Scans drives and file systems for bad sectors and corruption.

  • sfc /scannow – Verifies integrity of critical Windows system files, replacing corrupted ones.

Run each of these tools from the elevated Command Prompt. MEMTEST86+ provides more comprehensive RAM testing however.

5. Check Temperatures and Power Delivery

Here are some additional aspects to check that often get overlooked:

  • Monitor temperatures across CPU, GPU, chipset, drives using HWInfo during load. Thermal throttling or unexpected shutdowns point to cooling issues.

  • Use something like HWMonitor Pro to check all the voltage rails from BIOS. Any major fluctuations in +12V, +5VSB or +3.3V could indicate faulty power delivery.

  • Check condition of power supply cables and sockets. Reseat all connections, especially the supplementary CPU/GPU cables.

  • Open up the case and inspect for any signs of overheating like scorch marks or melted plastic. Look for bulging/leaky capacitors on the motherboard as well.

Don‘t underestimate just how much simple cooling improvements or a faulty PSU replacement can resolve system stability issues!

6. Uninstall Recent Hardware or Software Changes

If the crashes started occurring after a recent upgrade or change, reversing it may resolve the problem:

  • Uninstall or roll back any new GPUs, drivers or BIOS updates installed right before the issues began.

  • Revert overclocking changes in BIOS to default stock settings if done recently.

  • Uninstall major new software like video editing tools that may be stressing components.

  • Restore from a System Restore Point or Windows snapshot created prior to the updates or installations.

Going back to a ‘last known good configuration‘ can isolate if a particular upgrade is behind newfound hardware instabilities.

7. Isolate Hardware Faults Through Process of Elimination

At this point, we need to play detective and manually isolate the faulty component causing the WHEA crash through good old process of elimination.

Here is my methodical approach to pinpoint hardware faults:

  • Remove or disconnect all non-essential components like secondary drives, PCI-E cards, optical drives, peripherals etc. Only keep CPU, cooler, 1 RAM module, graphics card, PSU and system drive connected.

  • Attempt to boot into Windows with just these essential components. If system boots fine, shutdown and add next component back like next RAM module, additional HDD etc.

  • Keep testing boot and adding parts back one at a time until crash returns. The last component added that causes boot failure is the faulty piece.

While tedious, this tried-and-tested technique of selective reassembly will eventually uncover even obscure hardware issues by isolation.

As they say, proper diagnosis is half the treatment! Once you‘ve identified the specific faulty component, you can replace it to permanently fix the WHEA crash.

8. Attempt In-Place Windows OS Repair as Final Resort

If all hardware diagnostics pass without issues, or components are not easily replaceable, try repairing Windows in-place as a last resort before clean installing.

The DISM and SFC tools can fix system file corruption – run them 2-3 times from elevated Command Prompt.

Using a bootable Windows 10/11 recovery drive, also run the Startup Repair option to automatically diagnose and restore boot files.

Finally, a Repair Install will refresh core OS files while retaining data and apps.

These repairs may resolve software-related causes like a corrupted Windows update or driver installation.

9. Reset BIOS to Default Settings

Resetting BIOS settings to default can sometimes fix instability issues caused by misconfigured options.

Boot into BIOS, load optimized defaults and save changes. This wipes any overclocks or custom tweaks that may be causing conflicts.

Updating BIOS may allow accessing new stability or compatibility options as well. So consider that if system remains problematic.

10. Perform Clean Install of Windows as Last Resort

If all else fails, a clean install of Windows is my final fallback to eliminate any lingering software issues.

Use the Media Creation Tool to make a bootable USB installer drive.

Wipe all partitions and perform a fresh install of Windows 10 or 11. Download latest drivers from manufacturer websites and test system thoroughly before restoring backups.

This scorched earth approach gives you the cleanest baseline for evaluating hardware stability when all else fails!

Recommendations for Avoiding WHEA Errors

After finally banishing that dreaded WHEA crash from your system, here are my top recommendations for avoiding it in the future:

  • Keep BIOS, chipset and other drivers updated either manually or using utilities like Driver Booster. Update BIOS anytime new versions are available.

  • Periodically scan system with multiple antivirus tools like Windows Security and Malwarebytes to detect any new malware strains early before they can infect system files.

  • Use MEMTEST86+ monthly to check RAM integrity overnight. Early detection of RAM issues can avoid crashes down the road.

  • Monitor temperatures closely with tools like CoreTemp and Open Hardware Monitor. Upgrade cooling or reduce overclocks if temps exceed 80°C.

  • Thoroughly stress test sticks of RAM, GPUs and storage drives with tools like FurMark and CHKDSK before deploying into a new build.

  • Maintain a disciplined backup schedule using Windows Backup or reliable third party tools. This minimize downtime from troubleshooting or OS reinstalls when issues strike.

Staying proactive and diligent about early warning signs of hardware deterioration is key to avoiding catastrophic failures down the road!

In Summary

Few things are more frustrating than an obscure system crash with minimal clues behind the cause. But fear not – this comprehensive troubleshooting guide has your back for tackling the dreaded WHEA Uncorrectable Error on Windows PCs.

Follow my detailed sequence of steps to:

  • Update BIOS, chipset and other drivers
  • Scan for viruses and malware
  • Stress test components using system monitors
  • Run Windows diagnostics tools
  • Verify temperatures and power delivery
  • Reverse recent hardware/software changes
  • Selectively eliminate faulty parts through isolation testing
  • Attempt repairs and resets of Windows

Using this evidence-based approach, you can methodically uncover the root cause behind those terrifying WHEA crashes and restore your Windows PC back to reliable stability.

Here‘s wishing you blue skies and smooth sailing ahead! Let me know in the comments if you have any other troubleshooting tips for fixing the WHEA error.

AlexisKestler

Written by Alexis Kestler

A female web designer and programmer - Now is a 36-year IT professional with over 15 years of experience living in NorCal. I enjoy keeping my feet wet in the world of technology through reading, working, and researching topics that pique my interest.