Skip to main content
  1. Posts/

Scratching the Lottery Ticket

Author
Pratik Kapoor

Every graphics card is unique. Not in a ’every GPU is beautiful’ way, but in a physics way.

Born from Sand
#

Your RTX 4090 started as sand. Actual sand, refined into silicon so pure that only one in a billion atoms is an impurity. That silicon gets sliced into wafers thinner than a human hair, then blasted with ultraviolet light through masks that print transistors smaller than a virus.

The AD102 chip in a 4090 has 76 billion transistors. Each one is about 4 nanometers wide. For perspective: a COVID virus is 100nm. A strand of DNA is 2nm. We’re building structures at the scale of molecules, billions of them, and expecting them all to work.

They don’t all work. Not perfectly, anyway.

The Lottery
#

As transistors shrink, quantum effects creep in. Electrons tunnel through barriers they shouldn’t. Tiny variations in the manufacturing process—a few atoms here, a slight temperature fluctuation there—create chips that are electrically unique.

Some chips conduct efficiently at low voltages. Others need more juice to stay stable. Some run hot. Some boost higher. Same assembly line, same design, different results.

This is the silicon lottery.

NVIDIA bins their chips: the best ones go into flagship products, the mediocre ones get some cores disabled and become lower-tier cards, the worst ones get recycled. But even among the “winners,” there’s enormous variation.

The Second Life of Failed Silicon
#

Here’s where it gets wild. NVIDIA designs multiple GPU dies for each generation:

DieTransistorsIntended For
AD10276 billionRTX 4090
AD10346 billionRTX 4080
AD10436 billionRTX 4070

But what happens when an AD102 comes off the line with defects in some of its 144 compute clusters? It can’t be a 4090. Is it garbage?

No. NVIDIA disables the broken sections and sells it as something else.

In 2024, people started finding RTX 4070 Ti Supers with AD102 dies inside—the same 76-billion-transistor chip from the RTX 4090, but with 54% of its cores disabled. These “failed” 4090 chips got a second life as mid-range cards. Similarly, RTX 4070s have been spotted with AD103 silicon (the 4080’s chip), running with 43% of its cores turned off.

Your RTX 4070 might literally contain a 4080 chip that didn’t make the cut. Your 4070 Ti Super might be a lobotomized 4090. You’d never know without checking GPU-Z.

This is the silicon lottery at scale: not just “how good is my chip,” but “what chip do I actually have?”

How to Find Out Where You Stand
#

So how do you actually measure your chip’s quality? You can’t just look at benchmark scores—a “good” score might be hiding inefficiencies, and a “bad” score might be thermal throttling, not silicon limits.

This Works for Any GPU
#

The methodology below applies to virtually any modern graphics card—not just the RTX 4090. The specific numbers change, but the process is identical:

What ChangesNVIDIAAMD
Tuning toolMSI AfterburnerMSI Afterburner or Radeon Software
Typical voltage range850-1100mV900-1150mV
Clock targetsVaries by GPU tierVaries by GPU tier
Memory OC headroomUsually +500 to +1500 MHzUsually +50 to +150 MHz (GDDR6)

What stays the same:

  • Establish a stock baseline first
  • Track effective clocks, not just reported clocks
  • Calculate clock stretch to find hidden inefficiencies
  • Test in phases: memory → voltage → combined
  • Stress test for stability, not just benchmarks
  • Your optimal settings will be unique to your chip

Whether you have a GTX 1660, RX 7800 XT, or RTX 4090, the silicon lottery affects you. The only way to know where your chip stands is to test it.

Here’s the methodology I used to evaluate my PNY RTX 4090:

Why PNY Verto?
#

When I started researching undervolt settings for my card, I found plenty of results for ASUS, MSI, Gigabyte—but almost nothing for the PNY Verto edition. It’s one of the more affordable 4090s, which means plenty of people own one, yet the community knowledge just isn’t there.

So I documented everything. If you’ve got a PNY Verto 4090 and wondered how it stacks up in the silicon lottery, this is for you.

Step 1: Establish a Baseline
#

Run your card completely stock. No overclocks, no undervolts. Use a consistent, repeatable benchmark (I used 3DMark Steel Nomad) and log sensor data with HWiNFO64.

Key metrics to capture:

  • GPU Clock (reported) — what the card claims it’s running
  • GPU Effective Clock — what it’s actually running
  • Power draw — at high load (>90% GPU utilization)
  • Temperatures — edge and hotspot

Step 2: Calculate Clock Stretch
#

This is the secret metric most people ignore. Clock stretch is the gap between reported and effective clocks:

Clock Stretch % = (Reported Clock - Effective Clock) / Reported Clock × 100

A chip hitting power or thermal limits will report high clocks but actually run lower. This “stretching” is the GPU lying to you. High stretch (>2-3%) means your card is struggling.

Step 3: Test Everything
#

I ran 15+ configurations across three categories: memory overclocks, core undervolts, and combinations of both. Here’s the full dataset:

Phase 1: Memory Overclock (stock voltage)

Config3DMark Scorevs StockPowerNotes
Stock8816434WBaseline
+1000 MHz mem8965+1.7%~400WStable
+1500 MHz mem9105+3.3%~400WBest score
+1750 MHz mem9056+2.7%~400WRegression — memory errors

Finding: Memory OC sweet spot is +1500 MHz. Going higher causes error correction that tanks performance.

Phase 2: Core Undervolt (stock memory)

Config3DMark Scorevs StockPowerNotes
1000mV @ 2850 MHz8555-3.0%~390WToo aggressive
975mV @ 2800 MHz8748-0.8%~370WStable but slower
950mV @ 2750 MHzCrashed
950mV @ 2730 MHz8699-1.3%~355WStable
925mV @ 2700 MHz8593-2.5%~340WMax efficiency

Finding: Pure undervolts trade performance for power savings. My chip is stable at 950mV but only up to ~2730 MHz. The 2750 MHz crash tells me where the silicon limit is.

Phase 3: Combo (undervolt + memory OC)

Config3DMark Scorevs StockPowerStability
950mV/2750 MHz + 1500 mem8917+1.1%~355W❌ Failed stress test
950mV/2750 MHz + 1750 mem8911+1.1%~383W❌ Failed stress test
950mV/2730 MHz + 1500 mem8948+1.5%394WPassed 20 loops

Finding: 2750 MHz looked fine in short benchmarks but failed extended stress testing. Backing off 20 MHz to 2730 MHz provided the stability margin needed.

Step 4: Stress Test the Finalists
#

Quick benchmarks lie. A config might score well once but crash under sustained load. I ran 20-loop Steel Nomad stress tests (~21 minutes each) on promising candidates:

ConfigStability %LoopsResult
950mV/2750 MHz + 1500 mem8/20❌ Crashed
950mV/2750 MHz + 1750 mem12/20❌ Crashed
950mV/2730 MHz + 1500 mem98.9%20/20Passed

Step 5: Analyze the Winner
#

Here’s the detailed comparison between stock and my final config:

MetricStock950mV/2730/+1500memDelta
3DMark Score88168948+1.5%
Reported Clock2553 MHz2714 MHz+161 MHz
Effective Clock2480 MHz2693 MHz+213 MHz
Power (high load)434W394W-40W
Clock Stretch2.87%0.77%-2.1%
Edge Temp~72°C~68°C-4°C
Hotspot Temp~85°C~81°C-4°C

The undervolted config runs higher effective clocks on 40 fewer watts with nearly 4x less clock stretch.

What This Tells Me
#

My card isn’t a lottery winner. Stock boost clocks (~2550 MHz reported) are below average for a 4090—most hit 2700-2800+. The 2.87% clock stretch confirms it: stock, this chip is power-limited and struggling.

But it responds well to undervolting. At 950mV, it stops fighting itself and runs clean. The silicon isn’t bad—NVIDIA’s default voltage curve just isn’t optimized for this specific chip.

My lottery standing: ~45th percentile. Below average stock performance, but recoverable with tuning.

My Daily Driver
#

950mV @ 2730 MHz core, +1500 MHz memory

vs StockBenefit
+1.5% performanceFaster in games
-40W powerLower electricity bill, less connector stress
-4°C coolerQuieter fans
0.77% stretchCard running clean

Not the highest-scoring config (that’s stock voltage +1500 mem at 9105), but the best balance of performance, efficiency, and thermals. For max FPS at any cost, I’d run stock +1500 mem. For daily use, the undervolt wins.

A Note on the 12VHPWR Connector
#

Remember the RTX 4090 melting connector fiasco? The 16-pin 12VHPWR connector (12 power pins + 4 sense pins) pushing 450W+ caused fires when connectors weren’t fully seated or wires fatigued over time. NVIDIA and connector manufacturers took the blame, but physics was the real culprit: that much power through such a compact connector leaves zero margin for error.

Undervolting helps. My config pulls ~394W instead of ~434W—that’s 40W less current stressing those pins. It won’t save a badly seated connector, but it reduces thermal stress on a component that’s already operating near its limits.

One more reason to tune your card: longevity. Less power, less heat, less stress on the weakest link in the system.

Your Card Is Waiting
#

Here’s what fascinates me: right now, in your PC, there’s a chip with characteristics no one has fully mapped. Your specific GPU has a voltage/frequency curve that’s never been perfectly optimized. It has thermal behaviors unique to its particular arrangement of imperfect transistors.

When you undervolt and overclock, you’re not following a guide—you’re running experiments on a one-of-a-kind piece of physics. The “stable” settings you find are yours. No one else’s card will behave exactly the same way.

Some people have golden samples that hit 2900+ MHz at 950mV. Some have duds that crash at stock. Most of us are somewhere in between, with chips that have quirks waiting to be discovered.

The Miracle We Ignore
#

We’ve gotten so used to GPUs that we forget what they are: billions of molecular-scale switches, built from refined sand, that simulate entire worlds in real-time. The fact that they work at all is absurd. The fact that they’re all slightly different is inevitable.

The silicon lottery isn’t a flaw in manufacturing. It’s a reminder that we’re working at the edge of what’s physically possible—where quantum mechanics meets engineering, and perfection is impossible.


This is part of a series documenting my RTX 4090 undervolt/overclock journey. The chip lottery gave me a ~45th percentile sample. Through testing, I found that 950mV @ 2730MHz with +1500MHz memory delivers better performance than stock while saving 40W. Your results will vary—that’s the whole point.