The stack, not the magic

Four cuts on the technology most often sold to ordinary people as productivity and most often used by the largest capital holders to compress labor costs. Numbers below are from BLS, Challenger Gray, Anthropic, Epoch AI, Stanford HAI, EPRI, and Lawrence Berkeley National Lab. Speculation about what AI will do in 2030 is excluded; measurable facts about what it does today are included.

Frontier training-run costs, log scale

From $100 (original Transformer, 2017) to $1B (projected 2026 frontier), a 10-million-fold increase in under a decade. The y-axis is logarithmic, so each gridline is a 100× step. The line stays straight, which is the point: compute has scaled at a roughly constant exponential rate. Hover a point for model-level detail.

$100$10K$1M$100M$1.0B20172019202120232025
Source: Epoch AI + company disclosures. All figures are estimates; most frontier labs do not publish training costs.

Displacement

4 items

What is hitting labor now. Anthropic CEO publicly predicts 50% of entry-level white-collar jobs disappear in 1-5 years. Only ~4.5% of 2025 layoffs are AI-linked per Challenger Gray, suggesting most AI-attributed firings are cover stories for cuts that would have happened anyway. The real effect is slower entry-level hiring, which does not show up in unemployment stats but narrows the pipeline into senior roles.

50%
contested · May 2025
Anthropic CEO warning on entry-level white-collar

Dario Amodei (Anthropic CEO) told Axios in May 2025 that AI could eliminate roughly 50% of entry-level white-collar jobs within 1-5 years and push unemployment to 10-20%. It is the starkest public prediction from a frontier-lab executive with direct commercial incentive to be optimistic. Whether the forecast is right or directional, the fact that the CEO of an AI company is making it is itself a data point about where the builders think this lands.

4.5%
solid · 2025
Share of 2025 US layoffs that were AI-linked

Challenger, Gray & Christmas: AI-attributed layoffs accounted for ~4.5% of total 2025 US layoffs. An HBR January 2026 survey of 1,006 executives found most AI-linked layoffs were "in anticipation of" rather than "in response to" measurable AI-driven productivity gains. The gap between stated rationale (AI) and measured productivity (limited) is the "AI washing" phenomenon: layoffs that were going to happen anyway, given an AI-shaped cover story.

solid · 2025
Entry-level white-collar job-posting declines

Job postings for entry-level writing and editing roles fell sharply after ChatGPT launched in late 2022. Junior software-engineer postings have grown far slower than overall tech employment. Customer-service rep postings declined in industries that deployed chatbots aggressively. Aggregate unemployment for college-educated workers stayed low; the displacement is showing up as slower entry-level hiring and elevated output expectations per remaining employee, not mass firings. The pipeline into senior roles is narrowing.

30%
contested · 2023-25
US occupations with substantial LLM exposure

Goldman Sachs / OpenAI estimated ~30% of US occupations have tasks substantially exposed to LLM automation; legal, finance, software engineering, and administrative support rank highest. Acemoglu (MIT) and other labor economists have produced lower estimates (5-10% realistic impact over a decade), but all credible estimates are non-zero and concentrated in the same white-collar mid-skill band where wage premiums have historically been highest.

Your job, scored

LLM-exposure estimates synthesized from Eloundou et al. (OpenAI/Penn 2023), Goldman Sachs (Hatzius 2023), and Acemoglu (NBER 2024). Numbers are median across studies; "exposure" = share of an occupation's tasks where an LLM materially substitutes for or speeds up the worker. Not the same thing as job loss.

Sources: Eloundou et al. (arxiv 2303.10130), Hatzius et al. (Goldman Sachs), Acemoglu (NBER w32487). Curated subset; full SOC mapping covers 1,000+ occupations.

Concentration

4 items

Who owns the stack. NVIDIA takes ~80% of AI GPU revenue. TSMC makes ~90% of leading-edge chips. Four frontier-model labs (OpenAI, Anthropic, Google DeepMind, xAI/Meta) produce essentially all state-of-the-art releases. AWS + Azure + GCP take ~65% of cloud spend. AI at scale is functionally a narrower oligopoly than the consumer internet it is built on top of.

~80%
solid · 2024
NVIDIA share of AI GPU market

NVIDIA holds approximately 80% of the AI accelerator market by revenue (higher by training-specific measures, lower for inference where alternatives like Google TPU and AWS Trainium take share). The company's data-center segment revenue grew from ~$15B in FY23 to >$100B in FY25. NVIDIA's CUDA software lock-in is the most durable piece of the moat; hardware alternatives exist but porting costs mean even well-funded buyers (Meta, Microsoft) run mostly NVIDIA.

~90%
solid · 2024
TSMC share of leading-edge chip fabrication

TSMC manufactures roughly 90% of the world's leading-edge semiconductors (≤5nm nodes), including essentially all advanced AI chips (NVIDIA H100/H200/B100, Apple Silicon, AMD MI300). Samsung has a minority share; Intel 18A is not yet at competitive volumes. The Taiwan-strait geopolitical concentration of the most strategically critical chip supply chain is unmatched in any industry.

4 labs
solid · 2024-25
Frontier-model capability concentration

OpenAI, Anthropic, Google DeepMind, and (distant fourth) xAI + Meta together account for essentially all frontier-model releases as of 2025. Training a frontier model now costs $100M-$1B+ and requires access to tens of thousands of H100-class GPUs, which NVIDIA allocates by relationship. Open-weight models (Llama, Mistral, Qwen) lag the frontier by 6-12 months on benchmarks. The 4-lab concentration replicates the pre-antitrust Big Tech shape at a higher capital intensity.

~65%
solid · Q4 2024
Cloud spend at AWS + Azure + GCP

Synergy Research: AWS (~32%), Microsoft Azure (~22%), and Google Cloud (~11%) together take ~65% of global cloud infrastructure spend. AI workloads intensify the concentration because training and serving frontier models requires data-center footprints only the top three can deliver at scale. OpenAI runs on Azure; Anthropic on AWS + GCP; Meta and Google run their own. The 'AI economy' is functionally the hyperscaler economy with a new product layer on top.

The physical bill

8 items

US data-center electricity is projected to reach 9-12% of national total by 2028, up from ~4% in 2023, nearly all AI. Frontier training runs cost $100M-$1B each; per-query inference uses ~30× the energy of a web search; large AI data centers consume millions of gallons of cooling water per day. The abstract 'intelligence' layer has a very concrete power grid, semiconductor fab, and water table underneath it.

9-12%
contested · 2024
US data-center share of electricity by 2028

EPRI and Lawrence Berkeley National Lab: US data-center electricity consumption is projected to reach 9-12% of national total by 2028, up from ~4% in 2023. The growth is almost entirely AI training and inference. Utilities are deferring coal retirements and signing multi-billion-dollar long-term contracts with hyperscalers; in some grids (Northern Virginia, Georgia, Ohio) residential rates are rising to cover infrastructure build-outs driven by data-center demand.

Source: EPRI / LBNL
$1B+
contested · 2024-25
Frontier model training cost per run

Epoch AI: estimated training costs for frontier models passed $100M with GPT-4 (2023), crossed $500M with Gemini Ultra and GPT-5-class models (2024), and are projected to reach $1B+ per training run by 2026. The cost is dominated by GPU-hours; a single training run uses 20,000-100,000 H100-equivalent GPUs for months. Doubling cost per generation cannot continue indefinitely; the scaling-law extrapolation that justifies the spend has weaker empirical footing past ~2× current frontier.

Source: Epoch AI
10 Wh
contested · 2023-24
Energy per LLM query vs ~0.3 Wh for web search

Independent benchmarks: a typical LLM chat response consumes roughly 10 Wh of end-to-end compute energy, compared to ~0.3 Wh for a Google web search. For large models and long responses the gap is 50-100×. At the hundreds-of-millions-of-daily-queries volume ChatGPT and competitors now serve, the aggregate electricity and water-cooling demand is meaningful at regional grid scale.

4.5M
contested · 2023-24
Gallons of water per AI data center per day

Northern Virginia and Phoenix-area AI data centers report water consumption (mostly evaporative cooling) of 3-6 million gallons per facility per day during peak summer operation. Microsoft's 2023 sustainability report disclosed 34% year-over-year global water-withdrawal growth tied to AI workloads. Water-stressed regions (AZ, NV, TX) are seeing local disputes as data-center siting competes with residential and agricultural use.

solid · 2025
PJM capacity-auction clearing price, 2025/26 vs prior year

PJM Interconnection's 2025/26 capacity auction cleared at $269.92/MW-day, up from $28.92/MW-day the prior year, a 9× jump driven primarily by data-center load growth colliding with deferred coal retirements. PJM's auction pays generators a fixed amount for being available to meet peak demand; the cost flows through to retail bills across the grid's 13-state, 65-million-person footprint (Mid-Atlantic + much of the Midwest). The aggregate uplift on the 2025/26 auction is roughly $14.7 billion. Monitoring Analytics, PJM's independent market monitor, attributed the spike to capacity scarcity created by AI-driven demand growth outpacing new generation interconnection.

~3.6 GW
solid · 2023-25
Coal-plant retirements deferred in PJM since 2023

PJM has approved or accommodated retirement deferrals on roughly 3.6 GW of coal capacity since 2023, with hyperscaler load growth cited as the proximate cause. Indianapolis Light & Power, AEP Ohio, and Dominion all pushed planned coal exits into the 2030s. The deferrals lock in emissions that the affected utilities had previously committed to retire and complicate state-level decarbonization targets. The trade is explicit: keep coal running so that the AI capex cycle has firm-power headroom.

14 states
solid · Apr 2026
Considering data-center moratoriums as of April 2026

Sherwood News (Apr 29 2026) reports 14 states are actively considering moratoriums on new data-center builds, with residential electricity prices projected to rise 15-40% over the next 5 years across affected grids. Local pushback has run ahead of federal policy: Indianapolis IN extended its 2024 moratorium through 2025; Prince William County VA paused approvals on the 23,000-acre 'Digital Gateway' corridor and Fairfax, Loudoun, and Stafford counties tightened zoning rules; Atlanta-metro suburbs Dunwoody, Peachtree Corners, and Norcross GA passed full moratoriums while Marietta GA imposed a study pause; Tucson AZ effectively blocked water-intensive builds via a 2024 water ordinance, with Chandler and Mesa AZ raising water-use thresholds; Memphis TN has open public-health complaints over xAI's Colossus gas-turbine emissions. State consumer advocates in MD, NJ, and OH have filed FERC complaints arguing the PJM auction cost-shift to residential ratepayers is unjust. The externality is land + water + grid, all governed by the city, county, and state authorities federal AI policy doesn't touch.

15-40%
solid · Apr 2026
Projected US residential electricity-bill increase over 5 years

Sherwood News reporting (Apr 29 2026) cites projected 15-40% residential electricity price increases over the next 5 years driven by data-center load growth. Hyperscalers (Alphabet, Amazon, Meta, Microsoft) are spending >$650B on data centers in 2026 alone; data-center electricity demand is expected to double by 2030 to roughly the combined draw of France + Germany. ~2,300 GW of generation and storage is stuck in interconnection queues, more than the entire installed US power-grid capacity, so most of the new demand has to be served by either deferred fossil retirements or capacity-auction premiums. Both routes flow through to retail bills.

What your LLM use draws from the grid

A typical LLM chat response consumes roughly 10 Wh of compute energy and ~0.5 L of cooling water end-to-end (de Vries, Joule 2023; Berkeley Lab). For comparison, a Google web search uses about 0.3 Wh, so LLM queries are ~30× more energy-intensive. Move the slider to your typical daily query count.

Your LLM queries per day 20 queries
0 queries50 queries100 queries250 queries500 queries
Annual energy
73.0 kWh
73.0K Wh / yr
Annual water
964 gal
3,650 L
≈ Google searches
243.3K
same energy
≈ EV miles
252 mi
avg US BEV
≈ Phone charges
5,615
full charges
≈ Hot showers
12.2
10-min, electric heat
For scale

ChatGPT alone serves ~1 billion queries/day. At 10 Wh each that is ~10 GWh/day, equivalent to the output of a mid-size gas plant, served by a fleet of data centers that mostly didn't exist 4 years ago.

Sources: de Vries (Joule 2023), Berkeley Lab cooling-water studies, EPA EV efficiency data, Google sustainability disclosures.

Where the pushback is happening

14 US states are actively considering moratoriums or restrictions on new data-center development as of April 2026. The cases cluster on four externalities: water, grid capacity, residential bill cost-shift, and emissions / public health. Click a state to see the specific jurisdictions.

Water Grid Bills Emissions

Virginia

Grid Bills
Prince William, Fairfax, Loudoun, Stafford counties

Prince William County paused approvals on the 23,000-acre "Digital Gateway" corridor. Fairfax, Loudoun, and Stafford counties tightened zoning and design rules. Dominion Energy has projected 6-7× growth in data-center demand by 2039 with residential rate increases approved 2024 + 2025.

Sources: Sherwood News (Apr 29 2026), Data Center Watch, county and PSC filings, FERC docket entries.

If scaling laws hold, what's the next generation cost?

Today's frontier training run: ~$1B, ~100K H100-equivalent GPUs for ~90 days, ~100 GWh of electricity (Epoch AI). The scaling-laws thesis says compute must grow ~10× per generation to keep capability gains. Move the slider to see what the next pretrain looks like at that pace, and where it crashes into physical or capital limits.

Compute scale factor vs today's frontier
10×100×1000×
Training cost
$2.00B
per training run
GPUs needed
200.0K
H100-equivalent
Electricity
200 GWh
per training run
Gas-plant equivalent
20 days
@ ~10 GWh/day plant
Baseline data: Epoch AI frontier-compute tracker, NVIDIA 10-K, EIA electricity totals. Numbers are order-of-magnitude.

Hype vs measurement

4 items

Legal LLMs hallucinate on 58-82% of queries per Stanford. Unsupervised LLM news pipelines have ~40% factual-error rates. Clinical AI tools have documented race-and-sex bias. Fewer than 3% of enterprises report measurable AI ROI despite record spending. None of this means AI doesn't work; it means the gap between what vendors claim and what is measurable is the single largest driver of near-term backlash.

58-82%
solid · 2024
Legal queries where LLMs hallucinate

Stanford HAI 2024 study "Hallucinating Law": tested GPT-4, Claude, and PaLM on ~200,000 legal queries. Hallucination rates ranged from 58% to 82% depending on model and query type, even for purpose-built legal assistants. The authors concluded that "current legal LLMs are substantially incapable of serving as reliable research assistants." Courts have sanctioned multiple law firms for filings that contained fabricated case citations.

~40%
contested · 2024-25
Of "AI-generated" news passes factual checks

NewsGuard and independent fact-checking groups: approximately 40% of news content produced by unsupervised LLM pipelines (auto-summarization, auto-rewrites) contains factual errors or unverifiable claims. The rate is higher for time-sensitive content (financial news, politics). The error rate has not meaningfully improved over 2023-25 despite model upgrades; the failure modes shift but the aggregate unreliable-output rate is roughly stable.

Source: NewsGuard
Biased
solid · 2019-24
Race- and sex-correlated errors in clinical AI

Obermeyer et al. (Science 2019) showed that widely-deployed clinical risk-score tools systematically under-prioritized Black patients for care. Follow-up audits through 2024 have found similar patterns in FDA-cleared AI diagnostic tools: training data skewed toward specific demographics produces quieter failures at worse rates for under-represented populations. The FDA's 2024 AI/ML guidance requires reporting of demographic performance gaps; compliance and auditing remain uneven.

<3%
contested · 2024-25
Firms reporting measurable AI ROI

Independent reviews: fewer than 3% of US firms have publicly reported measurable ROI from generative-AI deployments as of 2025; most cite productivity improvements without quantified bottom-line impact. MIT's Sloan Management Review 2024 survey: ~75% of enterprises report ongoing AI pilots, ~15% report production deployments, and a minority report measurable efficiency gains. The gap between investment levels (record enterprise spending) and realized returns is structurally similar to the dot-com era 1998-2000.