← All Research
2026-02-27 · SCIENCE & EXPERIMENTS

Benford's Law and the Fingerprints of Real Data

Testing 78,913 economic statistics from 215 countries against the first-digit law. What does the math say about the quality of global economic data?

Key Findings

Data Points
78,913
Countries
215
Indicators
12
Overall MAD
0.0043

What is Benford's Law?

In 1938, physicist Frank Benford noticed something strange: the first pages of logarithm tables were more worn than the last. He documented a pattern that appears across an astonishing variety of natural datasets — from river lengths to stock prices to city populations — the leading digit is not uniformly distributed. Instead, the digit 1 appears as the leading digit about 30% of the time, while 9 appears less than 5% of the time.

P(d) = log10(1 + 1/d)     for d = 1, 2, ..., 9

This gives us exact predictions: digit 1 should appear 30.1% of the time, digit 2 at 17.6%, digit 3 at 12.5%, and so on. The law emerges naturally whenever data spans multiple orders of magnitude and grows multiplicatively. It has been used for decades in forensic accounting and election fraud detection — fabricated numbers tend to have too-uniform digit distributions, or suspicious spikes at certain digits.

The question I wanted to answer: does the economic data reported to the World Bank by 215 countries follow Benford's Law? If so, how closely? And which countries or indicators deviate most — and why?

The Overall Picture: Close Conformity

The answer, pooling all data, is a resounding yes. Here's the observed first-digit distribution of 78,913 World Bank data points plotted against Benford's prediction:

Overall first-digit distribution vs Benford's Law
Figure 1. Observed vs expected first-digit frequencies across all World Bank data. The match is remarkably close.

The largest deviation is a slight excess of digits 2 and 3, with a corresponding deficit in digit 1. The overall Mean Absolute Deviation (MAD) is 0.0043, which falls firmly in the "close conformity" range defined by Nigrini's forensic accounting thresholds. The chi-squared test rejects the null hypothesis (p ≈ 0), but that's expected with 79,000 observations — even tiny deviations become statistically significant at that sample size. The MAD metric is more meaningful for assessing practical conformity.

Which Indicators Follow Benford Best?

Not all economic metrics are equally Benford-friendly. The theory predicts that data must span several orders of magnitude and arise from multiplicative processes. Absolute dollar figures (GDP, trade volumes) span from millions to trillions — perfect for Benford. Percentages, by contrast, are bounded between 0 and 100, typically clustering in a narrow range.

MAD scores by indicator
Figure 2. Benford conformity by economic indicator. Green = close conformity, cyan = acceptable, yellow = marginal, red = nonconformity.

The standout finding: "Revenue excluding grants (% of GDP)" shows extreme nonconformity (MAD = 0.062). This isn't fraud — it's math. Government revenue typically ranges from 10% to 50% of GDP, meaning leading digits cluster around 1-4. The data doesn't span enough orders of magnitude for Benford to apply. This is an important reminder: Benford violations don't automatically imply manipulation.

Indicator Conformity Table

Indicator MAD Conformity N
GDP (current US$)0.0020Close10,711
Exports of goods and services (current US$)0.0027Close8,880
Imports of goods and services (current US$)0.0031Close8,879
Government expenditure (current LCU)0.0040Close4,462
Gross capital formation (current US$)0.0043Close8,444
GDP per capita (current US$)0.0062Acceptable11,699
Population total0.0073Acceptable14,205
GDP PPP (current international $)0.0073Acceptable6,901
Foreign direct investment net inflows (BoP, current US$)0.0227Nonconformity110
Revenue excluding grants (% of GDP)0.0616Nonconformity4,618

The Country-Level View

When we break the data down by country, the picture gets more nuanced. Each country has between 50 and 550 data points (pooling across all indicators and years). With smaller samples, deviations from Benford are expected even in perfectly clean data.

Country scatter plot: MAD vs sample size
Figure 3. Each dot is one country. Small territories with few data points naturally show high deviation. Countries with many data points tend to converge toward Benford's prediction.

The scatter plot reveals a clear pattern: deviation scales inversely with sample size. Countries like Haiti (MAD = 0.007, n = 341) and South Korea (MAD = 0.010, n = 529) conform well because they have large, diverse datasets. Gibraltar (MAD = 0.155, n = 65) deviates wildly because a micro-territory with 65 data points simply can't generate a smooth Benford distribution.

Histogram of country MAD scores
Figure 4. Distribution of MAD scores across all countries with ≥100 data points. Most cluster in the 0.02–0.05 range.

The Most Interesting Deviations

After filtering out micro-territories (which deviate for statistical reasons), several patterns emerge among the most deviant countries:

Digit distributions for anomalous countries
Figure 5. Digit distributions for the six most deviant countries with ≥200 data points. The cyan bars show Benford's prediction; the red bars show observed frequencies.

Post-Soviet and transition economies are overrepresented among the deviants: Poland, Latvia, Lithuania, Moldova, Ukraine, and Bosnia and Herzegovina all appear in the top 30. These countries underwent currency redenominations, periods of hyperinflation, and fundamental economic restructuring in the 1990s. When your currency goes from thousands of units to single units overnight, the leading-digit distribution of your economic statistics gets disrupted in ways that take years to wash out.

North Korea (MAD = 0.116) is a genuinely interesting case. With extremely limited data (n = 65), most of which consists of population estimates and sparse GDP guesses, the deviation may reflect both the small sample and the fact that outsiders are essentially guessing at North Korean economic statistics.

Best-Conforming Countries

Digit distributions for best-conforming countries
Figure 6. Digit distributions for the six best-conforming countries with ≥200 data points. The match to Benford's prediction is nearly perfect.

The best conformers tend to be countries with long, stable statistical histories and large, diverse economies: South Korea, Thailand, Nicaragua, Honduras. Haiti — the single best conformer at MAD = 0.007 — is a genuinely surprising result that warrants closer examination. (It may reflect the diversity of its data: spanning decades of economic volatility creates exactly the kind of multi-order-of-magnitude dataset that Benford loves.)

Does Data Quality Change Over Time?

Benford conformity by decade
Figure 7. MAD score by decade. All decades fall within acceptable conformity.

One might expect that economic data quality has improved over time as statistical agencies modernized. The temporal analysis tells a subtler story: conformity has been remarkably stable across six decades. The 1970s show the best conformity (MAD = 0.004), while the 2020s show slight degradation (MAD = 0.008) — possibly because the 2020s only include four years of data so far, or because COVID-era economic disruptions created unusual distributions.

The key takeaway: there's no evidence of systematic improvement or deterioration in the quality of data reported to the World Bank over the past 65 years. The data has been consistently Benford-conforming throughout.

The Second Digit

Benford's Law also predicts the distribution of the second digit (0–9). The second-digit test is more sensitive to certain types of manipulation because fabricators who know about Benford's first-digit law may still get the second digit wrong.

Second-digit distribution
Figure 8. Second-digit distribution. The conformity is excellent (chi-squared p = 0.014), with deviations well within normal range.

The second-digit conformity is strong, with all deviations under 0.3 percentage points. This is consistent with real, unmanipulated data. A fabricator sophisticated enough to match the first-digit distribution but careless about the second digit would show up here — and doesn't.

Explore the Full Dataset

Search and sort the complete country rankings below. MAD (Mean Absolute Deviation) is the primary conformity metric; lower values indicate closer conformity to Benford's Law.

Country Code MAD Conformity Chi² p N
Isle of ManIMN0.0971Nonconformity< 1e-300143
Channel IslandsCHI0.0895Nonconformity2.0e-15115
American SamoaASM0.0847Nonconformity< 1e-300149
DominicaDMA0.0816Nonconformity< 1e-300196
CuracaoCUW0.0771Nonconformity< 1e-300176
BarbadosBRB0.0664Nonconformity< 1e-300258
Virgin Islands (U.S.)VIR0.0660Nonconformity4.4e-10149
GrenadaGRD0.0620Nonconformity< 1e-300196
St. LuciaLCA0.0616Nonconformity2.1e-14226
Antigua and BarbudaATG0.0603Nonconformity2.2e-16214
MonacoMCO0.0601Nonconformity2.5e-10175
PalauPLW0.0587Nonconformity< 1e-300290
PolandPOL0.0556Nonconformity< 1e-300325
Cayman IslandsCYM0.0553Nonconformity3.6e-06127
Bosnia and HerzegovinaBIH0.0541Nonconformity< 1e-300301
French PolynesiaPYF0.0541Nonconformity3.9e-14265
MontenegroMNE0.0537Nonconformity< 1e-300224
LatviaLVA0.0534Nonconformity< 1e-300310
Sint Maarten (Dutch part)SXM0.0529Nonconformity3.8e-11113
LithuaniaLTU0.0527Nonconformity< 1e-300314
MoldovaMDA0.0512Nonconformity< 1e-300315
GuineaGIN0.0509Nonconformity1.1e-14335
UkraineUKR0.0491Nonconformity< 1e-300295
JamaicaJAM0.0487Nonconformity< 1e-300296
KosovoXKX0.0477Nonconformity9.8e-07167
GuyanaGUY0.0473Nonconformity< 1e-300368
SerbiaSRB0.0472Nonconformity< 1e-300277
AndorraAND0.0454Nonconformity3.2e-08216
TuvaluTUV0.0451Nonconformity7.2e-10153
United KingdomGBR0.0438Nonconformity2.2e-16435
San MarinoSMR0.0438Nonconformity4.5e-08218
MaltaMLT0.0434Nonconformity< 1e-300477
BermudaBMU0.0430Nonconformity4.6e-13275
BulgariaBGR0.0427Nonconformity< 1e-300393
CroatiaHRV0.0427Nonconformity< 1e-300326
DjiboutiDJI0.0420Nonconformity5.6e-06191
AlbaniaALB0.0418Nonconformity6.7e-12365
ItalyITA0.0415Nonconformity< 1e-300487
HungaryHUN0.0414Nonconformity6.2e-12400
UruguayURY0.0411Nonconformity1.2e-14465
Northern Mariana IslandsMNP0.0410Nonconformity1.6e-04149
CzechiaCZE0.0402Nonconformity3.1e-10335
DenmarkDNK0.0400Nonconformity< 1e-300511
ArmeniaARM0.0393Nonconformity1.3e-09316
West Bank and GazaPSE0.0393Nonconformity1.6e-05207
GuamGUM0.0389Nonconformity1.2e-02149
SpainESP0.0386Nonconformity< 1e-300481
Hong Kong SAR, ChinaHKG0.0383Nonconformity1.2e-12423
EstoniaEST0.0381Nonconformity1.8e-07323
Sao Tome and PrincipeSTP0.0376Nonconformity2.6e-06210
NorwayNOR0.0373Nonconformity1.1e-16499
South SudanSSD0.0372Nonconformity7.7e-03113
SwedenSWE0.0371Nonconformity1.1e-16528
St. Kitts and NevisKNA0.0368Nonconformity1.3e-15278
Cabo VerdeCPV0.0360Nonconformity6.8e-07330
BrazilBRA0.0355Nonconformity5.8e-12454
SurinameSUR0.0355Nonconformity2.4e-04245
New ZealandNZL0.0354Nonconformity5.2e-12474
BelgiumBEL0.0351Nonconformity6.8e-13451
KazakhstanKAZ0.0351Nonconformity2.2e-06313
UzbekistanUZB0.0349Nonconformity1.1e-07253
GreenlandGRL0.0347Nonconformity5.9e-12270
United Arab EmiratesARE0.0341Nonconformity4.4e-04257
CambodiaKHM0.0340Nonconformity1.7e-07373
LiechtensteinLIE0.0337Nonconformity1.1e-03173
GuatemalaGTM0.0335Nonconformity4.8e-09493
AustriaAUT0.0331Nonconformity1.7e-11499
Central African RepublicCAF0.0327Nonconformity2.7e-07453
ChinaCHN0.0327Nonconformity1.6e-07444
Slovak RepublicSVK0.0327Nonconformity2.8e-10332
Africa Eastern and SouthernAFE0.0325Nonconformity8.9e-08420
IcelandISL0.0324Nonconformity4.0e-10499
Faroe IslandsFRO0.0316Nonconformity1.5e-12349
PortugalPRT0.0313Nonconformity3.2e-07488
United StatesUSA0.0312Nonconformity4.6e-07435
JapanJPN0.0311Nonconformity2.4e-12446
St. Vincent and the GrenadinesVCT0.0310Nonconformity1.7e-03276
North MacedoniaMKD0.0308Nonconformity5.6e-04313
Bahamas, TheBHS0.0307Nonconformity2.0e-09439
Kyrgyz RepublicKGZ0.0307Nonconformity1.2e-03296
NauruNRU0.0307Nonconformity4.9e-05258
Russian FederationRUS0.0307Nonconformity8.4e-06339
FinlandFIN0.0305Nonconformity7.9e-09499
GeorgiaGEO0.0304Nonconformity2.0e-06349
Brunei DarussalamBRN0.0303Nonconformity1.3e-06348
IrelandIRL0.0303Nonconformity2.0e-08497
GermanyDEU0.0294Nonconformity1.9e-10498
ZambiaZMB0.0293Nonconformity3.9e-04296
BoliviaBOL0.0292Nonconformity6.7e-08460
New CaledoniaNCL0.0292Nonconformity2.2e-04289
LuxembourgLUX0.0291Nonconformity8.2e-09499
NepalNPL0.0291Nonconformity7.1e-06456
MoroccoMAR0.0289Nonconformity1.5e-06487
MauritiusMUS0.0289Nonconformity1.8e-06480
Timor-LesteTLS0.0288Nonconformity1.1e-04271
Venezuela, RBVEN0.0285Nonconformity9.7e-06347
ArubaABW0.0279Nonconformity2.8e-04268
IsraelISR0.0278Nonconformity8.9e-09499
CameroonCMR0.0277Nonconformity1.9e-06446
MaldivesMDV0.0276Nonconformity3.3e-06313
Syrian Arab RepublicSYR0.0273Nonconformity4.5e-03333
LibyaLBY0.0271Nonconformity4.6e-04335
SloveniaSVN0.0271Nonconformity1.0e-03325
South AfricaZAF0.0268Nonconformity5.6e-06528
BelarusBLR0.0266Nonconformity3.5e-14339
EritreaERI0.0264Nonconformity2.3e-02185
ColombiaCOL0.0260Nonconformity1.2e-04466
MalaysiaMYS0.0258Nonconformity1.8e-06481
Africa Western and CentralAFW0.0257Nonconformity1.8e-03289
ChileCHL0.0255Nonconformity1.5e-05529
Dominican RepublicDOM0.0254Nonconformity4.1e-05529
FranceFRA0.0253Nonconformity3.0e-06529
El SalvadorSLV0.0253Nonconformity5.1e-04453
KenyaKEN0.0251Nonconformity6.1e-04441
TogoTGO0.0249Nonconformity1.5e-04450
LiberiaLBR0.0247Nonconformity4.6e-02232
QatarQAT0.0247Nonconformity4.4e-03297
Yemen, Rep.YEM0.0243Nonconformity4.3e-02185
Marshall IslandsMHL0.0242Nonconformity3.1e-03317
SingaporeSGP0.0242Nonconformity6.1e-05530
Turks and Caicos IslandsTCA0.0242Nonconformity6.0e-01107
Viet NamVNM0.0238Nonconformity3.9e-02248
GhanaGHA0.0234Nonconformity5.8e-05465
IndonesiaIDN0.0234Nonconformity7.4e-04457
AustraliaAUS0.0233Nonconformity4.7e-07527
MaliMLI0.0232Nonconformity2.8e-03432
SudanSDN0.0229Nonconformity3.3e-03433
TurkmenistanTKM0.0229Nonconformity6.6e-02229
CyprusCYP0.0228Nonconformity4.8e-05451
Sri LankaLKA0.0225Nonconformity6.4e-04478
AzerbaijanAZE0.0224Nonconformity2.4e-03318
MalawiMWI0.0224Nonconformity1.9e-01239
SwitzerlandCHE0.0223Nonconformity6.8e-05489
Saudi ArabiaSAU0.0223Nonconformity3.5e-03430
EcuadorECU0.0221Nonconformity2.1e-04445
KiribatiKIR0.0220Nonconformity9.8e-03358
MyanmarMMR0.0217Nonconformity1.4e-02282
SenegalSEN0.0217Nonconformity1.2e-02438
NigerNER0.0216Nonconformity6.1e-03435
RomaniaROU0.0216Nonconformity1.8e-03366
TajikistanTJK0.0215Nonconformity1.5e-03281
AfghanistanAFG0.0213Nonconformity1.9e-01230
SeychellesSYC0.0213Nonconformity7.6e-04411
Macao SAR, ChinaMAC0.0210Nonconformity2.1e-02371
KuwaitKWT0.0206Nonconformity4.1e-03456
NigeriaNGA0.0206Nonconformity3.5e-03232
TunisiaTUN0.0202Nonconformity1.4e-02436
LebanonLBN0.0201Nonconformity4.7e-02324
PhilippinesPHL0.0201Nonconformity6.7e-03424
UgandaUGA0.0200Nonconformity4.3e-02378
ComorosCOM0.0199Nonconformity4.4e-02325
EswatiniSWZ0.0199Nonconformity1.9e-01310
BahrainBHR0.0197Nonconformity4.5e-02407
MexicoMEX0.0195Nonconformity2.1e-02514
OmanOMN0.0194Nonconformity3.6e-02402
ChadTCD0.0194Nonconformity3.8e-02422
Micronesia, Fed. Sts.FSM0.0193Nonconformity5.7e-02299
RwandaRWA0.0193Nonconformity3.2e-02460
Papua New GuineaPNG0.0192Nonconformity2.3e-03409
PanamaPAN0.0191Nonconformity1.0e-02441
Cote d'IvoireCIV0.0187Nonconformity3.9e-02472
Egypt, Arab Rep.EGY0.0181Nonconformity1.7e-02497
EthiopiaETH0.0180Nonconformity8.4e-02339
GabonGAB0.0180Nonconformity7.3e-02435
AlgeriaDZA0.0179Nonconformity3.4e-02425
VanuatuVUT0.0178Nonconformity1.2e-01314
AngolaAGO0.0177Nonconformity6.5e-02306
Costa RicaCRI0.0177Nonconformity1.8e-02528
CanadaCAN0.0176Nonconformity1.2e-03491
IndiaIND0.0176Nonconformity3.1e-02517
BangladeshBGD0.0175Nonconformity1.3e-02467
FijiFJI0.0172Nonconformity2.0e-01284
Puerto Rico (US)PRI0.0171Nonconformity3.0e-03410
SamoaWSM0.0171Nonconformity2.9e-01296
Lao PDRLAO0.0169Nonconformity6.7e-02302
ArgentinaARG0.0168Nonconformity2.3e-02484
NamibiaNAM0.0167Nonconformity5.4e-02386
Congo, Dem. Rep.COD0.0166Nonconformity3.0e-01387
PakistanPAK0.0166Nonconformity5.1e-02429
BeninBEN0.0164Nonconformity1.3e-01428
Guinea-BissauGNB0.0164Nonconformity2.5e-02389
JordanJOR0.0156Nonconformity9.2e-02432
MadagascarMDG0.0155Nonconformity1.6e-02473
CubaCUB0.0153Nonconformity3.7e-02320
GreeceGRC0.0153Nonconformity2.2e-02521
ParaguayPRY0.0152Nonconformity5.4e-02458
BelizeBLZ0.0151Nonconformity2.5e-01421
Congo, Rep.COG0.0147Marginal2.0e-01467
Gambia, TheGMB0.0146Marginal3.1e-01397
NetherlandsNLD0.0145Marginal2.1e-01500
BhutanBTN0.0144Marginal1.8e-02414
IraqIRQ0.0144Marginal3.3e-01406
Equatorial GuineaGNQ0.0143Marginal3.1e-01316
LesothoLSO0.0143Marginal4.8e-01421
TongaTON0.0143Marginal9.5e-02366
MauritaniaMRT0.0140Marginal4.0e-01420
Burkina FasoBFA0.0139Marginal1.7e-01469
Trinidad and TobagoTTO0.0138Marginal4.1e-01298
BotswanaBWA0.0135Marginal7.3e-02477
MongoliaMNG0.0135Marginal3.3e-01381
Somalia, Fed. Rep.SOM0.0135Marginal5.9e-01374
PeruPER0.0131Marginal1.7e-01525
Iran, Islamic Rep.IRN0.0128Marginal2.2e-01501
MozambiqueMOZ0.0124Marginal5.4e-01295
Solomon IslandsSLB0.0124Marginal4.3e-01373
ZimbabweZWE0.0123Marginal6.0e-01346
TurkiyeTUR0.0121Marginal6.5e-01445
TanzaniaTZA0.0120Acceptable6.4e-01367
HondurasHND0.0114Acceptable3.5e-01468
ThailandTHA0.0114Acceptable1.8e-01530
NicaraguaNIC0.0109Acceptable2.7e-01493
Korea, Rep.KOR0.0096Acceptable2.3e-01529
Sierra LeoneSLE0.0095Acceptable6.5e-01397
BurundiBDI0.0094Acceptable8.2e-01270
HaitiHTI0.0073Acceptable8.2e-01341

Methodology

Data source: World Bank Open Data API (api.worldbank.org), accessed February 27, 2026. 12 indicators for all countries, 1960–2024.

Indicators: GDP (current USD), GDP PPP, population, exports, imports, government expenditure (LCU), GDP per capita, FDI inflows, gross capital formation, unemployment rate, and government revenue (% of GDP).

Exclusions: World Bank aggregate regions (e.g., "Sub-Saharan Africa", "High income") were excluded, leaving 215 individual countries and territories.

Statistical tests:

Minimum sample size: 50 observations required for any analysis; 100 for inclusion in the country rankings table.

Software: Python 3.12, NumPy, SciPy, Matplotlib. All code available in the project repository.

What This Does (and Doesn't) Tell Us

Benford's Law is a powerful forensic tool, but it's not a lie detector. A dataset that conforms to Benford is consistent with being real, but conformity can be manufactured. More importantly, nonconformity does not prove fraud. This analysis reveals several perfectly innocent explanations for deviation:

  1. Bounded data. Percentages, ratios, and indices that don't span multiple orders of magnitude will always violate Benford, no matter how accurate they are.
  2. Small samples. Any country with fewer than ~200 data points will show noisy digit distributions simply due to random variation.
  3. Structural breaks. Currency redenominations, hyperinflation, and economic transitions disrupt the natural digit distribution.
  4. Narrow-range variables. Population data for a country that's been ~5 million for 65 years will be dominated by digit 5.

The strongest signal in this analysis is not about any individual country's data quality — it's the remarkable conformity of the pooled global dataset. Seventy-nine thousand economic measurements, reported by 215 different national statistical agencies over 65 years, match the mathematical prediction to within a fraction of a percentage point. That's either evidence of a massive, coordinated global conspiracy, or — more likely — it's evidence that the World Bank's economic data is, by and large, real.