Seven estimators, one wage panel: why the union premium triples
Nagoya University (GSID)
June 11, 2026
Act I
A simple regression says joining a union raises wages by 7.5%. Compare each worker to themselves across years and the number jumps to 21%.
One dataset, two stories. Which one do you report?
Six panel estimators with 95% CIs. POLS / Between / RE cluster near 0.07–0.11; FDFE / FE / CRE cluster near 0.21. The Hausman test is annotated.
Act II
With \(T = 2\), every worker contributes exactly two rows — the cleanest setting to see that first-differences and the within estimator are the same thing.
Answers: “union vs non-union workers?”
Answers: “same worker, switched status?”
Hausman and the Mundlak term are the formal tests for choosing between the two camps.
Between vs within variance shares for the four key variables. Union is 93.9% between; schooling is 100% between (zero within).
Log-wage trajectories for 30 sampled workers. Teal lines change union status; orange/blue lines never do.
Highly significant (\(t \approx 3.25\)) — and almost certainly biased if ability selects out of union jobs.
\[y_{i,2012} - y_{i,2010} = \beta\,(x_{i,2012} - x_{i,2010}) + (u_{i,2012} - u_{i,2010})\]
The worker-specific effect \(\alpha_i\) — ability, schooling, gender — cancels in the subtraction. What is left is identified only by workers who changed union status.
FDFE \(= 0.2113\) (SE 0.079); the SE is \(3.4\times\) larger than POLS — the switcher-only signature.
Within transformation: raw scatter with the shallow POLS slope (left); demeaned scatter with the steeper FE slope through the origin (right).
Within transformation, first-differences, and dummy-variable FE are three recipes for the same dish. Absorption (| ID) is just the fast one.
Absorbing the year effect removes the aggregate wage trend FD’s intercept was capturing. Time-invariant regressors (schooling, female) are silently absorbed.
RE is a variance-weighted average of between and within. With only 9% within, it leans toward the between picture — and SE is \(2.7\times\) tighter than FE.
\[H = (\hat\beta_{\mathrm{FE}} - \hat\beta_{\mathrm{RE}})'\,[V_{\mathrm{FE}} - V_{\mathrm{RE}}]^{-1}\,(\hat\beta_{\mathrm{FE}} - \hat\beta_{\mathrm{RE}}) \sim \chi^2(k)\]
If FE and RE disagree a lot, \(H\) is large and RE is suspect. Here \(H = 1.79\), \(p = 0.180\) — fail to reject.
But \(H\) has low power exactly when within variation is thin: a noisy FE inflates \(V_{\mathrm{FE}}\) and shrinks \(H\) mechanically.
\[y_{it} = \alpha + \beta\,x_{it} + \gamma\,\bar{x}_i + u_{it}\]
Add each worker’s mean union exposure \(\bar{x}_i\), then run RE. The within coefficient \(\beta\) equals FE; the mean coefficient \(\gamma\) tests whether \(\alpha_i\) correlates with \(x\).
CRE within \(= 0.2103\) (matches FE exactly); Mundlak term \(\gamma = -0.144\), \(p = 0.072\) — borderline, hinting at negative selection.
Act III
0.210
\(\hat\beta_{\mathrm{FE}}\) on union (SE 0.081) — vs pooled OLS 0.075; FDFE, TWFE, and CRE all agree near 0.21
| Method | Coef | SE | Variation used |
|---|---|---|---|
| POLS | 0.0750 | 0.0231 | all (ignores panel) |
| Between | 0.0662 | 0.0311 | cross-sectional means |
| RE | 0.1092 | 0.0299 | GLS between + within |
| FDFE | 0.2113 | 0.0792 | within differences |
| FE | 0.2103 | 0.0812 | within demeaned |
| CRE | 0.2103 | 0.0703 | RE + Mundlak (= within) |
Cross-sectional 7–11% · within ~21%. Standard errors swing inversely — the within camp is noisier but causally cleaner.
Extended models — union, age, schooling, female across POLS / TWFE / RE / CRE. The within premium survives controls.
Objection. Within estimators just net out fixed traits — they can’t manufacture identification.
Response. Correct. FE/FDFE/TWFE/CRE target the ATE for union switchers only — and only under strict exogeneity given the worker fixed effect.
p = 0.072
Mundlak term — borderline, more honest than Hausman’s confident p = 0.180 “fail to reject”