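Chapter 10: Foundations of Econometrics

Economics makes causal claims: minimum wages affect employment, education raises earnings, institutions determine growth. Testing these claims requires data and a method for separating causation from correlation. Econometrics is that method.

This chapter is not a statistics course. We assume the reader is familiar with basic probability and regression analysis. Instead, we focus on the central problem of empirical economics: identification, that is, finding a credible source of exogenous variation that lets us estimate a causal effect. Every tool in this chapter (OLS, instrumental variables, difference-in-differences, regression discontinuity) is a strategy for solving the identification problem.

Prerequisites: Chapters 2 and 5 (economic background for the examples). Mathematical prerequisites: linear algebra, probability, and statistics.

10.1 The Identification Problem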

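Consider the question: does an additional year of education increase earnings? We observe that people with more education earn more. But is this because education itself raises earnings, or because people who would have earned more anyway (those with higher ability or a more advantaged family background) obtain more education?

Both are consistent with the observed correlation. The identification problem is that we cannot directly compare the same person with and without the education: the counterfactual is unobservable.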

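The structural equation is $Y_i = \alpha + \beta X_i + \varepsilon_i$ (Equation 10.1), where $Y_i$ is the outcome (earnings), $X_i$ is the treatment (years of education), $\beta$ is the causal parameter of interest, and $\varepsilon_i$ captures every other factor that affects $Y_i$: ability, family background, motivation, luck, health, and thousands of others.

The identification problem arises when $X_i$ is correlated with $\varepsilon_i$, that is, when the "treatment" is not randomly assigned. In statistics this is called endogeneity. In economics it is the rule rather than the exception: people choose their education (and that choice is correlated with ability), countries choose their policies (and that choice is correlated with their economic conditions), and firms choose their prices (and that choice is correlated with demand conditions).

In a randomized experiment, the treatment $X_i$ is determined by a coin flip, so it is independent of $\varepsilon_i$ by construction. But economists rarely get the chance to randomize on the big questions. The methods in this chapter (OLS, IV, DiD, RD) are strategies for finding "natural experiments" in observational data that approximate randomization.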

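10.2 Ordinary Least Squares (OLS)

Under the classical (Gauss-Markov) assumptions, OLS is BLUE: the best linear unbiased estimator. "Best" means it has the lowest variance among all linear unbiased estimators. "Unbiased" means $E[\hat{\beta}] = \beta$.

The key assumption is the fourth: $E[\varepsilon|X] = 0$. When it fails (because of omitted variables, simultaneity, or measurement error in $X$), OLS is biased. The estimate $\hat{\beta}$ does not converge to the true $\beta$ even with infinite data. This is not a small-sample problem; it is a fundamental design flaw that more data cannot fix.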
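
The claim that more data cannot fix endogeneity can be checked with a small simulation. The sketch below is illustrative only: the data-generating process (an unobserved `ability` term that enters both `x` and the error, with coefficients 0.8 and 1.0) is a hypothetical assumption, not something estimated from real data.

```python
import numpy as np

def ols_slope(x, y):
    """Slope from a bivariate OLS regression of y on x (with intercept)."""
    return np.cov(x, y)[0, 1] / np.var(x, ddof=1)

def simulate_endogenous(n, beta=1.0, seed=0):
    """Hypothetical DGP in which x is correlated with the error term."""
    rng = np.random.default_rng(seed)
    ability = rng.normal(size=n)            # unobserved, part of the error
    x = 0.8 * ability + rng.normal(size=n)  # treatment depends on ability
    eps = ability + rng.normal(size=n)      # so Cov(x, eps) = 0.8, not 0
    y = 2.0 + beta * x + eps
    return x, y

# The true beta is 1.0; the OLS slope settles near 1 + 0.8 / 1.64 instead.
slopes = {n: ols_slope(*simulate_endogenous(n)) for n in (100, 10_000, 1_000_000)}
for n, b in slopes.items():
    print(f"n={n:>9,d}  OLS slope = {b:.3f}  (true beta = 1.0)")
```

Under these assumed parameters the slope converges to $\beta + Cov(X,\varepsilon)/Var(X) = 1 + 0.8/1.64 \approx 1.49$, so a larger sample only estimates the wrong number more precisely.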

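Omitted Variable Bias

Suppose the true model is $Y = \beta_0 + \beta_1 X + \beta_2 Z + u$, but we omit $Z$ and run $Y = \alpha_0 + \alpha_1 X + e$. Then $E[\hat{\alpha}_1] = \beta_1 + \beta_2 \cdot Cov(X,Z)/Var(X)$ (Equation 10.3).

The bias equals the effect of the omitted variable ($\beta_2$) times the association between the omitted variable and the included regressor ($Cov(X,Z)/Var(X)$).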
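
The omitted variable bias formula (Equation 10.3) can be verified numerically. The following is a minimal sketch on simulated data; the parameter values (`beta1 = 0.5`, `beta2 = 2.0`, and the 0.6 loading of `x` on `z`) are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 200_000
beta1, beta2 = 0.5, 2.0                 # hypothetical true coefficients

z = rng.normal(size=n)                  # the variable we will omit
x = 0.6 * z + rng.normal(size=n)        # included regressor, correlated with z
u = rng.normal(size=n)
y = 1.0 + beta1 * x + beta2 * z + u     # true model

var_x = np.var(x, ddof=1)
cov_xz = np.cov(x, z)[0, 1]

# Short regression of y on x alone (z omitted).
short_slope = np.cov(x, y)[0, 1] / var_x

# Prediction from Equation 10.3: beta1 + beta2 * Cov(X, Z) / Var(X).
predicted = beta1 + beta2 * cov_xz / var_x

print(f"short-regression slope: {short_slope:.3f}")
print(f"formula prediction:     {predicted:.3f}")
print(f"true beta1:             {beta1:.3f}")
```

Because $\beta_2 > 0$ and $Cov(X,Z) > 0$ in this setup, the short regression overestimates $\beta_1$.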

	$Cov(X, Z) > 0$	$Cov(X, Z) < 0$
$\beta_2 > 0$	Upward bias (overestimates $\beta_1$)	Downward bias
$\beta_2 < 0$	Downward bias	Upward bias

10.3 Instrumental Variables (IV)

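When OLS is biased because $X$ is endogenous ($Cov(X, \varepsilon) \neq 0$), an instrumental variable $Z$ can rescue the estimate.

The first stage regresses $X$ on the instrument $Z$. This isolates the part of $X$ that is driven by the instrument: the exogenous part. The fitted values $\hat{X}_i$ represent the "clean" variation in $X$.

The IV estimate is the ratio of the reduced form (the effect of $Z$ on $Y$) to the first stage (the effect of $Z$ on $X$): $\hat{\beta}_{IV} = Cov(Z,Y)/Cov(Z,X)$ (Equation 10.5). Intuition: $Z$ affects $Y$ only through $X$ (the exclusion restriction), so dividing by the first stage isolates the causal effect of $X$ on $Y$.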
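
The ratio interpretation can be illustrated with simulated data. This is a sketch under assumed conditions: the instrument `z` is generated to be independent of the unobserved `ability` term (so the exclusion restriction holds by construction), and all coefficients are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 200_000
beta = 1.0                                   # true causal effect (assumed)

ability = rng.normal(size=n)                 # unobserved confounder
z = rng.normal(size=n)                       # instrument, independent of ability
x = 0.5 * z + 0.8 * ability + rng.normal(size=n)
y = 2.0 + beta * x + ability + rng.normal(size=n)

var_z = np.var(z, ddof=1)
reduced_form = np.cov(z, y)[0, 1] / var_z    # effect of Z on Y
first_stage = np.cov(z, x)[0, 1] / var_z     # effect of Z on X

beta_iv = reduced_form / first_stage         # equals Cov(Z,Y) / Cov(Z,X)
beta_ols = np.cov(x, y)[0, 1] / np.var(x, ddof=1)

print(f"OLS estimate: {beta_ols:.3f}  (biased upward by ability)")
print(f"IV estimate:  {beta_iv:.3f}  (true beta = {beta})")
```

In real applications the exclusion restriction cannot be imposed this way; it has to be argued from institutional knowledge.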

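What IV estimates. Under heterogeneous treatment effects, IV identifies the local average treatment effect (LATE): the causal effect for the subpopulation whose behavior is changed by the instrument (the "compliers").

Weak Instruments

If $Z$ is only weakly correlated with $X$, the first stage is weak and the IV estimate is unreliable (biased toward OLS, with wide confidence intervals). Rule of thumb: the first-stage F statistic should exceed 10.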
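
The rule of thumb can be made concrete by computing the first-stage F statistic directly. The sketch below assumes a single instrument and homoskedastic errors, in which case F is the squared t statistic on the instrument; the helper name `first_stage_f` and the two first-stage coefficients (0.5 and 0.02) are illustrative assumptions.

```python
import numpy as np

def first_stage_f(z, x):
    """F statistic for H0: slope = 0 in the first stage x = a + b*z + v.

    Assumes one instrument and homoskedastic errors, so F = t^2.
    """
    n = x.size
    Z = np.column_stack([np.ones(n), z])
    coef, *_ = np.linalg.lstsq(Z, x, rcond=None)
    resid = x - Z @ coef
    sigma2 = resid @ resid / (n - 2)
    var_slope = sigma2 * np.linalg.inv(Z.T @ Z)[1, 1]
    return coef[1] ** 2 / var_slope

rng = np.random.default_rng(11)
n = 1_000
z = rng.normal(size=n)
v = rng.normal(size=n)

f_strong = first_stage_f(z, 0.5 * z + v)    # instrument moves x a lot
f_weak = first_stage_f(z, 0.02 * z + v)     # instrument barely moves x

print(f"strong instrument: F = {f_strong:.1f}")
print(f"weak instrument:   F = {f_weak:.1f}")
```

With heteroskedastic or clustered data, a robust version of the F statistic would be the more appropriate diagnostic.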

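10.4 Difference-in-Differences (DiD)

The DiD estimator is $\hat{\tau}_{DiD}$ = (change in treated group) − (change in control group) (Equation 10.6). The first difference removes time-invariant group characteristics. The second difference removes common time trends.

Key assumption: parallel trends. In the absence of treatment, the treated and control groups would have followed the same trend. This cannot be tested in the post-treatment period, but it can be assessed in the pre-treatment period.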
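
The two ways of writing the DiD estimator, as a difference of four cell means (Equation 10.6) and as the interaction coefficient in a regression (Equation 10.7), give the same number. A minimal sketch on simulated data follows; the group gap (2.0), time trend (1.5), and treatment effect (`tau = 3.0`) are hypothetical, and parallel trends holds by construction.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 40_000
treat = rng.integers(0, 2, size=n)    # 1 = treated group
post = rng.integers(0, 2, size=n)     # 1 = post-policy period
tau = 3.0                             # true treatment effect (assumed)

y = 10 + 2.0 * treat + 1.5 * post + tau * treat * post + rng.normal(size=n)

# Equation 10.7: regress y on Treat, Post, and Treat x Post.
X = np.column_stack([np.ones(n), treat, post, treat * post])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
tau_regression = coef[3]

# Equation 10.6: (change in treated) - (change in control).
def cell_mean(t, p):
    return y[(treat == t) & (post == p)].mean()

tau_means = (cell_mean(1, 1) - cell_mean(1, 0)) - (cell_mean(0, 1) - cell_mean(0, 0))

print(f"interaction coefficient: {tau_regression:.3f}")
print(f"difference of means:     {tau_means:.3f}")
```

The regression form is usually preferred in practice because it extends naturally to covariates and to clustered standard errors.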

Big Question #3

Does the minimum wage cause unemployment?

You now have difference-in-differences, instrumental variables, and the tools of causal identification. This is where the minimum wage debate gets resolved — not by theory, but by evidence.

The Model's Account

Card and Krueger (1994) applied the method you just learned — difference-in-differences — to a natural experiment. When New Jersey raised its minimum wage from \$4.25 to \$5.05 in 1992, neighboring Pennsylvania didn't. By surveying fast-food restaurants on both sides of the border before and after the increase, they constructed a clean DiD estimate: the treatment group (NJ) versus the control group (PA), differencing out common trends. The result stunned the profession: employment in New Jersey fast-food restaurants didn't fall. If anything, it rose slightly. The competitive model's prediction — that a binding price floor reduces quantity demanded — failed its most direct empirical test. Subsequent studies using county-border designs (Dube, Lester & Reich, 2010) confirmed the pattern: comparing adjacent counties across state lines where one side raised its minimum wage and the other didn't, employment effects were small to negligible for moderate increases.

The Strongest Counterargument

Neumark and Wascher mounted the most sustained challenge. Using payroll records collected from the restaurants instead of Card and Krueger's telephone surveys, they found employment did decline in New Jersey; the original result, they argued, was an artifact of noisy survey data. Beyond data quality, the critique has structural force: DiD captures short-run effects, but firms adjust on multiple margins over time. Hours get cut even when headcount doesn't (Jardim et al., 2022, on Seattle's \$15 minimum). Benefits erode. Automation accelerates: self-order kiosks and scheduling software aren't coincidental. And the border-design studies may systematically understate effects by comparing areas that are economically similar precisely because they trade workers across the border, contaminating the control group. The meta-analysis is genuinely mixed: which studies you weight, and how, determines whether you find small negative effects or no effects.

The Mainstream Response

The field's response illustrates what economists call the "credibility revolution" — the shift from estimating structural models to designing identification strategies. Card and Krueger didn't just challenge a prediction; they changed how empirical economics is done. The question moved from "what does the model predict?" to "can we find a credible research design that isolates the causal effect?" Cengiz, Dube, Lindner, and Zipperer (2019) produced the most comprehensive answer to date, analyzing 138 state-level minimum wage changes using a bunching estimator. They looked at the entire wage distribution: jobs paying just below the new minimum disappeared, jobs paying at or just above it appeared, and — crucially — total employment in the affected range barely changed. The jobs didn't vanish; they moved up the wage ladder. This is exactly what the monopsony model from Chapter 6 predicts and exactly what the competitive model says shouldn't happen.

Verdict (At This Level)

The textbook prediction — that minimum wages cause unemployment — is wrong as a general empirical claim. Moderate minimum wage increases, up to roughly 50–60% of the local median wage, produce minimal detectable employment effects in most credible studies. This is consistent with monopsony power in low-wage labor markets: when employers have wage-setting power, a moderate minimum wage pushes them toward the competitive outcome rather than away from it. But "moderate" is the operative word. The competitive model isn't wrong — it's incomplete. Push the minimum wage high enough relative to local conditions (above 60% of the median, as a federal \$15 would in low-wage regions), and the standard prediction reasserts itself. The deeper lesson is methodological: a theoretical prediction that seemed airtight for decades was overturned not by better theory but by better identification. The model was logically correct; its empirical relevance was the question all along.

What We Cannot Resolve Yet

This Big Question is essentially resolved at this level: moderate minimum wages don't cause significant unemployment, consistent with monopsony. The remaining frontier is calibration, not direction. How high can you go before disemployment appears? The answer varies by region, sector, and time horizon, and the automation margin (kiosks, AI scheduling, self-checkout) may make long-run effects larger than short-run DiD estimates capture. The debate has shifted from "does it cause unemployment?" to "what's the right number for this labor market?", which is a policy design question, not an economic theory question. The tools you learned in this chapter (DiD, IV, identification strategy) are exactly how that calibration question gets answered.


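10.5 Regression Discontinuity (RD)

The RD estimator compares the limits of the conditional mean of the outcome on either side of the cutoff $c$: $\hat{\tau}_{RD} = \lim_{x \downarrow c} E[Y|X=x] - \lim_{x \uparrow c} E[Y|X=x]$ (Equation 10.8).

Key assumption: continuity. All factors that affect $Y$ (other than the treatment) vary continuously at the cutoff, so there is no sorting or manipulation around the threshold.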
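
One common way to approximate the two limits in Equation 10.8 is to fit a separate straight line on each side of the cutoff within a narrow window and compare the two fitted values at the cutoff. The sketch below does this with a uniform kernel. The function name `rd_estimate`, the bandwidth of 0.2, and the simulated scholarship data (a GPA cutoff of 3.5 and a true jump of 5) are all illustrative assumptions.

```python
import numpy as np

def rd_estimate(x, y, cutoff, bandwidth):
    """Sharp RD: gap between local linear fits at the cutoff (uniform kernel)."""
    def fitted_value_at_cutoff(mask):
        xc = x[mask] - cutoff                    # center so intercept = value at cutoff
        X = np.column_stack([np.ones(xc.size), xc])
        coef, *_ = np.linalg.lstsq(X, y[mask], rcond=None)
        return coef[0]

    right = (x >= cutoff) & (x <= cutoff + bandwidth)
    left = (x < cutoff) & (x >= cutoff - bandwidth)
    return fitted_value_at_cutoff(right) - fitted_value_at_cutoff(left)

rng = np.random.default_rng(3)
n = 50_000
gpa = rng.uniform(3.0, 4.0, size=n)              # running variable
treated = gpa >= 3.5                             # scholarship awarded at the cutoff
y = 20 + 10 * (gpa - 3.5) + 5.0 * treated + rng.normal(size=n)

tau_rd = rd_estimate(gpa, y, cutoff=3.5, bandwidth=0.2)
print(f"RD estimate of the jump at 3.5: {tau_rd:.2f}")
```

Applied work typically also reports how the estimate changes with the bandwidth, since a wider window lowers variance but relies more heavily on the linear approximation.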

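10.6 Randomized Controlled Trials (RCTs)

RCTs are the "gold standard" for internal validity because randomization guarantees $E[\varepsilon|X] = 0$ by construction. Banerjee, Duflo, and Kremer received the 2019 Nobel Prize for their experimental approach to alleviating global poverty.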
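
A short simulation shows why randomization works: when treatment is assigned by a coin flip, unobserved characteristics are balanced across arms and the simple difference in means (Equation 10.9) recovers the effect. The setup below is hypothetical, including the true effect of 2.0 and the unobserved `ability` term.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 100_000

ability = rng.normal(size=n)              # unobserved by the researcher
treat = rng.integers(0, 2, size=n)        # coin-flip assignment
y = 1.0 + 2.0 * treat + ability + rng.normal(size=n)

# Equation 10.9: difference in mean outcomes between arms.
tau_rct = y[treat == 1].mean() - y[treat == 0].mean()

# Balance check: the unobserved factor is (nearly) equal across arms.
ability_gap = ability[treat == 1].mean() - ability[treat == 0].mean()

print(f"estimated effect: {tau_rct:.3f}  (true effect = 2.0)")
print(f"ability gap between arms: {ability_gap:.4f}")
```

In an actual trial `ability` is unobserved, so balance is checked on the baseline covariates that are available.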

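Limitations of RCTs

10.7 Standard Errors and Inference

The variance of the OLS estimator is $Var(\hat{\beta}) = \sigma^2(X'X)^{-1}$ (Equation 10.10). The standard errors (SE) are the square roots of its diagonal elements. An approximate 95% confidence interval is $\hat{\beta} \pm 1.96 \cdot SE(\hat{\beta})$.

Statistical significance: if $|t| = |\hat{\beta}/SE(\hat{\beta})| > 1.96$, we reject $H_0: \beta = 0$ at the 5% level.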
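
The steps from Equation 10.10 to a t test can be sketched in a few lines of NumPy. The helper `ols_with_se` and the simulated data (true slope 0.5) are illustrative assumptions, and the standard errors here are the classical ones, valid only under homoskedasticity.

```python
import numpy as np

def ols_with_se(X, y):
    """OLS coefficients and classical standard errors, sqrt(diag(s^2 (X'X)^-1))."""
    n, k = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    beta_hat = xtx_inv @ X.T @ y
    resid = y - X @ beta_hat
    sigma2_hat = resid @ resid / (n - k)        # estimate of sigma^2
    se = np.sqrt(np.diag(sigma2_hat * xtx_inv))
    return beta_hat, se

rng = np.random.default_rng(0)
n = 5_000
x = rng.normal(size=n)
y = 1.0 + 0.5 * x + rng.normal(size=n)
X = np.column_stack([np.ones(n), x])

beta_hat, se = ols_with_se(X, y)
t_stat = beta_hat[1] / se[1]
ci = (beta_hat[1] - 1.96 * se[1], beta_hat[1] + 1.96 * se[1])
reject = abs(t_stat) > 1.96                     # test of H0: slope = 0 at 5%

print(f"slope = {beta_hat[1]:.3f}, SE = {se[1]:.4f}, t = {t_stat:.1f}")
print(f"95% CI: ({ci[0]:.3f}, {ci[1]:.3f}), reject H0: {reject}")
```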

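Economic versus statistical significance: a coefficient can be statistically significant yet economically trivial. Conversely, an imprecise estimate can be economically large yet statistically insignificant. Good empirical work discusses both.

Threats to Valid Inference

A practical rule: in modern applied economics, always use robust or clustered standard errors.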
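
To see why the rule matters, the sketch below compares classical standard errors with heteroskedasticity-robust ones on simulated data where the error variance grows with $x^2$. The robust formula used is the common "sandwich" estimator with the HC1 degrees-of-freedom correction; the function name and the data-generating process are illustrative assumptions, and clustered standard errors are not covered here.

```python
import numpy as np

def ols_classical_and_robust_se(X, y):
    """Return OLS coefficients, classical SEs, and HC1 robust SEs."""
    n, k = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    beta_hat = xtx_inv @ X.T @ y
    e = y - X @ beta_hat

    classical = np.sqrt(np.diag((e @ e) / (n - k) * xtx_inv))

    # Sandwich: (X'X)^-1 [sum_i e_i^2 x_i x_i'] (X'X)^-1, scaled by n/(n-k).
    meat = (X * (e ** 2)[:, None]).T @ X
    robust_var = n / (n - k) * xtx_inv @ meat @ xtx_inv
    robust = np.sqrt(np.diag(robust_var))
    return beta_hat, classical, robust

rng = np.random.default_rng(9)
n = 20_000
x = rng.normal(size=n)
eps = x * rng.normal(size=n)            # error variance depends on x
y = 1.0 + 0.5 * x + eps
X = np.column_stack([np.ones(n), x])

beta_hat, classical_se, robust_se = ols_classical_and_robust_se(X, y)
print(f"classical SE of slope: {classical_se[1]:.4f}")
print(f"robust SE of slope:    {robust_se[1]:.4f}")
```

In this assumed setup the classical formula understates the slope's uncertainty, so a t test based on it would reject too often.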

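10.8 Threats to Validity

Strategy	Key assumption	Threat	Diagnostic
OLS	No omitted variables ($E[\varepsilon|X]=0$)	Confounding	Theory + sensitivity analysis
IV	Exclusion restriction	Direct effect of $Z$ on $Y$	Not directly testable; argue from theory
IV	Relevance	Weak instruments	First-stage F > 10
DiD	Parallel trends	Differential pre-treatment trends	Plot pre-treatment trends
RD	No manipulation at the cutoff	Sorting around the threshold	McCrary density test
RCT	No attrition, no spillovers	Differential dropout; contamination	Balance tests, attrition analysis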

	Pre-policy (2023)	Post-policy (2025)	Change
East (treated)	55	63	+8
West (control)	52	56	+4
DiD estimate			+4
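
Applying Equation 10.6 to the two-by-two table above reproduces the reported estimate (the table does not state the units of the outcome, so none are assumed here):

$$
\hat{\tau}_{DiD} = (63 - 55) - (56 - 52) = 8 - 4 = 4
$$

The control group's change of +4 stands in for what would have happened in the East without the policy, which is the parallel trends assumption at work.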

Label	Formula	Description
Equation 10.1	$Y_i = \alpha + \beta X_i + \varepsilon_i$	Structural equation
Equation 10.2	$\hat{\beta}_{OLS} = (X'X)^{-1}X'Y$	OLS estimator
Equation 10.3	$E[\hat{\alpha}_1] = \beta_1 + \beta_2 \cdot Cov(X,Z)/Var(X)$	Omitted variable bias formula
Equation 10.5	$\hat{\beta}_{IV} = Cov(Z,Y)/Cov(Z,X)$	IV estimator (simple form)
Equation 10.6	$\hat{\tau}_{DiD}$ = (change in treated group) − (change in control group)	DiD estimator
Equation 10.7	$Y_{it} = \alpha + \beta_1 Treat + \beta_2 Post + \tau(Treat \times Post) + \varepsilon$	DiD regression
Equation 10.8	$\hat{\tau}_{RD} = \lim_{x \downarrow c} E[Y|X=x] - \lim_{x \uparrow c} E[Y|X=x]$	RD estimator
Equation 10.9	$\hat{\tau}_{RCT} = \bar{Y}_{treat} - \bar{Y}_{control}$	RCT estimator
Equation 10.10	$Var(\hat{\beta}) = \sigma^2(X'X)^{-1}$	OLS variance

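Exercises

Basic Exercises

Suppose you use OLS to regress wages on years of education and estimate a coefficient of 0.10 (each additional year of education is associated with 10% higher wages). List two omitted variables that could bias this estimate, and predict the direction of the bias for each.
An IV study uses "distance to the nearest college" as an instrument for years of education. (a) Argue for relevance. (b) What is the exclusion restriction, and what might violate it?
City A enacts a soda tax; compare it with City B before and after. Before the tax, soda consumption is 100 cans per person in City A and 90 in City B. After the tax, it is 80 in City A and 85 in City B. Compute the DiD estimate. What is the parallel trends assumption here?
A scholarship program admits students with GPA ≥ 3.5. You have data on students with GPAs from 3.0 to 4.0. (a) Describe the RD design. (b) What is the running variable? (c) What assumption must hold about student behavior near the cutoff?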

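Applied Exercises

A government randomizes eligibility for a job training program. Of those offered the program, 60% actually participate. The intent-to-treat estimate is a \$500 increase in earnings. What is the estimated treatment effect? What assumptions do you need? How does this relate to IV?
An economist claims that democracy promotes economic growth, citing a cross-country correlation. Critique this claim using the framework of this chapter. What specific identification strategy would you propose?
A DiD study estimates the effect of an environmental regulation. Pre-treatment trends show that pollution in the treated group was already falling faster than in the control group. How does this violate parallel trends? What is the direction of the bias in the DiD estimate?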

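Challenge Problems

Derive the OLS estimator $\hat{\beta} = (X'X)^{-1}X'Y$ by minimizing $S(\beta) = (Y - X\beta)'(Y - X\beta)$. Show that the first-order condition gives the normal equations $X'X\hat{\beta} = X'Y$.
Show algebraically that when the instrument $Z$ is binary, the IV estimator reduces to the Wald estimator: $\hat{\beta}_{IV} = (\bar{Y}_1 - \bar{Y}_0)/(\bar{X}_1 - \bar{X}_0)$.
Discuss the "credibility revolution" in economics (Angrist and Pischke, 2010). What changed between structural econometrics and design-based empirical work? What are the strengths and limitations of each?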
