Causal Inference Notes (1)

EconML

Problem Setup

We assume we have data that are generated from some collection policy. In particular, we assume that we have data of the form $\{Y_i(T_i), T_i, X_i, W_i, Z_i\}$, where $Y_i(T_i)$ is the observed outcome for the chosen treatment, $T_i$ is the treatment, $X_i$ are the covariates used for heterogeneity, $W_i$ are other observable covariates that we believe affect the potential outcome $Y_i(T_i)$ and potentially also the treatment $T_i$, and $Z_i$ are variables that affect the treatment $T_i$ but do not directly affect the potential outcome. We will refer to variables $W_i$ as controls and variables $Z_i$ as instruments. The variables $X_i$ can also be thought of as control variables, but they are special in the sense that they are a subset of the controls with respect to which we want to measure treatment effect heterogeneity. We will refer to them as features.

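In EconML's definitions, $T_i$ is the treatment applied to individual $i$, and $Y_i(T_i)$ is the outcome observed under treatment $T_i$. $X_i$ are the control variables used to capture heterogeneity; $W_i$ are observed variables that may affect both the treatment $T_i$ and the potential outcome $Y_i(T_i)$; and $Z_i$ are variables that affect the treatment $T_i$ but do not directly affect the potential outcome. Here, $W_i$ are the controls and $Z_i$ are the instruments. Both $X_i$ and $W_i$ are control variables, but $X_i$ is primarily the set of features that reflect individual heterogeneity and need not affect the treatment $T_i$, whereas $W_i$ tends to denote controls that do affect the treatment $T_i$.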
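To make the roles of $Y$, $T$, $X$, $W$ and $Z$ concrete, here is a minimal simulation sketch in plain Python. It does not use EconML itself, and everything in it is an assumption made for illustration: the coefficients, the effect function `1 + 2x`, the continuous treatment, and the helper `cov` are all hypothetical. The feature $X$ drives the size of the effect, the control $W$ affects both treatment and outcome, and the instrument $Z$ affects only the treatment.

```python
import random

random.seed(0)
n = 20000  # number of simulated individuals (illustrative choice)


def cov(a, b):
    """Sample covariance of two equal-length lists (hypothetical helper)."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    return sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b)) / (len(a) - 1)


Y, T, X, W, Z = [], [], [], [], []
for _ in range(n):
    x = random.random()                         # feature X_i: drives effect heterogeneity
    w = random.gauss(0, 1)                      # control W_i: affects both T_i and Y_i
    z = 1.0 if random.random() < 0.5 else 0.0   # instrument Z_i: affects T_i only
    t = 0.5 * w + z + random.gauss(0, 1)        # treatment depends on W_i and Z_i
    theta = 1.0 + 2.0 * x                       # assumed effect of T on Y, varying with X_i
    y = theta * t + 2.0 * w + random.gauss(0, 1)  # outcome: no direct Z_i term
    Y.append(y); T.append(t); X.append(x); W.append(w); Z.append(z)

# Regressing Y on T alone ignores W, so the slope is confounded.
naive = cov(Y, T) / cov(T, T)

# The Wald (instrumental-variable) ratio uses only the variation in T caused by Z.
# In this simulation it targets the average effect E[1 + 2X] = 2.
wald = cov(Y, Z) / cov(T, Z)

print(f"naive slope:   {naive:.3f}")
print(f"Wald estimate: {wald:.3f}")
```

In this setup the naive slope is pushed above the true average effect of 2 because $W$ raises both $T$ and $Y$, while the instrument-based ratio is not affected by that confounding. Neither number says anything about how the effect changes with $X$, which is the heterogeneity that the features are meant to capture. EconML estimators generally take these same five roles as the `Y`, `T`, `X`, `W` and (for instrument-based estimators) `Z` arguments of `fit`; check the documentation of the specific estimator for the exact signature.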

Last edited: 2022.07.25 21:00:41
