因果推断笔记(一)

EconML

Problem Setup

We assume we have data that are generated from some collection policy. In particular, we assume that we have data of the form: \{Y_i(T_i), T_i, X_i, W_i, Z_i\}, where Y_i(T_i) is the observed outcome for the chosen treatment, T_i is the treatment, X_i are the co-variates used for heterogeneity, W_i are other observable co-variates that we believe are affecting the potential outcome Y_i(T_i) and potentially also the treatment T_i; and Z_i are variables that affect the treatment T_i but do not directly affect the potential outcome. We will refer to variables W_i as controls and variables Z_i as instruments. The variables X_i can also be thought of as control variables, but they are special in the sense that they are a subset of the controls with respect to which we want to measure treatment effect heterogeneity. We will refer to them as features.

在EconML包的定义中,T_i是指对个体i的处理,Y_i(T_i)则是在T_i处理下的观测结果。X_i是针对异质性的控制变量,W_i是指可能会同时影响到处理T_i和潜在结果Y_i(T_i)的观测变量,Z_i是指会影响到处理T_i但不会直接影响到潜在结果的变量。在这里,W_i即为控制变量,而Z_i为工具变量。X_iW_i均为控制变量,但X_i主要为能够反映个体异质性的特征集合,它未必会影响处理T_i,而W_i更倾向于指能够影响到处理T_i的控制变量。

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容