Lasso problem

Here is a geometric interpretation of the solutions for the equation in c above. We use the alternate form of Lasso constraints | \hat{\beta}_1 | + | \hat{\beta}_2 | < s.

The Lasso constraint take the form | \hat{\beta}_1 | + | \hat{\beta}_2 | < s, which when plotted take the familiar shape of a diamond centered at origin (0, 0). Next consider the squared optimization constraint (y_1 - \hat{\beta}_1x_{11} - \hat{\beta}_2x_{12})^2 + (y_2 - \hat{\beta}_1x_{21} - \hat{\beta}_2x_{22})^2. We use the facts x_{11} = x_{12}, x_{21} = x_{22}, x_{11} + x_{21} = 0, x_{12} + x_{22} = 0 and y_1 + y_2 = 0 to simplify it to

Minimize: 2.(y_1 - (\hat{\beta}_1 + \hat{\beta}_2)x_{11})^2.

This optimization problem has a simple solution: \hat{\beta}_1 + \hat{\beta}_2 = \frac{y_1}{x_{11}}. This is a line parallel to the edge of Lasso-diamond \hat{\beta}_1 + \hat{\beta}_2 = s. Now solutions to the original Lasso optimization problem are contours of the function (y_1 - (\hat{\beta}_1 + \hat{\beta}_2)x_{11})^2 that touch the Lasso-diamond \hat{\beta}_1 + \hat{\beta}_2 = s. Finally, as \hat{\beta}_1 and \hat{\beta}_2 very along the line \hat{\beta}_1 + \hat{\beta}_2 = \frac{y_1}{x_{11}}, these contours touch the Lasso-diamond edge \hat{\beta}_1 + \hat{\beta}_2 = s at different points. As a result, the entire edge \hat{\beta}_1 + \hat{\beta}_2 = s is a potential solution to the Lasso optimization problem!

Similar argument can be made for the opposite Lasso-diamond edge: \hat{\beta}_1 + \hat{\beta}_2 = -s.

Thus, the Lasso problem does not have a unique solution. The general form of solution is given by two line segments:

\hat{\beta}_1 + \hat{\beta}_2 = s; \hat{\beta}_1 \geq 0; \hat{\beta}_2 \geq 0
and
\hat{\beta}_1 + \hat{\beta}_2 = -s; \hat{\beta}_1 \leq 0; \hat{\beta}_2 \leq 0

©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容