给定数据集$ D={(x_1,y_1),(x_2,y_2),\dots,(x_m,y_m)},y_i \in R$. 线性回归(linear regression)试图学得一个线性模型以尽可能地预测实值输出标记,即:
$ \frac{\partial MSE(w,b)}{\partial b} = \sum_{i=1}^m2(wx_i+b-y_i) $
$ \qquad\qquad,,=2\left(w\sum_{i=1}^mx_i+mb-\sum_{i=1}^my_i \right)(3.4) $ 令公式3.4等于0,有:
$ w\sum_{i=1}^mx_i+mb-\sum_{i=1}^my_i=0 $
$ mb = \sum_{i=1}^my_i-w\sum_{i=1}^mx_i $
$ b = \frac{1}{m}\sum_{i=1}^my_i-w\frac{1}{m}\sum_{i=1}^mx_i $
$ b = \overline{y}-w\overline{x}(3.5) $
再把$MSE(w,b)$对$w$求偏导数:
$ \frac{\partial MSE(w,b)}{\partial w} =\sum_{i=1}^m2(wx_i+b-y_i)x_i $
$ \qquad\qquad,,=\sum_{i=1}^n2(wx_1^2+bx_i-y_ix_i) $
$ \qquad\qquad,,=2\left(w\sum_{i=1}^mx_i^2-\sum_{i=1}^m(y_i-b)x_i\right)(3.6) $
令上式等于0,把式3.5代入有:
$ w\sum_{i=1}^mx_i^2-\sum_{i=1}^m(y_i-\overline{y}+w\overline{x})x_i=0 $
$ w\sum_{i=1}^mx_i^2-\sum_{i=1}^m(y_ix_i-\overline{y}x_i)-w\sum_{i=1}^m\overline{x}x_i=0 $
$ w\sum_{i=1}^m(x_i^2-\overline{x}x_i) = \sum_{i=1}^m(y_ix_i-\overline{y}x_i) $
$ w = \frac{\sum_{i=1}^m(y_ix_i-\overline{y}x_i)}{\sum_{i=1}^m(x_i^2-\overline{x}x_i)} $
$
\quad =\frac{\sum_{i=1}^m(y_ix_i-\overline{y}x_i-y_i\overline{x}+\overline{x},\overline{y})}{\sum_{i=1}^m(x_i^2-\overline{x}x_i-x_i\overline{x}+\overline{x},\overline{x})}
$
\quad = \frac{\sum_{i=1}^m(y_i-\overline{y})(x_i-\overline{x})}{\sum_{i=1}^m(x_i-\overline{x})^2}
综上所述,给出给定训练数据集$ D={(x_1,y_1),(x_2,y_2),\dots,(x_m,y_m)},y_i \in R$,可以根据式3.7计算出$w$的值,然后再根据式3.5计算出$b$的值,至此,线性回归模型$f(x)$就确定下来了。