Skip to main content

矩阵求导

符号

  • 标量(Scalar):xx
  • 向量(Vector):x\mathbf{x}
  • 矩阵(Matrix):X\mathbf{X}

布局

  • Numerator Layout:分子布局,分子不变,分母转置
  • Denominator Layout:分母布局,分母不变,分子转置

向量对标量求导

yx=[y1xy2xymx]\frac{\partial \mathbf{y}}{\partial x}=\begin{bmatrix} \frac{\partial y_1}{\partial x} \\ \frac{\partial y_2}{\partial x} \\ \vdots \\ \frac{\partial y_m}{\partial x} \end{bmatrix}

标量对向量求导

yx=[yx1yx2yxn]\frac{\partial y}{\partial \mathbf{x}}=\begin{bmatrix} \frac{\partial y}{\partial x_1} & \frac{\partial y}{\partial x_2} & \cdots & \frac{\partial y}{\partial x_n} \end{bmatrix}

向量对向量求导

The vector-by-vector derivative is the Jacobian matrix of y\mathbf{y} with respect to x\mathbf{x}.

yx=[y1x1y1x2y1xny2x1y2x2y2xnymx1ymx2ymxn]\frac{\partial \mathbf{y}}{\partial \mathbf{x}}=\begin{bmatrix} \frac{\partial y_1}{\partial x_1} & \frac{\partial y_1}{\partial x_2} & \cdots & \frac{\partial y_1}{\partial x_n} \\ \frac{\partial y_2}{\partial x_1} & \frac{\partial y_2}{\partial x_2} & \cdots & \frac{\partial y_2}{\partial x_n} \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial y_m}{\partial x_1} & \frac{\partial y_m}{\partial x_2} & \cdots & \frac{\partial y_m}{\partial x_n} \end{bmatrix}

where

y=[y1y2ym],x=[x1x2xn]\mathbf{y}=\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_m \end{bmatrix} ,\quad \mathbf{x}=\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}

The bold symbols for vectors and matrices are not necessary, but they are used here for clarity.

矩阵对标量求导

Yx=[y11xy12xy1nxy21xy22xy2nxym1xym2xymnx]\frac{\partial \mathbf{Y}}{\partial x}=\begin{bmatrix} \frac{\partial y_{11}}{\partial x} & \frac{\partial y_{12}}{\partial x} & \cdots & \frac{\partial y_{1n}}{\partial x} \\ \frac{\partial y_{21}}{\partial x} & \frac{\partial y_{22}}{\partial x} & \cdots & \frac{\partial y_{2n}}{\partial x} \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial y_{m1}}{\partial x} & \frac{\partial y_{m2}}{\partial x} & \cdots & \frac{\partial y_{mn}}{\partial x} \end{bmatrix}

标量对矩阵求导

yX=[yx11yx12yx1nyx21yx22yx2nyxm1yxm2yxmn]\frac{\partial y}{\partial \mathbf{X}}=\begin{bmatrix} \frac{\partial y}{\partial x_{11}} & \frac{\partial y}{\partial x_{12}} & \cdots & \frac{\partial y}{\partial x_{1n}} \\ \frac{\partial y}{\partial x_{21}} & \frac{\partial y}{\partial x_{22}} & \cdots & \frac{\partial y}{\partial x_{2n}} \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial y}{\partial x_{m1}} & \frac{\partial y}{\partial x_{m2}} & \cdots & \frac{\partial y}{\partial x_{mn}} \end{bmatrix}