线性代数《Linear Algebra and Its Application》学习总结

此文仅为学习记录，内容会包括一些数学概念，定义，个人理解的摘要。希望能够分享一些学习内容。

第一节：Row Reduction and Echelon Forms

Echelon form: 行消元后的矩阵
Reduced echelon form: 行消元并且leading entry为1的矩阵。
Echelon form and reduced echelon form are row equivalent to the original form.
Span{v1, v2, v3,...... vp} is the collection of all vectors that can be written in the form c1*v1 + c2*v2 + ...... cp*vp with c1, .... cp scalars.
Ax = 0 has a nontrival solution if and only if the equation has at least one free variable.(not full column rank)
Ax = b 的解等于 Ax = 0 和特解的和。
解线性方程组流程P54。
线性无关指任何向量不能组合成其中一个向量。
Ax = b : ColA1 * x1 + ColA2 * x2 +.... ColAm * xm = b
Matrix Transformations: T(x) = Ax is linear transformation.
转换矩阵是各维单位转换后的组合。A = [T(e1) T(e2) .. T(en)]
A mapping T: R^n -> R^m is said to be onto R^m if each b in R^m is the image of at least one x in R^n. (Ax = b 有解)
A mapping T: R^n -> R^m is said to be one-to-one R^m if each b in R^m is the image of at most one x in R^n.

第二节：Matrix Operation

Each column of AB is a linear combination of the columns of A using weightings from the corresponding columns of B. AB = A[b1 b2 b3 b4 ,,, bp] = [Ab1 Ab2 ... Abp]
Each row of AB is a linear combination of the columns of B using weightings from the corresponding rows of A.
Warning: AB != BA. AB = AC !=> B = C. AB = 0 !=> A = 0 or B = 0
逆矩阵的定义：A-1*A = A*A-1 = E. 可以推导出A为方阵，详见Exercise 23-25 ，Section 2.1. A可逆的充要条件为A满秩（行列式不等于0）。
对[A I] 做行消元可以得到[I A-1]
矩阵满秩的所有等价定义:P129,P179.
LU分解：A = LU，其中L为对角元素为1,的下半方阵，U为m*n的上半矩阵。L为变换矩阵的乘机的逆，U为A的Echelon form。计算L不需要计算各变换矩阵。详见P146。
subspace, column space, null space的定义。
A = m*n => rank(A) + rank(Nul(A)) = n.
The dimension of a nonzero subspace H, denoted by dim H, is the numbers of vectors in any basis for H. The dimension of the zero subspace {0} us defined to be zero.

第三节：Introduction to Determinants

determinant的定义和计算方式。
行消元不改变行列式值。交换行改变正负号。某一行乘以k，那么行列式乘以k。
三角矩阵的行列式为对角元素的乘积。
det(AB) = det(A) * det(B)。
Let A be an invertible n*n matrix. For any b in R^n, the unique solutionx of Ax = b has entries given by xi = det Ai(b)/det(A)。 Ai(b) 表示用b替换A的第i行。
由5可以推导出A^-1 = 1/det(A) * adj A. adj A = [(-1)^i+j* det(Aji)]
行列式与体积的关系：平行几何体的面积或者体积等于|det(A)|。而且 det(Ap) = det(A)*det(p)

第四节：Vector Spaces

An indexed set {v1, v2, ... ... vp} of two or more vectors, with vi != 0, is linearly dependent, if and only if some vj (with j > 1) is a linear combination of the preceding vectors.
Elementary row operation on a matrix do not affect the linear dependence relations among the columns of the matrix.
Row operations can change the column space of a matrix.
x = Pb [x]b: we call Pb the change-of-coordinates matrix from B to the standard basis in R^n.
Let B and C be bases of a vector space V. Then there is a unique n*n matrix P_C<-B such that [x]c = P_C<-B [x]b. The columns of P_C<-B are the C-coordinate vectors of the vectors in the basis B, that is P_C<-B = [[b1]c [b2]c ... [bn]c]. [ C B ] ~ [ I P_C<-B]

第五节：Eigenvectors and Eigenvalues

\(Ax =\lambda * x\)
不同特征值对应的特征向量线性无关。
det(A - λ *I) = 0. 因为(A - λ *I)有非零解。
A is similar to B if there is an invertible matrix P such that P^-1AP = B. They have same eigenvalues.
矩阵能够对角化的条件是有n个线性无关的特征向量（特征向量有无穷多个，线性无关向量的数量最多为n）。
特征空间的维度小于等于特征根的幂。当特征空间的维度等于特征根的幂，矩阵能够对角化。
相同坐标变换矩阵在不同维度空间坐标系下的转换：P328。相同坐标变换矩阵在不同坐标系的转换：P329。其实都是一样的。
Suppose A = PDP^-1, where D is a diagonal n*n matrix. If B is the basis for R^n formed from the columns of P, then D is the B-matrix for the transformation x ->Ax. 当坐标系转换为P时，转换矩阵对应变成对角矩阵。
复数系统。
迭代求特征值和特征向量。先估计一个特近的特征值和一个向量\(x_0\)（其中的最大元素为1）。然后迭代，迭代流程详见P365。迭代可以得到最大特征值的原因如下：因为\((\lambda_1)^{-k}A^kx\rightarrow c_1v_1\),所以对于任意\(x\),当k趋近无穷的时候，\(A^kx\)会和特征向量同向。虽然\(\lambda\)和\(c_1v_1\)都未知，但是由于\(Ax_k\)会趋近\(\lambda*x_k\),我们只要令\(x_k\)的最大元素为1，就能得到\(\lambda\)。

第六节：Inner Product, Length, and Orthogonality

\((Row A)^{\bot} = Nul A\) and \((Col A)^{\bot} = Nul A^{\top}\). 这很显然，其中\(A^{\bot}\)表示与A空间垂直的空间。
An orthogonal basis for a subspace W of \(R^n\) is a basis for W that is also an orthogonal set.
一个向量在某一维的投影：\(\hat{y} = proj_L y = \frac{y\cdot u}{u\cdot u}u\).
An set is an orthonormal set if it is an orthogonal set of unit vectors.
An m*n matrix U has orthonormal columns if and only if \(U^\top U = I\)
一个向量在某一空间的投影：\(\hat{y} = proj_w y = \frac{y\cdot u_1}{u_1\cdot u_1}u_1 + \frac{y\cdot u_2}{u_1\cdot u_2}u_2 + ... + \frac{y{\cdot}u_p}{u_p\cdot u_p}u_p.\)
如何将一堆向量弄成正交单位向量: repeat 3.
QR分解：如果A有线性无关的列向量，那么可以分解成Q（正交向量）和R（上三角矩阵，就是原坐标在正交坐标系的系数）\(Q^{\top}A=Q^{\top}(QR) = IR = R\)
最小平方lse（机器学习基础：非贝叶斯条件下的线性拟合问题），由\(A^{\top}(b-A\hat{x})=0\)得到\(\hat{x}=(A^\top A)^{-1}A^{\top}b\)。如果A可逆，此式可以化简。如果可以做QR分解，那么\(\hat{x}=R^{-1}Q^{\top}b\).
函数内积的概念。

第七节：Diagonaliztion of Symmetric matrixs

如果一个矩阵是对称的，那么它的任何两个特征值所对应的特征空间是正交的。
矩阵可正交对角化等价于它是一个对称矩阵。
\(A=PDP^{-1}\)可以得到PCA（机器学习算法主成分分析，对协方差矩阵（对称）做对角化）
将二次方程转化成没有叉乘项的形式。x=Py, \(A = PDP^{-1}\).
对于二次函数\(x^{\top}Ax\)，在|x| = 1的条件下，最大值为最大特征值，最小值为最小特征值。如果最大特征值（\(x^{\top}u_1\)）不能选，则选择次之。
正交矩阵P大概意思就是在该坐标系下，函数比较对称，D为坐标轴的伸展比例。
SVD分解（该书的最后一个内容，蕴含了很多上述的内容）是要将矩阵分解成类似PDP^-1的形式，但是不是任何矩阵都能表示成这种形式（有n个线性无关的特征向量，正交的话还要是对称矩阵）。其中\(A=U{\Sigma}V^{\top}\)，\({\Sigma}\)是A的singular value（\(A^{\top}A\)的特征值的开方），V是\(A^{\top}A\)的对应特征向量，U是\(AV\)的归一化。AV内的向量是垂直的。\(U{\Sigma}\)是AV的另外一种表示。

时间： 2024-10-05 22:46:46

线性代数《Linear Algebra and Its Application》学习总结的相关文章

线性代数 -- Linear Algebra with Applications

@.如果线性方程组无解,则称该方程组是不相容的(inconsistent). @.如果线性方程组至少存在一个解,则称该方程组是相容的(consistent). @.等价方程组(equivalent systems). @.定义:若两个含有相同变量的方程组具有相同的解集,则称它们是等价的(equivalent). @.得到等价的方程组: 1.交换任意两个方程的顺序. 2.任一方程两边同乘一个非零的实数. 3.任一方程的倍数加到另一方程上. @.定义:若方程组中,第k个方程的前k-1个变量的系数均为

《Linear Algebra and Its Application》-线性方程组的解

线性方程组的解: 通过先前文章对矩阵方程.线性方程组和向量方程这三种方程的通解性的介绍,现在我们就可以比较简便的表达一个线性方程组了.即有如下形式: Ax = b.其中A是m x n的矩阵,对应线性方程组的系数矩阵,而x是R^n的一个向量,记录了n个未知量,b则是线性方程组等式右边的尝试,在这里本质上也是R^n的一个向量,那么基于这种表达形式,我们开始对线性方程组的解的结构进行讨论(类似于对于微分方程我们讨论其通解.特解的一个过程). 我们从b出发,无非面临如下的两种情况. 齐次方程: b =

线性代数导论 | Linear Algebra 课程

搞统计的线性代数和概率论必须精通,最好要能锻炼出直觉,再学机器学习才会事半功倍. 线性代数只推荐Prof. Gilbert Strang的MIT课程,有视频,有教材,有习题,有考试,一套学下来基本就入门了. 不多,一共10次课. 链接:https://ocw.mit.edu/courses/mathematics/18-06-linear-algebra-spring-2010/calendar/ SES # TOPICS KEY DATES 1 The geometry of linear e

读Linear Algebra -- Gilbert Strang

转眼间已经我的学士学位修读生涯快要到期了,重读线性代数,一是为了重新理解Algebra的的重要概念以祭奠大一刷过的计算题,二是为了将来的学术工作先打下一点点(薄弱的)基础.数学毫无疑问是指导着的科研方向与科学发展,每次读同一本好的数学书都能读出不同的韵味. P1-149 Strang在书的序言便给出了linear algebra的一以贯之之道,我们所看到一切的来源便在于Ax=b这个方程组中.虽然从向量矩阵.线性方程组到向量空间.线性变换,费了好大劲才将任意一个线性变化凝练到一个矩阵上,但对于Ax

《Linear Algebra and Its Applications》- 线性方程组

同微分方程一样,线性代数也可以称得上是一门描述自然的语言,它在众多自然科学.经济学有着广阔的建模背景,这里笔者学识有限暂且不列举了,那么这片文章来简单的讨论一个问题——线性方程组. 首先从我们中学阶段就很熟系的二元一次方程组,我们采用换元(其实就是高斯消元)的方法.但是现在我们需要讨论更加一般的情况,对于线性方程,有如下形式: a1x1+a2x2+…anxn = b. 现在我们给出多个这样的方程构成方程组,我们是否有通用的解法呢? 在<Linear Algebra and Its Applica

Here’s just a fraction of what you can do with linear algebra

Here’s just a fraction of what you can do with linear algebra The next time someone wonders what the point of linear algebra is, send them here. I write a blog on math and programming and I see linear algebra applied to computer science all the time.

《Linear Algebra and Its Applications》-矩阵运算

可以说第一章<Linear Algebra and Its Applications>着重介绍了线性代数中几个核心概念(向量.矩阵和线性方程组)之间的关系(方程的同解性),那么下面这本书开始分别介绍这几个核心概念,比如从这篇文章开始,会简单的介绍矩阵方面的内容. 首先对于我们定义的计算工具(矩阵),我们有必要研究其运算规律,这个方法在定义很多新的运算符号的时候都是适用的.矩阵的加减法这里就不用累述的,非常好理解,这篇文中我们主要来讨论矩阵的乘法运算的定义过程. 其实不管是从离散的角度还是在线性

Machine Learning - III. Linear Algebra Review (Week 1, Optional)

机器学习Machine Learning - Andrew NG courses学习笔记矩阵和向量及其表示介绍 what are matrices矩阵 matrix is just another way for saying, is a 2D or a two dimensional array. dimension of the matrix is going to be written as the number of row times the number of columns in

A Linear Algebra Problem（唯一性的判定）

A Linear Algebra Problem Time Limit: 3000/1000MS (Java/Others) Memory Limit: 65535/65535KB (Java/Others) Submit Status God Kufeng is the God of Math. However, Kufeng is not so skilled with linear algebra, especially when dealing with matrixes. On