Optimization and Machine Learning

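These notes are compiled from the lectures of Professor Wang Hao (王浩) at ShanghaiTech University.

Prerequisites: mathematical analysis, advanced (linear) algebra, statistics, and optimization.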

Machine learning (Tom M. Mitchell): "A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E."

• What is experience? Historical data.

• How to learn? Learning models and algorithms.

• Performance measure? Cost functions (error, penalty).

Machine learning, a branch of artificial intelligence, concerns the study and construction of systems that can learn and predict from data.

The core of machine learning deals with representation and generalization:

• Representation/explanation of data instances, and of the functions evaluated on these instances, is part of every machine learning system.

• Generalization (prediction) is the property that the system performs well on unseen data instances.
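
As a rough illustration of generalization, the sketch below fits a line to synthetic training data and then measures the error on held-out points the model never saw. The data-generating rule (y = 2x + 0.5 plus Gaussian noise), the 150/50 split, and the helper names `fit_line` and `mse` are assumptions made for this example and are not taken from the lecture.

```python
import random


def fit_line(xs, ys):
    """Closed-form least-squares fit of y ~ w * x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    w = sxy / sxx
    b = mean_y - w * mean_x
    return w, b


def mse(w, b, xs, ys):
    """Performance measure P: mean squared prediction error."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Experience E: synthetic historical data (assumed rule y = 2x + 0.5 + noise).
rng = random.Random(0)
xs = [rng.uniform(-1.0, 1.0) for _ in range(200)]
ys = [2.0 * x + 0.5 + rng.gauss(0.0, 0.1) for x in xs]

# Hold out the last 50 points; they play the role of unseen data.
train_x, test_x = xs[:150], xs[150:]
train_y, test_y = ys[:150], ys[150:]

w, b = fit_line(train_x, train_y)
train_error = mse(w, b, train_x, train_y)
test_error = mse(w, b, test_x, test_y)
print(f"w={w:.3f} b={b:.3f} train MSE={train_error:.4f} test MSE={test_error:.4f}")
```

If the test error stays close to the training error, the fitted model generalizes in the sense described above; a large gap between the two would suggest it does not.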

Machine learning tasks are typically classified into three broad categories:

• Supervised learning: The computer is presented with example inputs and their desired outputs, given by a "teacher", and the goal is to learn a general rule that maps inputs to outputs.

(Semi-supervised learning sits between supervised and unsupervised learning: only part of the training data is labeled.)

• Unsupervised learning: No labels are given to the learning algorithm, leaving it on its own to find structure in its input. Unsupervised learning can be a goal in itself (discovering hidden patterns in data) or a means towards an end (feature learning).

• Reinforcement learning: A computer program interacts with a dynamic environment in which it must achieve a certain goal (such as driving a vehicle), without a teacher explicitly telling it whether it has come close to its goal. Another example is learning to play a game by playing against an opponent.

Learning tasks:

• Classification

• Regression

• Clustering

• Density estimation

• Dimensionality reduction

Methods: regression, decision trees, the k-means algorithm, support vector machines, the Apriori algorithm, the EM algorithm, PageRank, kNN, naive Bayes, neural networks, ...
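
To make one of the listed methods concrete, here is a minimal sketch of the k-means algorithm (Lloyd's iteration) in plain Python. The initialization from the first k points, the fixed iteration count, and the toy data are assumptions chosen to keep the example short; practical implementations usually use a randomized initialization and a convergence check.

```python
def kmeans(points, k, iters=20):
    """Lloyd's algorithm: alternate point assignment and center update."""
    # Naive initialization (an assumption of this sketch): the first k points.
    centers = [list(p) for p in points[:k]]
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])),
            )
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its assigned points.
        for j, members in enumerate(clusters):
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers, clusters


# Toy data: two well-separated groups in the plane.
points = [(0.0, 0.0), (5.0, 5.0), (0.0, 1.0), (1.0, 0.0), (5.0, 6.0), (6.0, 5.0)]
centers, clusters = kmeans(points, k=2)
print(centers)
```

On this toy data the two centers settle at the means of the two groups, which is the clustering (unsupervised) behavior described earlier: no labels are used at any point.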

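The difference between machine learning and data mining: the overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use, whereas machine learning, as described above, concentrates on systems that learn and predict from data.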

Machine learning also has intimate ties to optimization:

• The three pillars: statistical modeling, feature selection, and learning via optimization (Netflix Prize).

• Many learning problems are formulated as the minimization of some loss on a training set of examples.
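
The second point, learning as minimization of a loss over training examples, can be sketched with plain gradient descent on a mean squared loss. The four training points, the step size of 0.1, and the iteration count are assumptions picked for this example so that the iteration converges; they do not come from the lecture.

```python
def gradient_descent(xs, ys, step=0.1, iters=500):
    """Minimize the mean squared loss (1/n) * sum((w*x + b - y)^2) over w, b."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(iters):
        residuals = [w * x + b - y for x, y in zip(xs, ys)]
        # Partial derivatives of the loss with respect to w and b.
        grad_w = (2.0 / n) * sum(r * x for r, x in zip(residuals, xs))
        grad_b = (2.0 / n) * sum(residuals)
        # Move against the gradient.
        w -= step * grad_w
        b -= step * grad_b
    return w, b


# Training set generated by the (assumed) rule y = 2x + 1, without noise.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

w, b = gradient_descent(xs, ys)
print(f"w={w:.6f} b={b:.6f}")
```

The step size matters here: for this data the iteration diverges once the step exceeds roughly 0.23, which is one reason the course treats line search and convergence rates as separate topics.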

Optimization algorithms/techniques:

• Sparse optimization

• Iteratively reweighted least squares (IRLS)

• Gradient descent methods

• Online gradient methods

• Stochastic gradient methods

• Newton's method

• Quasi-Newton methods (BFGS)

• Limited-memory BFGS

• Coordinate descent

• Alternating direction method of multipliers

• Penalty method, augmented Lagrangian

• Gradient projection method

• Iterative shrinkage-thresholding method (IST)

• Active set method

• Recursive least squares

• Line search, convergence rate, duality, KKT/optimality conditions
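
As one concrete instance from this list, the sketch below applies iterative shrinkage-thresholding to a small sparse optimization problem of the form min over x of 0.5 * ||Ax - y||^2 + lam * ||x||_1. The matrix A, the vector y, the weight lam, and the step size are assumptions made for illustration; the step is set to 1/L, where L = 4 is the largest eigenvalue of A^T A for this particular A.

```python
def soft_threshold(v, t):
    """Proximal operator of t * |.|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def ist(A, y, lam, step, iters=500):
    """Iterative shrinkage-thresholding for 0.5*||Ax - y||^2 + lam*||x||_1."""
    m, n = len(A), len(A[0])
    x = [0.0] * n
    for _ in range(iters):
        # Gradient of the smooth part: A^T (A x - y).
        residual = [sum(A[i][j] * x[j] for j in range(n)) - y[i] for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        # Gradient step followed by the shrinkage (soft-thresholding) step.
        x = [soft_threshold(x[j] - step * grad[j], step * lam) for j in range(n)]
    return x


# Small diagonal problem (assumed for illustration) whose solution is easy to check by hand.
A = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]]
y = [3.0, 0.2, -1.5]
x = ist(A, y, lam=0.5, step=0.25)
print(x)
```

The middle coordinate ends up exactly zero because its correlation with the data is below the threshold lam, which is the sparsity-inducing effect of the l1 penalty; the other two coordinates are shrunk toward zero by lam.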

References:

1. For machine learning methods: "Machine Learning: A Probabilistic Perspective", Kevin P. Murphy, The MIT Press.

2. For optimization knowledge: "Numerical Optimization", Jorge Nocedal and Stephen J. Wright, 2nd edition, Springer.

3. For optimization techniques in machine learning: "Optimization for Machine Learning", edited by Suvrit Sra, Sebastian Nowozin, and Stephen J. Wright, The MIT Press.

Some lectures will be based on these books, but not all of them. Reading the textbooks is not required, but it is recommended. You are not responsible for textbook material that is not covered in lecture.

Date: 2024-10-04 23:46:32
