Machine Learning - XVI. Recommender Systems 推荐系统(Week 9)

http://blog.csdn.net/pipisorry/article/details/44850971

机器学习Machine Learning - Andrew NG courses学习笔记

Recommender Systems 推荐系统

{an important application of machine learning}

Problem Formulation 问题规划

Note:

1. to allow 0 to 5 stars as well,because that just makes some of the math come out just nicer.

2. for this example, I have loosely 3 romantic or romantic comedy movies and 2 action movies.

3. to look through the data and look at all the movie ratings that are missing and to try topredict
what these values of the question marks should be.

Content Based Recommendations基于内容的推荐

Note:

1. add an extra feature interceptor feature X0, which is equal to 1

2. set n to be the number of features, not counting this X zero intercept term so n is equal to two because we have two features x1 and x2

3. to make predictions, we could treat predicting the ratings of each user as aseparate linear regression problem. So specifically lets say that for each user j we are going to learn a parameter vector theta J which would
be in r n+1, where n is the number of features,and we‘re going to predict user J as rating movie I, with just the inner product between the parameters vector theta and the features "XI".

4. let‘s say that you know we have somehow already gotten a parameter
vector theta 1
for Alice.{线性规划求出:对于Alice评过的每部电影就是一个example,其中example0中x = [0.9 0], y = 5,用梯度下降求出theta}

Optimization algorithm:estimate of parameter vector theta j

Note:

1. to simplify the subsequent math,get rid of this term MJ.that‘s just a constant.

2. because our regularization term here regularizes only the values of theta JK for K not equal to zero.wedon‘t regularize theta 0.

3. can also plug them into a more advanced optimization algorithm like cluster gradient or LBFGS and use that to try to minimize the cost function J as well.

4. content based recommendations,because we assume that we have features for the different movies.that capture what is the content of these movies. How romantic/action is this movie?And we are really using features of the content of the
movies to make our predictions.

Collaborative Filtering协同过滤

{CF has an interesting property:feature learning can start to learn for itself what features to use}

Note:we do not know the values of these features of movies.But assume we‘ve gone to each of our users, and each of our users has told us how much they like the romantic movies and how much they like action
packed movies.each user J just tells us what is the value of theta J for them.

Optimization algorithm

Note:

1. kind of a chicken and egg problem.So randomly guess some value of the thetas.Now based on yourinitial random guess for the thetas, you can then go ahead and use the procedure to learn features for your different movies.

2. by rating a few movies myself,the system learn better features and then these features can be used by the system to make better movie predictions for everyone else.And so there is a sense of collaboration where every user is helping the system learn better
features for the common good. This is this collaborative filtering.

Collaborative Filtering Algorithm协同过渡算法

{more efficient algorithm that doesn‘t need to go back and forth between the x‘s and the thetas, but that can solve for theta and x simultaneously}

Collaborative filteringoptimization objective

Note:

1. Sum over J says, for every user, the sum of all the movies rated by that user.for every movie I,
sum over all the users J that have rated that movie.

2. just something over all the user movie pairs for which you have a rating.

3. if you were to hold the x‘s constant and just minimize with respect to the thetas then you‘d be solving exactly the first problem.

4. Previously we have been using this convention that we have a feature x0 equals one that corresponds to an interceptor.When we are using this sort of formalism where we‘re are actually learning the features,we are actually going to do away with
feature x0
.And so the features we are going to learn x, will be in Rn.

Collaborative filtering algorithm

Vectorization_ Low Rank Matrix Factorization向量化_低秩矩阵分解

Implementational Detail_ Mean Normalization实现细节_均值

from:http://blog.csdn.net/pipisorry/article/details/44850971

时间: 2024-08-27 14:15:27

Machine Learning - XVI. Recommender Systems 推荐系统(Week 9)的相关文章

【RS】Wide & Deep Learning for Recommender Systems - 广泛和深度学习的推荐系统

[论文标题]Wide & Deep Learning for Recommender Systems (DLRS'16) [论文作者] Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra,Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil,Zakaria Haque, Lichan Hong,

论文笔记-Wide & Deep Learning for Recommender Systems

本文提出的W&D是针对rank环节的模型. 网络结构: 本文提出的W&D是针对rank环节的模型. 网络结构: wide是简单的线性模型,但是可以预先对特征做各种变换.交叉等来增加wide模型的非线性性. deep是一个FNN,对高维稀疏类别特征采取embedding降维,embedding的结果是在训练时候学出来的. wide与deep结合的方式,是将两者的输出通过加权最后喂给一个logistic损失函数.值得注意的是,这里是join train并不是ensemble,ensemble是

Introduction - What is machine learning

摘要: 本文是吴恩达 (Andrew Ng)老师<机器学习>课程,第一章<绪论:初识机器学习>中第2课时<什么是机器学习?>的视频原文字幕.为本人在视频学习过程中逐字逐句记录下来以便日后查阅使用.现分享给大家.如有错误,欢迎大家批评指正,在此表示诚挚地感谢!同时希望对大家的学习能有所帮助. What is machine learning? In this article we will try to define what it is and also try to

机器学习系统设计(Building Machine Learning Systems with Python)- Willi Richert Luis Pedro Coelho

机器学习系统设计(Building Machine Learning Systems with Python)- Willi Richert Luis Pedro Coelho 总述 本书是 2014 的,看完以后才发现有第二版的更新,2016.建议阅读最新版,有能力的建议阅读英文版,中文翻译有些地方比较别扭(但英文版的书确实是有些贵). 我读书的目的:泛读主要是想窥视他人思考的方式. 作者写书的目标:面向初学者,但有时间看看也不错.作者说"我希望它能激发你的好奇心,并足以让你保持渴望,不断探索

Awesome Machine Learning

Awesome Machine Learning  A curated list of awesome machine learning frameworks, libraries and software (by language). Inspired by awesome-php. If you want to contribute to this list (please do), send me a pull request or contact me @josephmisiti Als

【MATLAB】Machine Learning (Coursera Courses Outline &amp; Schedule)

课程涉及技术: 梯度下降.线性回归.监督/非监督学习.分类/逻辑回归.正则化.神经网络.梯度检验/数值计算.模型选择/诊断.学习曲线.评估度量.SVM.K-Means聚类.PCA.Map Reduce & Data Parallelism 等- 课程涉及应用: 邮件分类.肿瘤诊断.手写识别.自动驾驶.模型优化.OCR等- Coursera machine learning course materials, including problem sets and my solutions (usi

机器学习算法之旅A Tour of Machine Learning Algorithms

In this post we take a tour of the most popular machine learning algorithms. It is useful to tour the main algorithms in the field to get a feeling of what methods are available. There are so many algorithms available and it can feel overwhelming whe

Machine Learning第九周笔记

博客已经迁移到Marcovaldo's blog Andrew Ng在Machine Learning的第九周介绍了异常检测(anomaly detection)和推荐系统(recommender system),将笔记整理在下面. Anomaly Detection Density Estimation Problem Motivation 视频开头,Andrew Ng告诉我们,异常检测(anomaly detection)主要应用于无监督式学习,但它更像是监督式学习.我们以飞行器发动机为例来

Java Machine Learning Tools &amp; Libraries--转载

原文地址:http://www.demnag.com/b/java-machine-learning-tools-libraries-cm570/?ref=dzone This is a list of 25 Java Machine learning tools & libraries. Weka has a collection of machine learning algorithms for data mining tasks. The algorithms can either be