Deep Learning: Assuming a deep neural network is properly regularized, can adding more layers actually make the performance degrade?

I find this really puzzling. A deeper NN is supposed to be at least as powerful as a shallower NN, and I have already used dropout to prevent overfitting. How can the performance degrade?

Yoshua's Answer

Yoshua Bengio, My lab has been one of the three that started the deep learning approach, bac...

If you do not change the size of the layers and just add more layers, capacity should increase, so you could be overfitting. However, you should first check whether the training error increases or decreases. If it increases (which is also very plausible), it means that adding the layers made the optimization harder for the optimization method and initialization you are using. That could also explain your problem. If instead the training error decreases while the test error increases, you are overfitting.
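The check described above can be sketched as a small helper. This is only an illustration, not part of the original answer: the function name `diagnose_depth_change`, its arguments, and the `tol` threshold are hypothetical, and the error values are assumed to be measured the same way (e.g. misclassification rate) for both networks.

```python
def diagnose_depth_change(train_err_shallow, test_err_shallow,
                          train_err_deep, test_err_deep, tol=0.0):
    """Compare a shallower and a deeper network by their errors.

    All names here are hypothetical; `tol` is an assumed margin below
    which a difference in error is treated as noise.
    """
    train_up = train_err_deep > train_err_shallow + tol
    train_down = train_err_deep < train_err_shallow - tol
    test_up = test_err_deep > test_err_shallow + tol

    if train_up:
        # Training error got worse: the extra layers made optimization
        # harder with the current optimizer and initialization.
        return "optimization"
    if train_down and test_up:
        # Training error improved but test error got worse.
        return "overfitting"
    # Neither pattern from the answer applies clearly.
    return "inconclusive"


if __name__ == "__main__":
    # Made-up error rates, purely for illustration.
    print(diagnose_depth_change(0.10, 0.15, 0.14, 0.18))
    print(diagnose_depth_change(0.10, 0.15, 0.05, 0.20))
```

In practice the differences should be compared across several random seeds before drawing a conclusion, since a single run can be noisy; the `tol` argument is one crude way to account for that.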

Time: 2024-08-13 11:09:29
