RNN and LSTM saliency Predection Scene Label

http://handong1587.github.io/deep_learning/2015/10/09/rnn-and-lstm.html //RNN and LSTM

http://handong1587.github.io/deep_learning/2015/10/09/saliency-prediction.html //saliency Predection

http://handong1587.github.io/deep_learning/2015/10/09/scene-labeling.html //Scene Label

RNN and LSTM

Published: 09 Oct 2015 Category: deep_learning

Types of RNN

1) Plain Tanh Recurrent Nerual Networks

2) Gated Recurrent Neural Networks (GRU)

3) Long Short-Term Memory (LSTM)

Tutorials

A Beginner’s Guide to Recurrent Networks and LSTMs

http://deeplearning4j.org/lstm.html

A Deep Dive into Recurrent Neural Nets

http://nikhilbuduma.com/2015/01/11/a-deep-dive-into-recurrent-neural-networks/

Long Short-Term Memory: Tutorial on LSTM Recurrent Networks

http://people.idsia.ch/~juergen/lstm/index.htm

LSTM implementation explained

http://apaszke.github.io/lstm-explained.html

Recurrent Neural Networks Tutorial

Part 1(Introduction to RNNs): http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/
Part 2(Implementing a RNN using Python and Theano):http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-2-implementing-a-language-model-rnn-with-python-numpy-and-theano/
Part 3(Understanding the Backpropagation Through Time (BPTT) algorithm):http://www.wildml.com/2015/10/recurrent-neural-networks-tutorial-part-3-backpropagation-through-time-and-vanishing-gradients/
Part 4(Implementing a GRU/LSTM RNN): http://www.wildml.com/2015/10/recurrent-neural-network-tutorial-part-4-implementing-a-grulstm-rnn-with-python-and-theano/

Understanding LSTM Networks

Recurrent Neural Networks in DL4J

http://deeplearning4j.org/usingrnns.html

Train RNN

A Simple Way to Initialize Recurrent Networks of Rectified Linear Units

arxiv: http://arxiv.org/abs/1504.00941
gitxiv: http://gitxiv.com/posts/7j5JXvP3kn5Jf8Waj/irnn-experiment-with-pixel-by-pixel-sequential-mnist
github: https://github.com/fchollet/keras/blob/master/examples/mnist_irnn.py
github: https://gist.github.com/GabrielPereyra/353499f2e6e407883b32
blog(“Implementing Recurrent Neural Net using chainer!”): http://t-satoshi.blogspot.jp/2015/06/implementing-recurrent-neural-net-using.html
reddit:https://www.reddit.com/r/MachineLearning/comments/31rinf/150400941_a_simple_way_to_initialize_recurrent/
reddit:https://www.reddit.com/r/MachineLearning/comments/32tgvw/has_anyone_been_able_to_reproduce_the_results_in/

Sequence Level Training with Recurrent Neural Networks

Papers

Generating Sequences With Recurrent Neural Networks

DRAW: A Recurrent Neural Network For Image Generation

arXiv: http://arxiv.org/abs/1502.04623
github: https://github.com/vivanov879/draw
github(Theano): https://github.com/jbornschein/draw
github(Lasagne): https://github.com/skaae/lasagne-draw

Unsupervised Learning of Video Representations using LSTMs(ICML2015)

LSTM: A Search Space Odyssey

paper: http://arxiv.org/abs/1503.04069
notes: https://www.evernote.com/shard/s189/sh/48da42c5-8106-4f0d-b835-c203466bfac4/50d7a3c9a961aefd937fae3eebc6f540
blog(“Dissecting the LSTM”): https://medium.com/jim-fleming/implementing-lstm-a-search-space-odyssey-7d50c3bacf93#.crg8pztop
github: https://github.com/jimfleming/lstm_search

Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets

paper: http://arxiv.org/abs/1503.01007
code: https://github.com/facebook/Stack-RNN

A Critical Review of Recurrent Neural Networks for Sequence Learning

arXiv: http://arxiv.org/abs/1506.00019
intro: “A rigorous & readable review on RNNs”
http://blog.terminal.com/a-thorough-and-readable-review-on-rnns/

Scheduled Sampling for
Sequence Prediction with Recurrent Neural Networks(Winner of MSCOCO image
captioning challenge, 2015)

arXiv: http://arxiv.org/abs/1506.03099

Visualizing and
Understanding Recurrent Networks(Andrej Karpathy, Justin Johnson, Fei-Fei Li)

Grid Long Short-Term
Memory

arxiv: http://arxiv.org/abs/1507.01526
github(Torch7): https://github.com/coreylynch/grid-lstm/

Depth-Gated LSTM

paper: http://arxiv.org/abs/1508.03790
code: GitHub(dglstm.h+dglstm.cc)

Deep Knowledge Tracing

Top-down Tree Long
Short-Term Memory Networks

arxiv: http://arxiv.org/abs/1511.00060
github: https://github.com/XingxingZhang/td-treelstm

Alternative structures
for character-level RNNs(INRIA & Facebook AI Research)

arXiv: http://arxiv.org/abs/1511.06303
github: https://github.com/facebook/Conditional-character-based-RNN

Pixel Recurrent Neural
Networks (Google DeepMind)

arxiv: http://arxiv.org/abs/1601.06759
notes(by Hugo Larochelle): https://www.evernote.com/shard/s189/sh/fdf61a28-f4b6-491b-bef1-f3e148185b18/aba21367d1b3730d9334ed91d3250848

Long Short-Term
Memory-Networks for Machine Reading

arxiv: http://arxiv.org/abs/1601.06733

Lipreading with Long
Short-Term Memory

arxiv: http://arxiv.org/abs/1601.08188

Associative Long
Short-Term Memory

arxiv: http://arxiv.org/abs/1602.03032

Representation of
linguistic form and function in recurrent neural networks

arxiv: http://arxiv.org/abs/1602.08952

Architectural
Complexity Measures of Recurrent Neural Networks

arxiv: http://arxiv.org/abs/1602.08210

Easy-First Dependency
Parsing with Hierarchical Tree LSTMs

arxiv: http://arxiv.org/abs/1603.00375

Training Input-Output
Recurrent Neural Networks through Spectral Methods

arxiv: http://arxiv.org/abs/1603.00954

Learn To Execute Programs

Learning to Execute

arXiv: http://arxiv.org/abs/1410.4615
github: https://github.com/wojciechz/learning_to_execute

Neural
Programmer-Interpreters (Google DeepMind)

arXiv: http://arxiv.org/abs/1511.06279
project page: http://www-personal.umich.edu/~reedscot/iclr_project.html

A
Programmer-Interpreter Neural Network Architecture for Prefrontal Cognitive
Control

paper:https://www.researchgate.net/publication/273912337_A_ProgrammerInterpreter_Neural_Network_Architecture_for_Prefrontal_Cognitive_Control

Convolutional RNN: an
Enhanced Model for Extracting Features from Sequential Data

arxiv: http://arxiv.org/abs/1602.05875

Attention Models

Recurrent Models of
Visual Attention (Google
DeepMind. NIPS2014)

Recurrent Model of
Visual Attention(Google DeepMind)

Show, Attend and Tell:
Neural Image Caption Generation with Visual Attention

paper: http://arxiv.org/abs/1502.03044
code: https://github.com/kelvinxu/arctic-captions

A Neural Attention
Model for Abstractive Sentence Summarization(EMNLP 2015. Facebook AI Research)

arXiv: http://arxiv.org/abs/1509.00685
github: https://github.com/facebook/NAMAS

Effective Approaches
to Attention-based Neural Machine Translation(EMNLP2015)

paper: http://nlp.stanford.edu/pubs/emnlp15_attn.pdf
project: http://nlp.stanford.edu/projects/nmt/

Generating Images from
Captions with Attention

arxiv: http://arxiv.org/abs/1511.02793
github: https://github.com/emansim/text2image
demo: http://www.cs.toronto.edu/~emansim/cap2im.html

Attention and Memory
in Deep Learning and NLP

blog: http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/

Survey on the
attention based RNN model and its applications in computer vision

arxiv: http://arxiv.org/abs/1601.06823

Train RNN

Training Recurrent
Neural Networks (PhD thesis)

atuhor: Ilya Sutskever
thesis: https://www.cs.utoronto.ca/~ilya/pubs/ilya_sutskever_phd_thesis.pdf

Deep learning for
control using augmented Hessian-free optimization

Hierarchical Conflict
Propagation: Sequence Learning in a Recurrent Deep Neural Network

arxiv: http://arxiv.org/abs/1602.08118

Recurrent Batch
Normalization

arxiv: http://arxiv.org/abs/1603.09025
github: https://github.com/iassael/torch-bnlstm

Optimizing Performance
of Recurrent Neural Networks on GPUs

Codes

NeuralTalk
(Deprecated): a Python+numpy project for learning Multimodal Recurrent Neural
Networks that describe images with sentences

github: https://github.com/karpathy/neuraltalk

NeuralTalk2: Efficient
Image Captioning code in Torch, runs on GPU

github: https://github.com/karpathy/neuraltalk2

char-rnn in Blocks

github: https://github.com/johnarevalo/blocks-char-rnn

Project:
pycaffe-recurrent

code: https://github.com/kuprel/pycaffe-recurrent/

Using neural networks
for password cracking

Recurrent neural
networks for decoding CAPTCHAS

torch-rnn: Efficient,
reusable RNNs and LSTMs for torch

github: https://github.com/jcjohnson/torch-rnn

Deploying a model
trained with GPU in Torch into JavaScript, for everyone to use

blog: http://testuggine.ninja/blog/torch-conversion
demo: http://testuggine.ninja/DRUMPF-9000/
github: https://github.com/Darktex/char-rnn

LSTM implementation on
Caffe

github: https://github.com/junhyukoh/caffe-lstm

Blog

Survey on
Attention-based Models Applied in NLP

http://yanran.li/peppypapers/2015/10/07/survey-attention-model-1.html

Survey on Advanced
Attention-based Models

http://yanran.li/peppypapers/2015/10/07/survey-attention-model-2.html

Online Representation
Learning in Recurrent Neural Language Models

http://www.marekrei.com/blog/online-representation-learning-in-recurrent-neural-language-models/

Fun with Recurrent
Neural Nets: One More Dive into CNTK and TensorFlow

http://esciencegroup.com/2016/03/04/fun-with-recurrent-neural-nets-one-more-dive-into-cntk-and-tensorflow/

Materials to
understand LSTM

https://medium.com/@shiyan/materials-to-understand-lstm-34387d6454c1#.4mt3bzoau

Understanding LSTM and
its diagrams (★★★★★)

Persistent RNNs: 30
times faster RNN layers at small mini-batch sizes (Greg Diamos, Baidu Silicon
Valley AI Lab)

http://svail.github.io/persistent_rnns/

All of Recurrent
Neural Networks

https://medium.com/@jianqiangma/all-about-recurrent-neural-networks-9e5ae2936f6e#.q4s02elqg

Resources

Awesome Recurrent
Neural Networks - A curated list of resources dedicated to RNN

homepage: http://jiwonkim.org/awesome-rnn/
github: https://github.com/kjw0612/awesome-rnn

Jürgen Schmidhuber’s
page on Recurrent Neural Networks

http://people.idsia.ch/~juergen/rnn.html

Reading and
Questions

Are there any
Recurrent convolutional neural network network implementations out there ?

reddit:https://www.reddit.com/r/MachineLearning/comments/4chu3y/are_there_any_recurrent_convolutional_neural/

« Reinforcement Learning Saliency Prediction »

Saliency Prediction

Published: 09 Oct 2015 Category: deep_learning

This task involves predicting the salient regions of an image given by human eye fixations.

Large-scale optimization of hierarchical features for saliency prediction in natural images

paper: http://coxlab.org/pdfs/cvpr2014_vig_saliency.pdf

Predicting Eye Fixations using Convolutional Neural Networks

paper: http://www.escience.cn/system/file?fileId=72648

DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

arxiv: http://arxiv.org/abs/1510.02927

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection

arxiv: http://arxiv.org/abs/1510.05484

SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection

paper: www.shengfenghe.com/supercnn-a-superpixelwise-convolutional-neural-network-for-salient-object-detection.html

Shallow and Deep Convolutional Networks for Saliency Prediction

arxiv: http://arxiv.org/abs/1603.00845
github: https://github.com/imatge-upc/saliency-2016-cvpr

Scene Labeling

Published: 09 Oct 2015 Category: deep_learning

Papers

Learning hierarchical features for scene labeling

intro: “Their approach comprised of densely computing multi-scale CNN features for each pixel and aggregating them over image regions upon which they are classified. However, their methodstill required the post-processing step of generating over-segmented regions, like superpixels, for obtaining the final segmentation result. Additionally, the CNNs used for multi-scale feature learning were not very deep with only three convolution layers.”
paper: http://yann.lecun.com/exdb/publis/pdf/farabet-pami-13.pdf

Indoor Semantic Segmentation using depth information

arxiv: http://arxiv.org/abs/1301.3572

Multi-modal unsupervised feature learning for rgb-d scene labeling

paper: http://www3.ntu.edu.sg/home/wanggang/WangECCV2014.pdf

Using neon for Scene Recognition: Mini-Places2

intro: This is an implementation of the deep residual network used for Mini-Places2 as described in He et. al., “Deep Residual Learning for Image Recognition”.
blog: http://www.nervanasys.com/using-neon-for-scene-recognition-mini-places2/
github: https://github.com/hunterlang/mpmz

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models

arxiv: http://arxiv.org/abs/1603.08575

Challenges

Large-scale Scene Understanding Challenge

homepage: http://lsun.cs.princeton.edu/

时间： 2024-10-13 06:38:43

RNN and LSTM saliency Predection Scene Label的相关文章

RNN和LSTM

一.RNN 全称为Recurrent Neural Network,意为循环神经网络,用于处理序列数据. 序列数据是指在不同时间点上收集到的数据,反映了某一事物.现象等随时间的变化状态或程度.即数据之间有联系. RNN的特点:1,,层间神经元也有连接(主要为隐层):2,共享参数其结构如上图所示,数据为顺序处理,在处理长序列数据时,极易导致梯度消失问题. 二.LSTM LSTM为长短期记忆,是一种变种的RNN,在RNN的基础上引入了细胞状态,根据细胞状态可决定哪些状态应该保留下来,哪些状态应该被

深度学习与自然语言处理之五：从RNN到LSTM

/* 版权声明:可以任意转载,转载时请标明文章原始出处和作者信息 .*/ author: 张俊林大纲如下: 1.RNN 2.LSTM 3.GRN 4.Attention Model 5.应用 6.探讨与思考扫一扫关注微信号:"布洛卡区" ,深度学习在自然语言处理等智能应用的技术研讨与科普公众号.

深度学习之六，基于RNN(GRU,LSTM)的语言模型分析与theano代码实现

引言前面已经介绍过RNN的基本结构,最基本的RNN在传统的BP神经网络上,增加了时序信息,也使得神经网络不再局限于固定维度的输入和输出这个束缚,但是从RNN的BPTT推导过程中,可以看到,传统RNN在求解梯度的过程中对long-term会产生梯度消失或者梯度爆炸的现象,这个在这篇文章中已经介绍了原因,对于此,在1997年的Grave大作[1]中提出了新的新的RNN结构:Long Short Term Dependency.LSTM在传统RNN的基础上加了许多的"门",如input

转：深度学习与自然语言处理之五：从RNN到LSTM

原文地址:http://blog.csdn.net/malefactor/article/details/50436735/ 大纲如下: 1.RNN 2.LSTM 3.GRN 4.Attention Model 5.应用 6.探讨与思考

RNN 与 LSTM 的应用

之前已经介绍过关于 Recurrent Neural Nnetwork 与 Long Short-Trem Memory 的网络结构与参数求解算法( 递归神经网络(Recurrent Neural Networks,RNN) ,LSTM网络(Long Short-Term Memory )),本文将列举一些 RNN 与 LSTM 的应用, RNN (LSTM)的样本可以是如下形式的:1)输入输出均为序列:2)输入为序列,输出为样本标签:3)输入单个样本,输出为序列.本文将列举一些 RNN(LST

3. RNN神经网络-LSTM模型结构

1. RNN神经网络模型原理 2. RNN神经网络模型的不同结构 3. RNN神经网络-LSTM模型结构 1. 前言之前我们对RNN模型做了总结.由于RNN也有梯度消失的问题,因此很难处理长序列的数据,大牛们对RNN做了改进,得到了RNN的特例LSTM(Long Short-Term Memory),它可以避免常规RNN的梯度消失,因此在工业界得到了广泛的应用.下面我们就对LSTM模型做一个总结. 2. LSTM模型结构我们先看下LSTM的整体结构. 由于RNN梯度消失的问题,大牛们对于序列

浅谈RNN、LSTM + Kreas实现及应用

本文主要针对RNN与LSTM的结构及其原理进行详细的介绍,了解什么是RNN,RNN的1对N.N对1的结构,什么是LSTM,以及LSTM中的三门(input.ouput.forget),后续将利用深度学习框架Kreas,结合案例对LSTM进行进一步的介绍. 一.RNN的原理 RNN(Recurrent Neural Networks),即全称循环神经网络,它是一种对序列型的数据进行建模的深度模型.如图1.1所示. 图1.1 1.其中为序列数据.即神经网络的输入,例如nlp中,X1可以看作第一个单词

深度学习：浅谈RNN、LSTM+Kreas实现与应用

主要针对RNN与LSTM的结构及其原理进行详细的介绍,了解什么是RNN,RNN的1对N.N对1的结构,什么是LSTM,以及LSTM中的三门(input.ouput.forget),后续将利用深度学习框架Kreas,结合案例对LSTM进行进一步的介绍. 一.RNN的原理 RNN(Recurrent Neural Networks),即全称循环神经网络,它是一种对序列型的数据进行建模的深度模型.如图1.1所示. 图1.1 1.其中为序列数据.即神经网络的输入,例如nlp中,X1可以看作第一个单词.

（数据科学学习手札39）RNN与LSTM基础内容详解

一.简介循环神经网络(recurrent neural network,RNN),是一类专门用于处理序列数据(时间序列.文本语句.语音等)的神经网络,尤其是可以处理可变长度的序列:在与传统的时间序列分析进行比较的过程之中,RNN因为其梯度弥散等问题对长序列表现得不是很好,而据此提出的一系列变种则展现出很明显的优势,最具有代表性的就是LSTM(long short-term memory),而本文就从标准的循环神经网络结构和原理出发,再到LSTM的网络结构和原理,对其有一个基本的认识和阐述:

猜你喜欢

【BZOJ 3735】苹果树树上莫队(树分块+离线莫队+鬼畜的压行)

学习了树上莫队,树分块后对讯问的$dfs序$排序,然后就可以滑动树链处理答案了. 关于树链的滑动,只需要特殊处理一下$LCA$就行了. 在这里一条树链保留下来给后面的链来转移的$now$的为这条树链上 ...

Hello World——用思考揭开世界的一角

在开始进入正题之前,还是先按照惯例,介绍一下人物背景. 本人女,92年11月生,是标准的90后,喜欢听周杰伦,最喜欢的作家是大仲马,只读过一点儿韩寒,经历过郭敬明好几版<小时代>的摧残,爱 ...

perl 大文本词频统计.

思想是设置子文本最大长度,然后分割成多个子文本, 最后合并. 词频则是当前位置字和前一位置的字的组合进入hash. 代码如下 use Encode; ##编码解码 system("tim ...

《编程导论（Java）》格言录

★的后面重要言论/建议/格言-- ★计算机软件开发的核心有二:程序的组织(面向对象技术).问题求解(算法). ★柏拉图法则:类的世界独立存在,对象世界由类创建而来. ★面向对象技术通过颠倒的理念世界而 ...

Android Fragment完全解析，关于碎片你所需知道的一切

本文首发于CSDN博客,转载请注明出处:http://blog.csdn.net/guolin_blog/article/details/8881711 我们都知道,Android上的界面展示都是通过 ...

关于python知识点的blog

http://www.cnblogs.com/wupeiqi/articles/4356675.html python线程 http://www.cnblogs.com/bizhu/archiv ...

响应设备的横竖屏状态，旋转界面达到横竖屏效果

监听 UIDeviceOrientationDidChangeNotification的广播,再根据[[UIDevice currentDevice] orientation]获取到屏幕的方向. -( ...

NET Core Hosting

ASP.NET Core 运行原理解剖[1]:Hosting ASP.NET Core 是新一代的 ASP.NET,第一次出现时代号为 ASP.NET vNext,后来命名为ASP.NET 5,随着它 ...

【T^T 1736】【FJUTOJ 1077】排座位

http://59.77.139.92/problem.php?id=1077 水题,小心PE // <1736.cpp> - 11/12/16 17:17:52 // This file ...

EF CodeFirst下的自动迁移

当我们修改数据模型,添加一个如下字段再次运行程序,会因为数据库结构与模型不一致而报错为解决以上错误可以采取以下三种方式 1. 删除数据库,重新运行站点,会重新生成数据库,这样就会丢失数据 2. ...

wkhtmltopdf错误解决办法

Odoo/openerp 打印报表时,无法输出PDF格式,提示下面的错误 Unable to find Wkhtmltopdf on this system. The report will be s ...

python egg打包安装

python ./setup.py sdist/opt/calamari/venv/bin/python2.6 ./setup.py install

quartz 数据表字典

首次整理,可能有错误,还有少许的未整理,希望看到的人能给点补充(包括指点错误) 表名表说明自定义触发器 QRTZ_BLOB_TRIGGERS 列名(英) 列名(中) 数据类型列长度是否为空列 ...

通过编程为Outlook 2007添加邮件规则

Outlook 所支持的邮件规则相当有用,我们经常需要针对某些特征的邮件做特殊的处理.例如将其移动到某个特定文件夹,或者删除它等等. Outlook所支持的邮件规则主要两大类:收到邮件时和发送邮件时 ...

UVA 10010- Where's Waldorf?（八方向寻找字符串）

Where's Waldorf? Time Limit:3000MS Memory Limit:0KB 64bit IO Format:%lld & %llu Submit S ...

支付宝扫码支付

应用场景二维码收款接口官方文档:https://doc.open.alipay.com/doc2/apiDetail.htm?spm=a219a.7395905.0.0.O4mxCP&doc ...

做人的道理

刚刚回来,喝酒喝的有点多,不过我仍然打开电脑,想写点什么. 我和团队的所有成员在一家农家饭庄吃饭,庆祝我们的项目第一阶段通过验收.其实这已经不是我第一次带团队了,已经没有了第一次带团队时的担心和种种. ...

Nginx源码分析—worker进程的创建

假设现在ngx_init_cycle已经结束(毕竟这个函数确实庞大),也就是说关于nginx的初始化都已经结束.那么看看如何创建进程模型ngx_master_process_cycle. 在这个函数中 ...

【APUE】vim常用命令

转自:http://coolshell.cn/articles/5426.html 基本命令: i → Insert 模式,按 ESC 回到 Normal 模式. x → 删当前光标所在的一个字符. ...

Oracle课程档案，第六天

体系结构: instance:实例 database:数据库 RAC:多实例对一个数据库 SGA:最大总数 (系统全局区域)缓存区 PGA:其中的一块, 也是一个缓存区 server process: ...

专题

随机推荐

© 2024 憋错料 | info#biecuoliao.com | 10 q. 0.022 s.