Awesome Recurrent Neural Networks

A curated list of resources dedicated to recurrent neural networks (closely related to deep learning).

Maintainers - Jiwon Kim, Myungsub Choi

We have pages for other topics: awesome-deep-vision, awesome-random-forest

Contributing

Please feel free to pull requests, email Myungsub Choi ([email protected]) or join our chats to add links.

Sharing

Codes

Theano - Python
- Simple IPython tutorial on Theano
- Deep Learning Tutorials
  - RNN for semantic parsing of speech
  - LSTM network for sentiment analysis
- Keras : Theano-based Deep Learning Library
- theano-rnn by Graham Taylor
- Passage : Library for text analysis with RNNs
Caffe - C++ with MATLAB/Python wrappers
- LRCN by Jeff Donahue
Torch - Lua
- char-rnn by Andrej Karpathy : multi-layer RNN/LSTM/GRU for training/sampling from character-level language models
- LSTM by Wojciech Zaremba : Long Short Term Memory Units to train a language model on word level Penn Tree Bank dataset
- Oxford by Nando de Freitas : Oxford Computer Science - Machine Learning 2015 Practicals
Etc.
- RNNLIB by Alex Graves : C++ based LSTM library
- RNNLM by Tomas Mikolov : C++ based simple code
- neuraltalk by Andrej Karpathy : numpy-based RNN/LSTM implementation
- gist by Andrej Karpathy : raw numpy code that implements an efficient batched LSTM

Theory

Lectures

Stanford NLP (CS224d) by Richard Socher
- Lecture Note 3 : neural network basics
- Lecture Note 4 : RNN language models, bi-directional RNN, GRU, LSTM
Oxford Machine Learning by Nando de Freitas
- Lecture 12 : Recurrent neural networks and LSTMs
- Lecture 13 : (guest lecture) Alex Graves on Hallucination with RNNs

Books / Thesis

Alex Graves (2008)
- Supervised Sequence Labelling with Recurrent Neural Networks
Tomas Mikolov (2012)
- Statistical Language Models based on Neural Networks
Ilya Sutskever (2013)
- Training Recurrent Neural Networks
Richard Socher (2014)
- Recursive Deep Learning for Natural Language Processing and Computer Vision

Network Variants

Bi-directional RNN [Paper]
- Mike Schuster and Kuldip K. Paliwal, Bidirectional Recurrent Neural Networks, Trans. on Signal Processing 1997
LSTM [Paper]
- Sepp Hochreiter and Jurgen Schmidhuber, Long Short-Term Memory, Neural Computation 1997
Multi-dimensional RNN [Paper]
- Alex Graves, Santiago Fernandez, and Jurgen Schmidhuber, Multi-Dimensional Recurrent Neural Networks, ICANN 2007
GRU (Gated Recurrent Unit) [Paper]
- Kyunghyun Cho, Bart van Berrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio, Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, arXiv:1406.1078 / EMNLP 2014
GFRNN [Paper-arXiv] [Paper-ICML] [Supplementary]
- Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, Yoshua Bengio, Gated Feedback Recurrent Neural Networks, arXiv:1502.02367 / ICML 2015

Surveys

Klaus Greff, Rupesh Kumar Srivastava, Jan Koutnik, Bas R. Steunebrink, Jurgen Schmidhuber,LSTM: A Search Space Odyssey, arXiv:1503.04069
Zachary C. Lipton, A Critical Review of Recurrent Neural Networks for Sequence Learning, arXiv:1506.00019
Andrej Karpathy, Justin Johnson, Li Fei-Fei, Visualizing and Understanding Recurrent Networks, arXiv:1506.02078
Rafal Jozefowicz, Wojciech Zaremba, Ilya Sutskever, An Empirical Exploration of Recurrent Network Architectures, ICML, 2015.

Applications

Language Modeling

Tomas Mikolov, Martin Karafiat, Lukas Burget, Jan "Honza" Cernocky, Sanjeev Khudanpur,Recurrent Neural Network based Language Model, Interspeech 2010 [Paper]
Tomas Mikolov, Stefan Kombrink, Lukas Burget, Jan "Honza" Cernocky, Sanjeev Khudanpur,Extensions of Recurrent Neural Network Language Model, ICASSP 2011 [Paper]
Stefan Kombrink, Tomas Mikolov, Martin Karafiat, Lukas Burget, Recurrent Neural Network based Language Modeling in Meeting Recognition, Interspeech 2011 [Paper]

Speech Recognition

Geoffrey Hinton, Li Deng, Dong Yu, George E. Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N. Sainath, and Brian Kingsbury,Deep Neural Networks for Acoustic Modeling in Speech Recognition, IEEE Signam Processing Magazine 2012 [Paper]
Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton, Speech Recognition with Deep Recurrent Neural Networks, arXiv:1303.5778 / ICASSP 2013 [Paper]

Machine Translation

Univ. Montreal [Paper]
- Kyunghyun Cho, Bart van Berrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio, Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, arXiv:1406.1078 / EMNLP 2014
Google [Paper]
- Ilya Sutskever, Oriol Vinyals, and Quoc V. Le, Sequence to Sequence Learning with Neural Networks, arXiv:1409.3215 / NIPS 2014
Univ. Montreal [Paper]
- Dzmitry Bahdanau, KyungHyun Cho, and Yoshua Bengio, Neural Machine Translation by Jointly Learning to Align and Translate, arXiv:1409.0473 / ICLR 2015

Image Captioning

Baidu + UCLA [Web] [Paper-arxiv1], [Paper-ICLR]
- Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, and Alan L. Yuille, Explain Images with Multimodal Recurrent Neural Networks, arXiv:1410.1090
- Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan L. Yuille, Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN), arXiv:1412.6632 / ICLR 2015
Univ. Toronto [Paper] [Web demo]
- Ryan Kiros, Ruslan Salakhutdinov, and Richard S. Zemel, Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models, arXiv:1411.2539 / TACL 2015
Berkeley [Web] [Paper]
- Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell, Long-term Recurrent Convolutional Networks for Visual Recognition and Description, arXiv:1411.4389 / CVPR 2015
Google [Paper]
- Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan, Show and Tell: A Neural Image Caption Generator, arXiv:1411.4555 / CVPR 2015
Microsoft [Paper]
- Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, Lawrence Zitnick, and Geoffrey Zweig, From Captions to Visual Concepts and Back, arXiv:1411.4952 / CVPR 2015
Microsoft [Paper-arxiv], [Paper-CVPR]
- Xinlei Chen, and C. Lawrence Zitnick, Learning a Recurrent Visual Representation for Image Caption Generation
- Xinlei Chen, and C. Lawrence Zitnick, Mind’s Eye: A Recurrent Visual Representation for Image Caption Generation, CVPR 2015
Univ. Toronto + Univ. Montreal [Web] [Paper]
- Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio, Show, Attend, and Tell: Neural Image Caption Generation with Visual Attention, arXiv:1502.03044 / ICML 2015
Idiap + EPFL + Facebook [Paper]
- Remi Lebret, Pedro O. Pinheiro, and Ronan Collobert, Phrase-based Image Captioning, arXiv:1502.03671 / ICML 2015
Baidu + UCLA [Paper]
- Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan L. Yuille, Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images, arXiv:1504.06692

Video Captioning

Berkeley [Web] [Paper]
- Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell, Long-term Recurrent Convolutional Networks for Visual Recognition and Description, arXiv:1411.4389 / CVPR 2015
UT Austin + UML + Berkeley [Paper]
- Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, and Kate Saenko, Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv:1412.4729
Microsoft [Paper]
- Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, and Yong Rui, Joint Modeling Embedding and Translation to Bridge Video and Language, arXiv:1505.01861
UT Austin + Berkeley + UML [Paper]
- Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, and Kate Saenko, Sequence to Sequence--Video to Text, arXiv:1505.00487

Question Answering

MSR + Virginia Tech. [Web] [Paper]
- Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh, VQA: Visual Question Answering, arXiv:1505.00468 / CVPR 2015 SUNw:Scene Understanding workshop
MPI + Berkeley [Web] [Paper]
- Mateusz Malinowski, Marcus Rohrbach, and Mario Fritz, Ask Your Neurons: A Neural-based Approach to Answering Questions about Images, arXiv:1505.01121
Univ. Toronto [Paper] [Dataset]
- Mengye Ren, Ryan Kiros, and Richard Zemel, Image Question Answering: A Visual Semantic Embedding Model and a New Dataset, arXiv:1505.02074 / ICML 2015 deep learning workshop
Baidu + UCLA [Paper] [Dataset]
- Hauyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, and Wei Xu, Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering, arXiv:1505.05612

Image Generation

Karol Gregor, Ivo Danihelka, Alex Graves, Danilo J. Rezende, and Daan Wierstra, DRAW: A Recurrent Neural Network for Image Generation, ICML 2015 [Paper]
Angeliki Lazaridou, Dat T. Nguyen, R. Bernardi, and M. Baroni, Unveiling the Dreams of Word Embeddings: Towards Language-Driven Image Generation, arXiv:1506.03500
Lucas Theis and Matthias Bethge, Generative Image Modeling Using Spatial LSTMs,arXiv:1506.03478

Turing Machines

A.Graves, G. Wayne, and I. Danihelka., Neural Turing Machines, arXiv preprint arXiv:1410.5401 [Paper]
Jason Weston, Sumit Chopra, Antoine Bordes, Memory Networks, arXiv:1410.3916. [Paper]
Wojciech Zaremba, Ilya Sutskever, Reinforcement Learning Neural Turing Machines,arXiv:1505.00521. [Paper]

Robotics

Marvin Zhang, Sergey Levine, Zoe McCarthy, Chelsea Finn, Pieter Abbeel, Policy Learning with Continuous Memory States for Partially Observed Robotic Control, arXiv:1507.01273. [Paper]

Datasets

Image Captioning
Image Question Answering - all based on MS COCO images
- VQA
- Image QA
- [Multilingual Image QA] : in Chinese, with English translation

时间： 2024-10-12 11:09:56

Awesome Recurrent Neural Networks

Awesome Recurrent Neural Networks

Contributing

Sharing

Table of Contents

Codes

Theory

Lectures

Books / Thesis

Network Variants

Surveys

Applications

Language Modeling

Speech Recognition

Machine Translation

Image Captioning

Video Captioning

Question Answering

Image Generation

Turing Machines

Robotics

Datasets

Awesome Recurrent Neural Networks的相关文章

Attention and Augmented Recurrent Neural Networks

(zhuan) Attention in Long Short-Term Memory Recurrent Neural Networks

Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs

《转》循环神经网络(RNN, Recurrent Neural Networks)学习笔记：基础理论

Recurrent Neural Networks Tutorial, Part 2 – Implementing a RNN with Python, Numpy and Theano

转：RNN(Recurrent Neural Networks)

RNN(Recurrent Neural Networks)公式推导和实现

第十四章——循环神经网络（Recurrent Neural Networks）（第二部分）

Recurrent Neural Networks, LSTM, GRU