Read the original document by opening this link in a new tab.
Table of Contents
1. Introduction
2. Previous Work
3. Constant Error Backprop
Summary
Abstract: Learning to store information over extended time intervals via recurrent backpropagation takes a very long time, mostly due to insufficient, decaying error backflow. We briefly review Hochreiter's 1991 analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called 'Long Short-Term Memory' (LSTM)...