OPUS 4 | Search

1 search hit

1 to 1

Improving gradient-based LSTM training for offline handwriting recognition by careful selection of the optimization method (2016)

Schall, Martin ; Schambach, Marc-Peter ; Franz, Matthias O.

Recent years have seen the proposal of several different gradient-based optimization methods for training artificial neural networks. Traditional methods include steepest descent with momentum, newer methods are based on per-parameter learning rates and some approximate Newton-step updates. This work contains the result of several experiments comparing different optimization methods. The experiments were targeted at offline handwriting recognition using hierarchical subsampling networks with recurrent LSTM layers. We present an overview of the used optimization methods, the results that were achieved and a discussion of why the methods lead to different results.

1 to 1

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

1 search hit