Bidirectional LSTM networks for improved phoneme classification and recognition

Alex Graves, Santiago Fernández, Jürgen Schmidhuber

Research output: Chapter in Book/Report/Conference proceedingConference contribution

414 Scopus citations

Abstract

In this paper, we carry out two experiments on the TIMIT speech corpus with bidirectional and unidirectional Long Short Term Memory (LSTM) networks. In the first experiment (framewise phoneme classification) we find that bidirectional LSTM outperforms both unidirectional LSTM and conventional Recurrent Neural Networks (RNNs). In the second (phoneme recognition) we find that a hybrid BLSTM-HMM system improves on an equivalent traditional HMM system, as well as unidirectional LSTM-HMM. © Springer-Verlag Berlin Heidelberg 2005.
Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages799-804
Number of pages6
StatePublished - Dec 1 2005
Externally publishedYes

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Bidirectional LSTM networks for improved phoneme classification and recognition'. Together they form a unique fingerprint.

Cite this