Open Vocabulary Handwriting Recognition Using Combined Word-Level and Character-Level Language Models

Michal Kozielski
Stefan Hahn
Ralf Schlüter
Hermann Ney
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)(2013), pp. 8257-8261

Abstract

In this paper, we present a unified search strategy for open vocabulary handwriting recognition using weighted finite state transducers. Additionally to a standard word-level language model we introduce a separate n-gram character-level language model for out-of-vocabulary word detection and recognition. The probabilities assigned by those two models are combined into one Bayes decision rule. We evaluate the proposed method on the IAM database of English handwriting. An improvement from 22.2% word error rate to 17.3% is achieved comparing to the closed-vocabulary scenario and the best published result.

Research Areas