Stroke Verification with Gray-level Image for Hangul Video Text Recognition

Jinsik Kim
Seonghun Lee
Jin Hyung Kim
Proceedings of 18th International Conference on Pattern Recognition (ICPR), Hong Kong, China, August 20-24, 2006, pp. 1074-1077

Abstract

Traditional OCR uses binarization technique, which makes OCR simple. But it makes strokes ambiguous and that causes recognition errors. Main reason of those errors is similar grapheme pair confusing error. It can be reduced by verifying ambiguous area of gray level image. After checking whether there is similar grapheme pair by analyzing traditional OCR result candidates, the base stroke of confused grapheme can be found using the fitness function which reflects the base stroke characteristics. The possibility of confused stroke existence can be measured by analyzing the boundary area of the base stroke. The result is merged with traditional OCR using score-probability converting. We achieved 68.1% error reduction for target grapheme pair errors by the proposed method and it means that 23.1 % total error is reduced

Research Areas