Stroke Verification with Gray-level Image for Hangul Video Text Recognition
Abstract
Traditional OCR uses binarization technique, which makes OCR simple. But it makes strokes ambiguous and that causes recognition errors. Main reason of those errors is similar grapheme pair confusing error. It can be reduced by verifying ambiguous area of gray level image. After checking whether there is similar grapheme pair by analyzing traditional OCR result candidates, the base stroke of confused grapheme can be found using the fitness function which reflects the base stroke characteristics. The possibility of confused stroke existence can be measured by analyzing the boundary area of the base stroke. The result is merged with traditional OCR using score-probability converting. We achieved 68.1% error reduction for target grapheme pair errors by the proposed method and it means that 23.1 % total error is reduced