Entropy Quantifiers Useful for Establishing Equivalence between Text Document Images
Main Authors: | , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Summary: | Many tasks in document image analysis require establishing the equivalence of document images, if possible without OCRing the text contents; in some cases OCRs do not exist. In this paper we propose to employ the notion of entropy to 'feel' the text content in a document image without actually reading it, and hence to establish the equivalence or otherwise of two corresponding text components (line/word/character). We introduce the Conventional Entropy Quantifier (CEQ) and also define the Modified Entropy Quantifier (MEQ) to measure the energy content in the components. The results of experiments performed at the line, word and character levels are reported. These initial steps are expected, in subsequent work, to lead to establishing the equivalence between two text document images. |
---|---|
DOI: | 10.1109/ICCIMA.2007.304 |