Loading…

Intelligent hierarchical layout segmentation of document images on the basis of colour content

This paper proposes a general methodology for automatic layout segmentation of documents. We first use colour histograms for extracting dominant colours of an image. This information is then used to hierarchically segment documents into regions of interest represented as polygons. If a region of int...

Full description

Saved in:
Bibliographic Details
Main Authors: Mighlani, D., Hennig, A., Sherkat, N., Whitrow, R.J.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper proposes a general methodology for automatic layout segmentation of documents. We first use colour histograms for extracting dominant colours of an image. This information is then used to hierarchically segment documents into regions of interest represented as polygons. If a region of interest is a picture the algorithm intelligently refrains from segmenting it further, while coloured regions that contain text are subsegmented. The method has been tested on 50 real life documents, such as office letters, brochures, and technical papers, scanned at 100/spl times/100 dpi resolution. Regions are detected with about 68% reliability. A critical analysis of the results is presented.
DOI:10.1109/TENCON.1997.647289