Loading…
FoveaBox with Consistent Label Assignment Distillation for Page Object Detection
With the increasing prevalence of digital documents replacing traditional paper documents, there is an urgent need to develop techniques to extract information from them. In particular, image-based documents pose a significant challenge in object detection, where the identification of objects such a...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | With the increasing prevalence of digital documents replacing traditional paper documents, there is an urgent need to develop techniques to extract information from them. In particular, image-based documents pose a significant challenge in object detection, where the identification of objects such as figures, tables, and captions can be difficult. The crux of object detection lies in the accuracy of label assignment, and we all know that the human visual system is highly capable and proficient in recognizing objects. In light of this, we propose a novel approach, FoveaLAD, which integrates FoveaBox drawed inspiration from the human eye, and Label Assignment Distillation to enhance the label assignment process. We evaluate FoveaLAD on the UIT-DODV-Ext image document dataset, which comprises three classes: Table, Figure, and Caption. Our findings demonstrate that FoveaLAD effectively selects different scales for objects of varying sizes and improves the accuracy of label assignment in object detection. |
---|---|
ISSN: | 2770-6850 |
DOI: | 10.1109/MAPR59823.2023.10288889 |