
libcrn, an Open-Source Document Image Processing Library

Bibliographic Details
Main Authors: Leydier, Yann; Duong, Jean; Bres, Stephane; Eglin, Veronique; Lebourgeois, Frank; Tola, Martial
Format: Conference Proceeding
Language: English
Description
Summary: In this paper we introduce libcrn, a multiplatform open-source document image processing library aimed at researchers and companies. It is written in C++11 and has a non-contaminating license that makes it available for use in any project without legal constraints. The features include low-level image processing (color format conversion, binarization, convolution, PDE...), document-image-specific tools (connected components extraction, recursive block description, PDF export...), maths (matrix arithmetic, linear algebra, GMMs, equation solvers...), and classification and clustering (kNN, k-means, HMMs...). The API is comprehensively documented, and libcrn's architecture follows modern C++ guidelines to make the library easier to use and to enforce its safe usage. A sample OCR, which is only 30 lines long, is described to illustrate libcrn's scope of possibilities.
ISSN: 2167-6445
DOI: 10.1109/ICFHR.2016.0049