Loading…

Improving Nastalique specific pre-recognition process for Urdu OCR

Urdu language is written using Arabic script in Nastalique writing style. Nastalique script is highly cursive, context sensitive and is hard to process as only the last character in its ligature sits on the baseline. In addition, it exhibits character and ligature level spatial overlap. Due to these...

Full description

Saved in:
Bibliographic Details
Main Authors: Javed, S.T., Hussain, S.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Javed, S.T.
Hussain, S.
description Urdu language is written using Arabic script in Nastalique writing style. Nastalique script is highly cursive, context sensitive and is hard to process as only the last character in its ligature sits on the baseline. In addition, it exhibits character and ligature level spatial overlap. Due to these factors, the placement of dots and other diacritics is also highly contextual and variable. There is now increasing amount of work to process and recognize Nastalique script to develop Urdu OCR. This paper proposes improvements to these methods. The paper focuses on Nastalique specific pre-processing methods which can be employed before the text recognition process. The recognition and post recognition processes will be addressed separately.
doi_str_mv 10.1109/INMIC.2009.5383111
format conference_proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5383111</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5383111</ieee_id><sourcerecordid>5383111</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-581c3a14788ffad8c88b9177b407fd6ffff1081636fd09303aad84955f89d4c73</originalsourceid><addsrcrecordid>eNo1kNFKwzAYhSMy0M2-gN7kBTr_NEnz51KLzsLcQOb1yNJkRLa2Jp3g21txnpvDgY_D4RByy2DOGOj7evVaV_MCQM8lR84YuyCZVshEIYRAxfUlmf6HQk3I9JfVIBHwimQpfcAoIbkAdU0e62Mfu6_Q7unKpMEcwufJ0dQ7G3ywtI8uj852-zYMoWvH3FmXEvVdpO-xOdF19XZDJt4cksvOPiOb56dN9ZIv14u6eljmQcOQS2SWGyYUovemQYu400yp3TjDN6UfxQBZyUvfgObAzQgJLaVH3Qir-Izc_dUG59y2j-Fo4vf2fAH_AfOvTTw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Improving Nastalique specific pre-recognition process for Urdu OCR</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Javed, S.T. ; Hussain, S.</creator><creatorcontrib>Javed, S.T. ; Hussain, S.</creatorcontrib><description>Urdu language is written using Arabic script in Nastalique writing style. Nastalique script is highly cursive, context sensitive and is hard to process as only the last character in its ligature sits on the baseline. In addition, it exhibits character and ligature level spatial overlap. Due to these factors, the placement of dots and other diacritics is also highly contextual and variable. There is now increasing amount of work to process and recognize Nastalique script to develop Urdu OCR. This paper proposes improvements to these methods. The paper focuses on Nastalique specific pre-processing methods which can be employed before the text recognition process. The recognition and post recognition processes will be addressed separately.</description><identifier>ISBN: 1424448727</identifier><identifier>ISBN: 9781424448722</identifier><identifier>EISBN: 9781424448739</identifier><identifier>EISBN: 1424448735</identifier><identifier>DOI: 10.1109/INMIC.2009.5383111</identifier><identifier>LCCN: 2009905808</identifier><language>eng</language><publisher>IEEE</publisher><subject>Character recognition ; Data mining ; Image recognition ; Image segmentation ; Information retrieval ; Optical character recognition software ; Optical distortion ; Optical sensors ; Text recognition ; Writing</subject><ispartof>2009 IEEE 13th International Multitopic Conference, 2009, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5383111$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5383111$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Javed, S.T.</creatorcontrib><creatorcontrib>Hussain, S.</creatorcontrib><title>Improving Nastalique specific pre-recognition process for Urdu OCR</title><title>2009 IEEE 13th International Multitopic Conference</title><addtitle>INMIC</addtitle><description>Urdu language is written using Arabic script in Nastalique writing style. Nastalique script is highly cursive, context sensitive and is hard to process as only the last character in its ligature sits on the baseline. In addition, it exhibits character and ligature level spatial overlap. Due to these factors, the placement of dots and other diacritics is also highly contextual and variable. There is now increasing amount of work to process and recognize Nastalique script to develop Urdu OCR. This paper proposes improvements to these methods. The paper focuses on Nastalique specific pre-processing methods which can be employed before the text recognition process. The recognition and post recognition processes will be addressed separately.</description><subject>Character recognition</subject><subject>Data mining</subject><subject>Image recognition</subject><subject>Image segmentation</subject><subject>Information retrieval</subject><subject>Optical character recognition software</subject><subject>Optical distortion</subject><subject>Optical sensors</subject><subject>Text recognition</subject><subject>Writing</subject><isbn>1424448727</isbn><isbn>9781424448722</isbn><isbn>9781424448739</isbn><isbn>1424448735</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2009</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNo1kNFKwzAYhSMy0M2-gN7kBTr_NEnz51KLzsLcQOb1yNJkRLa2Jp3g21txnpvDgY_D4RByy2DOGOj7evVaV_MCQM8lR84YuyCZVshEIYRAxfUlmf6HQk3I9JfVIBHwimQpfcAoIbkAdU0e62Mfu6_Q7unKpMEcwufJ0dQ7G3ywtI8uj852-zYMoWvH3FmXEvVdpO-xOdF19XZDJt4cksvOPiOb56dN9ZIv14u6eljmQcOQS2SWGyYUovemQYu400yp3TjDN6UfxQBZyUvfgObAzQgJLaVH3Qir-Izc_dUG59y2j-Fo4vf2fAH_AfOvTTw</recordid><startdate>200912</startdate><enddate>200912</enddate><creator>Javed, S.T.</creator><creator>Hussain, S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200912</creationdate><title>Improving Nastalique specific pre-recognition process for Urdu OCR</title><author>Javed, S.T. ; Hussain, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-581c3a14788ffad8c88b9177b407fd6ffff1081636fd09303aad84955f89d4c73</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Character recognition</topic><topic>Data mining</topic><topic>Image recognition</topic><topic>Image segmentation</topic><topic>Information retrieval</topic><topic>Optical character recognition software</topic><topic>Optical distortion</topic><topic>Optical sensors</topic><topic>Text recognition</topic><topic>Writing</topic><toplevel>online_resources</toplevel><creatorcontrib>Javed, S.T.</creatorcontrib><creatorcontrib>Hussain, S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Javed, S.T.</au><au>Hussain, S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Improving Nastalique specific pre-recognition process for Urdu OCR</atitle><btitle>2009 IEEE 13th International Multitopic Conference</btitle><stitle>INMIC</stitle><date>2009-12</date><risdate>2009</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><isbn>1424448727</isbn><isbn>9781424448722</isbn><eisbn>9781424448739</eisbn><eisbn>1424448735</eisbn><abstract>Urdu language is written using Arabic script in Nastalique writing style. Nastalique script is highly cursive, context sensitive and is hard to process as only the last character in its ligature sits on the baseline. In addition, it exhibits character and ligature level spatial overlap. Due to these factors, the placement of dots and other diacritics is also highly contextual and variable. There is now increasing amount of work to process and recognize Nastalique script to develop Urdu OCR. This paper proposes improvements to these methods. The paper focuses on Nastalique specific pre-processing methods which can be employed before the text recognition process. The recognition and post recognition processes will be addressed separately.</abstract><pub>IEEE</pub><doi>10.1109/INMIC.2009.5383111</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 1424448727
ispartof 2009 IEEE 13th International Multitopic Conference, 2009, p.1-6
issn
language eng
recordid cdi_ieee_primary_5383111
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Character recognition
Data mining
Image recognition
Image segmentation
Information retrieval
Optical character recognition software
Optical distortion
Optical sensors
Text recognition
Writing
title Improving Nastalique specific pre-recognition process for Urdu OCR
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T17%3A57%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Improving%20Nastalique%20specific%20pre-recognition%20process%20for%20Urdu%20OCR&rft.btitle=2009%20IEEE%2013th%20International%20Multitopic%20Conference&rft.au=Javed,%20S.T.&rft.date=2009-12&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.isbn=1424448727&rft.isbn_list=9781424448722&rft_id=info:doi/10.1109/INMIC.2009.5383111&rft.eisbn=9781424448739&rft.eisbn_list=1424448735&rft_dat=%3Cieee_6IE%3E5383111%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i90t-581c3a14788ffad8c88b9177b407fd6ffff1081636fd09303aad84955f89d4c73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5383111&rfr_iscdi=true