Loading…

Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework

•This research was conducted from January, 2021 to January, 2023.•RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly u...

Full description

Saved in:
Bibliographic Details
Published in:Intelligent systems with applications 2023-05, Vol.18, p.200194, Article 200194
Main Authors: Zia, Sohail, Azhar, Muhammad, Lee, Bumshik, Tahir, Adnan, Ferzund, Javed, Murtaza, Fozia, Ali, Moazam
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713
cites cdi_FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713
container_end_page
container_issue
container_start_page 200194
container_title Intelligent systems with applications
container_volume 18
creator Zia, Sohail
Azhar, Muhammad
Lee, Bumshik
Tahir, Adnan
Ferzund, Javed
Murtaza, Fozia
Ali, Moazam
description •This research was conducted from January, 2021 to January, 2023.•RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks.•RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models.•The existing proposed algorithms for printed Urdu recognition only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing.•To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners.•Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms. RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks. RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models. The existing research work for printed Urdu recognition have shown that the current models only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing. To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-d
doi_str_mv 10.1016/j.iswa.2023.200194
format article
fullrecord <record><control><sourceid>elsevier_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_56d20a81fdfa4cf4b1b081c633690270</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2667305323000194</els_id><doaj_id>oai_doaj_org_article_56d20a81fdfa4cf4b1b081c633690270</doaj_id><sourcerecordid>S2667305323000194</sourcerecordid><originalsourceid>FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713</originalsourceid><addsrcrecordid>eNp9kNtKAzEQhhdRsKgv4FVeYOvksIeCN7ZqFUoFqdchm0xKarupyVbp25t2RbzyYg7M8H_M_Fl2TWFIgZY3q6GLX2rIgPGUgI7ESTZgZVnlHAp--qc_z65iXAEAqynlQgyy91fUftm6zvmWeEu2wbUdGvIWzI5EHdy2I64lcxU7tUb8INa3HWn2ZBdduyST-Twfu-nrW56CjFVM0odWe4Mhv8djJY9BbfDLh_fL7MyqdcSrn3qRLR4fFpOnfPYyfZ7czXItKHS5QVFbDYVFXlhTKMo4VBwEr03VNGArzbgyRcFRc7DC6EqkDbBqVNKmovwie-6xxquVTA9tVNhLr5w8DnxYShU6p9coi9IwUDW1xiqhrWhoAzXVJeflKAEhsVjP0sHHGND-8ijIg_lyJQ_my4P5sjc_iW57EaYnPx0GGbXDVqNxAXWXznD_yb8BJ4mMOg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework</title><source>ScienceDirect</source><creator>Zia, Sohail ; Azhar, Muhammad ; Lee, Bumshik ; Tahir, Adnan ; Ferzund, Javed ; Murtaza, Fozia ; Ali, Moazam</creator><creatorcontrib>Zia, Sohail ; Azhar, Muhammad ; Lee, Bumshik ; Tahir, Adnan ; Ferzund, Javed ; Murtaza, Fozia ; Ali, Moazam</creatorcontrib><description>•This research was conducted from January, 2021 to January, 2023.•RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks.•RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models.•The existing proposed algorithms for printed Urdu recognition only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing.•To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners.•Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms. RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks. RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models. The existing research work for printed Urdu recognition have shown that the current models only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing. To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners. Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms in terms of epochs (70 epochs as compared to 100 with BLSTM-LSTM based encoder decoder), 6 percent increase of Character Recognition Accuracy (86.95 percent as compared to 81.08 percent of BLSTM-LSTM), 10 percent increase of Word Recognition Accuracy (WRA) (89.48% as compared to 79.06 percent of BLSTM-LSTM) and less time complexity (18 seconds less than BLSTM-LSTM with same system configuration).</description><identifier>ISSN: 2667-3053</identifier><identifier>EISSN: 2667-3053</identifier><identifier>DOI: 10.1016/j.iswa.2023.200194</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>BiGRU ; Deep learning ; Encoder-Decoder model ; GRU ; OCR ; Sequence learning</subject><ispartof>Intelligent systems with applications, 2023-05, Vol.18, p.200194, Article 200194</ispartof><rights>2023 The Author(s)</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713</citedby><cites>FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713</cites><orcidid>0000-0003-3687-0270</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S2667305323000194$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3535,27903,27904,45759</link.rule.ids></links><search><creatorcontrib>Zia, Sohail</creatorcontrib><creatorcontrib>Azhar, Muhammad</creatorcontrib><creatorcontrib>Lee, Bumshik</creatorcontrib><creatorcontrib>Tahir, Adnan</creatorcontrib><creatorcontrib>Ferzund, Javed</creatorcontrib><creatorcontrib>Murtaza, Fozia</creatorcontrib><creatorcontrib>Ali, Moazam</creatorcontrib><title>Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework</title><title>Intelligent systems with applications</title><description>•This research was conducted from January, 2021 to January, 2023.•RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks.•RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models.•The existing proposed algorithms for printed Urdu recognition only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing.•To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners.•Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms. RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks. RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models. The existing research work for printed Urdu recognition have shown that the current models only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing. To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners. Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms in terms of epochs (70 epochs as compared to 100 with BLSTM-LSTM based encoder decoder), 6 percent increase of Character Recognition Accuracy (86.95 percent as compared to 81.08 percent of BLSTM-LSTM), 10 percent increase of Word Recognition Accuracy (WRA) (89.48% as compared to 79.06 percent of BLSTM-LSTM) and less time complexity (18 seconds less than BLSTM-LSTM with same system configuration).</description><subject>BiGRU</subject><subject>Deep learning</subject><subject>Encoder-Decoder model</subject><subject>GRU</subject><subject>OCR</subject><subject>Sequence learning</subject><issn>2667-3053</issn><issn>2667-3053</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNp9kNtKAzEQhhdRsKgv4FVeYOvksIeCN7ZqFUoFqdchm0xKarupyVbp25t2RbzyYg7M8H_M_Fl2TWFIgZY3q6GLX2rIgPGUgI7ESTZgZVnlHAp--qc_z65iXAEAqynlQgyy91fUftm6zvmWeEu2wbUdGvIWzI5EHdy2I64lcxU7tUb8INa3HWn2ZBdduyST-Twfu-nrW56CjFVM0odWe4Mhv8djJY9BbfDLh_fL7MyqdcSrn3qRLR4fFpOnfPYyfZ7czXItKHS5QVFbDYVFXlhTKMo4VBwEr03VNGArzbgyRcFRc7DC6EqkDbBqVNKmovwie-6xxquVTA9tVNhLr5w8DnxYShU6p9coi9IwUDW1xiqhrWhoAzXVJeflKAEhsVjP0sHHGND-8ijIg_lyJQ_my4P5sjc_iW57EaYnPx0GGbXDVqNxAXWXznD_yb8BJ4mMOg</recordid><startdate>202305</startdate><enddate>202305</enddate><creator>Zia, Sohail</creator><creator>Azhar, Muhammad</creator><creator>Lee, Bumshik</creator><creator>Tahir, Adnan</creator><creator>Ferzund, Javed</creator><creator>Murtaza, Fozia</creator><creator>Ali, Moazam</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3687-0270</orcidid></search><sort><creationdate>202305</creationdate><title>Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework</title><author>Zia, Sohail ; Azhar, Muhammad ; Lee, Bumshik ; Tahir, Adnan ; Ferzund, Javed ; Murtaza, Fozia ; Ali, Moazam</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>BiGRU</topic><topic>Deep learning</topic><topic>Encoder-Decoder model</topic><topic>GRU</topic><topic>OCR</topic><topic>Sequence learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zia, Sohail</creatorcontrib><creatorcontrib>Azhar, Muhammad</creatorcontrib><creatorcontrib>Lee, Bumshik</creatorcontrib><creatorcontrib>Tahir, Adnan</creatorcontrib><creatorcontrib>Ferzund, Javed</creatorcontrib><creatorcontrib>Murtaza, Fozia</creatorcontrib><creatorcontrib>Ali, Moazam</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Intelligent systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zia, Sohail</au><au>Azhar, Muhammad</au><au>Lee, Bumshik</au><au>Tahir, Adnan</au><au>Ferzund, Javed</au><au>Murtaza, Fozia</au><au>Ali, Moazam</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework</atitle><jtitle>Intelligent systems with applications</jtitle><date>2023-05</date><risdate>2023</risdate><volume>18</volume><spage>200194</spage><pages>200194-</pages><artnum>200194</artnum><issn>2667-3053</issn><eissn>2667-3053</eissn><abstract>•This research was conducted from January, 2021 to January, 2023.•RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks.•RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models.•The existing proposed algorithms for printed Urdu recognition only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing.•To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners.•Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms. RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks. RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models. The existing research work for printed Urdu recognition have shown that the current models only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing. To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners. Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms in terms of epochs (70 epochs as compared to 100 with BLSTM-LSTM based encoder decoder), 6 percent increase of Character Recognition Accuracy (86.95 percent as compared to 81.08 percent of BLSTM-LSTM), 10 percent increase of Word Recognition Accuracy (WRA) (89.48% as compared to 79.06 percent of BLSTM-LSTM) and less time complexity (18 seconds less than BLSTM-LSTM with same system configuration).</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.iswa.2023.200194</doi><orcidid>https://orcid.org/0000-0003-3687-0270</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2667-3053
ispartof Intelligent systems with applications, 2023-05, Vol.18, p.200194, Article 200194
issn 2667-3053
2667-3053
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_56d20a81fdfa4cf4b1b081c633690270
source ScienceDirect
subjects BiGRU
Deep learning
Encoder-Decoder model
GRU
OCR
Sequence learning
title Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T12%3A51%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recognition%20of%20printed%20Urdu%20script%20in%20Nastaleeq%20font%20by%20using%20CNN-BiGRU-GRU%20Based%20Encoder-Decoder%20Framework&rft.jtitle=Intelligent%20systems%20with%20applications&rft.au=Zia,%20Sohail&rft.date=2023-05&rft.volume=18&rft.spage=200194&rft.pages=200194-&rft.artnum=200194&rft.issn=2667-3053&rft.eissn=2667-3053&rft_id=info:doi/10.1016/j.iswa.2023.200194&rft_dat=%3Celsevier_doaj_%3ES2667305323000194%3C/elsevier_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c410t-de48fc05fe35fd5a1230730438d7bb0f7c23ad553ec30f4dc748d7027961b713%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true