Generation of Missing Words in Assamese text using N-gram based Model

Bibliographic Details
Published in:Journal of physics. Conference series 2020-12, Vol.1706 (1), p.12166
Main Authors: Bhuyan, M. P., Sarma, S. K.
Format: Article
Language:English
Description
Summary:It is very common to miss certain words when writing while listening to others, and a similar problem can arise when typing on a computer. Automatically generating the missed words would be very helpful to users by suggesting the required words. In this research work, missed words in Assamese sentences are generated; at present, no tool or method exists that can provide or generate the missed words in an Assamese sentence. N-gram based models, namely bigram and trigram models, are used to generate the missed words. Using the bigram and trigram models, a rank is calculated for each possible suggested word, and the suggested word list is sorted by this rank value in decreasing order. Finally, these suggested words can be used to fill the place of the missed Assamese words. Different levels of experiments are carried out, and the proposed system can correct the missed words with an accuracy ranging from 58% to 66%. The proposed model can generate five relevant suggestions for a sentence containing an average of six words with two missed words separated by three words.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/1706/1/012166
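
Illustrative sketch: the summary describes ranking candidate words with bigram and trigram models and sorting the suggestions in decreasing order of rank. The Python below is one minimal way such a ranking could look. It is not the authors' method: the summary does not give the rank formula, so the score used here (a plain sum of matching bigram and trigram counts around the gap) is an assumption, as are the function names `train_ngrams` and `suggest` and the toy English corpus standing in for Assamese text.

```python
from collections import Counter


def train_ngrams(sentences):
    """Count bigrams and trigrams over a list of tokenized sentences."""
    bigrams, trigrams, vocab = Counter(), Counter(), set()
    for tokens in sentences:
        vocab.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
        trigrams.update(zip(tokens, tokens[1:], tokens[2:]))
    return bigrams, trigrams, vocab


def suggest(left, right, bigrams, trigrams, vocab, top_k=5):
    """Rank candidate words for a gap between `left` and `right` context tokens.

    Assumed rank: the sum of counts of every bigram and trigram that the
    candidate would form with its neighbours. The paper's actual formula
    is not stated in the summary.
    """
    scores = {}
    for w in vocab:
        score = 0
        if left:
            score += bigrams[(left[-1], w)]
        if right:
            score += bigrams[(w, right[0])]
        if len(left) >= 2:
            score += trigrams[(left[-2], left[-1], w)]
        if left and right:
            score += trigrams[(left[-1], w, right[0])]
        if len(right) >= 2:
            score += trigrams[(w, right[0], right[1])]
        if score > 0:
            scores[w] = score
    # Decreasing rank; ties broken alphabetically so the order is stable.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]


# Toy corpus (hypothetical data, English stand-in for Assamese sentences).
corpus = [
    ["i", "drink", "tea"],
    ["i", "drink", "water"],
    ["i", "drink", "tea", "daily"],
    ["we", "drink", "tea"],
]
bigrams, trigrams, vocab = train_ngrams(corpus)
print(suggest(["i"], ["tea"], bigrams, trigrams, vocab))
print(suggest(["i", "drink"], [], bigrams, trigrams, vocab))
```

The default `top_k=5` mirrors the five suggestions mentioned in the summary. A real system would likely use smoothed probabilities rather than raw counts; raw counts are used here only to keep the sketch short.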