Loading…

Prediction of Aptamer Protein Interaction Using Random Forest Algorithm

Aptamers are oligonucleotides that may attach to amino acids, polypeptide, tiny compounds, allergens and living cell membrane. Therapeutics, bio sensing and diagnostics are all sectors where the aptamers may be used. In this work, we present a model based on Random Forest Algorithms to predict the i...

Full description

Saved in:
Bibliographic Details
Published in:IEEE access 2022, Vol.10, p.49677-49687
Main Authors: Manju, N., Samiha, C. M., Kumar, S. P. Pavan, Gururaj, H. L., Flammini, Francesco
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aptamers are oligonucleotides that may attach to amino acids, polypeptide, tiny compounds, allergens and living cell membrane. Therapeutics, bio sensing and diagnostics are all sectors where the aptamers may be used. In this work, we present a model based on Random Forest Algorithms to predict the interaction of aptamer and target proteins by combining their most prominent characteristics. Amino Acid Composition and Psuedo Amino Acid Composition were utilized to express desired data by employing physicochemical and structural features of the amino acids. The dominant features were selected using feature importance classifiers such as random forest and eXtreme Gradient Boosting. Compared to these, principal component analysis techniques yielded a good feature set. As a result, 98% accuracy is obtained for 50 principal components. Many relevant characteristics in defining aptamer target protein interactions were discovered after analysing the best set of features. Our prediction approach is expected to become a valuable tool for discovering aptamer-target interactions, and the traits chosen and studied in this work might give helpful insight into the process of Aptamer Protein interactions.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2022.3172278