Loading…

A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk

Student dropout rates are one of the major concerns of educational institutions because they affect the success and efficacy of them. In order to help students continue their learning and achieve a better future, there is a need to identify the risk of student dropout. However, it is challenging to...

Full description

Saved in:
Bibliographic Details
Published in:PeerJ. Computer science 2024-11, Vol.10, p.e2572, Article e2572
Main Authors: Nguyen Thi Cam, Huong, Sarlan, Aliza, Arshad, Noreen Izza
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c413t-a8c524a27243f56aed7d5c868a20658d12cccf07cdc89f8b9b6587d1c7ef365a3
container_end_page
container_issue
container_start_page e2572
container_title PeerJ. Computer science
container_volume 10
creator Nguyen Thi Cam, Huong
Sarlan, Aliza
Arshad, Noreen Izza
description Student dropout rates are one of the major concerns of educational institutions because they affect the success and efficacy of them. In order to help students continue their learning and achieve a better future, there is a need to identify the risk of student dropout. However, it is challenging to accurately identify the student dropout risk in the preliminary stages considering the complexities associated with it. This research develops an efficient prediction model using machine learning (ML) and deep learning (DL) techniques for identifying student dropouts in both small and big educational datasets. A hybrid prediction model DeepS3VM is designed by integrating a Semi-supervised support vector machine (S3VM) model with a recurrent neural network (RNN) to capture sequential patterns in student dropout prediction. In addition, a personalized recommendation system (PRS) is developed to recommend personalized learning paths for students who are at risk of dropping out. The potential of the DeepS3VM is evaluated with respect to various evaluation metrics and the results are compared with various existing models such as Random Forest (RF), decision tree (DT), XGBoost, artificial neural network (ANN) and convolutional neural network (CNN). The DeepS3VM model demonstrates outstanding accuracy at 92.54%, surpassing other current models. This confirms the model's effectiveness in precisely identifying the risk of student dropout. The dataset used for this analysis was obtained from the student management system of a private university in Vietnam and generated from an initial 243 records to a total of one hundred thousand records.
doi_str_mv 10.7717/peerj-cs.2572
format article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_9d2b3f9b9fd04d50934940df676e1797</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A818603267</galeid><doaj_id>oai_doaj_org_article_9d2b3f9b9fd04d50934940df676e1797</doaj_id><sourcerecordid>A818603267</sourcerecordid><originalsourceid>FETCH-LOGICAL-c413t-a8c524a27243f56aed7d5c868a20658d12cccf07cdc89f8b9b6587d1c7ef365a3</originalsourceid><addsrcrecordid>eNptkk1v3CAQhq2qVROlOfZaIfXSHrwFY4M5rqJ-rBSpUj_OCMOwy8Y2LuC0-z_6g4uzadqtCgdGwzPvaOAtiucErzgn_M0EEPaljquq4dWj4ryinJWNENXjv-Kz4jLGPcaYNCQv8bQ4o4I1mLL6vPi5RrtDF5xBgzfQIzcm2AaV3LhFAfQcAowJjTAH1ecjfffhJiI1GpR2gCIMrozzBOHWRTAoh5MPCd2CTj6gQemdGwHZHDuThZx1Omv7EXmLQIX-gGKalxtkgp_8nFBw8eZZ8cSqPsLl_XlRfH339svVh_L64_vN1fq61DWhqVStbqpaVbyqqW2YAsNNo1vWqgqzpjWk0lpbzLXRrbBtJ7qc5YZoDpayRtGLYnPUNV7t5RTcoMJBeuXkXcKHrVQhOd2DFKbqqBWdsAbXpsGC1qLGxjLOgHDBs9aro9YU_LcZYpKDixr6Xo3g5ygpqRnDjLU4oy__Qfd-DmOeNFO0Jow2rP5DbVXu70brU1B6EZXrlrQM04otbVf_ofI2-Wu0H8G6nD8peH1SkJkEP9JWzTHKzedPp2x5ZHXwMQawD29EsFwMKO8MKHWUiwEz_-J-sLkbwDzQv-1GfwEWctg1</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3134163564</pqid></control><display><type>article</type><title>A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>Nguyen Thi Cam, Huong ; Sarlan, Aliza ; Arshad, Noreen Izza</creator><creatorcontrib>Nguyen Thi Cam, Huong ; Sarlan, Aliza ; Arshad, Noreen Izza</creatorcontrib><description>Student dropout rates are one of the major concerns of educational institutions because they affect the success and efficacy of them. In order to help students continue their learning and achieve a better future, there is a need to identify the risk of student dropout. However, it is challenging to accurately identify the student dropout risk in the preliminary stages considering the complexities associated with it. This research develops an efficient prediction model using machine learning (ML) and deep learning (DL) techniques for identifying student dropouts in both small and big educational datasets. A hybrid prediction model DeepS3VM is designed by integrating a Semi-supervised support vector machine (S3VM) model with a recurrent neural network (RNN) to capture sequential patterns in student dropout prediction. In addition, a personalized recommendation system (PRS) is developed to recommend personalized learning paths for students who are at risk of dropping out. The potential of the DeepS3VM is evaluated with respect to various evaluation metrics and the results are compared with various existing models such as Random Forest (RF), decision tree (DT), XGBoost, artificial neural network (ANN) and convolutional neural network (CNN). The DeepS3VM model demonstrates outstanding accuracy at 92.54%, surpassing other current models. This confirms the model's effectiveness in precisely identifying the risk of student dropout. The dataset used for this analysis was obtained from the student management system of a private university in Vietnam and generated from an initial 243 records to a total of one hundred thousand records.</description><identifier>ISSN: 2376-5992</identifier><identifier>EISSN: 2376-5992</identifier><identifier>DOI: 10.7717/peerj-cs.2572</identifier><identifier>PMID: 39650364</identifier><language>eng</language><publisher>United States: PeerJ. Ltd</publisher><subject>Artificial intelligence ; Artificial neural networks ; At risk students ; Big data ; Colleges &amp; universities ; Customization ; Datasets ; Decision trees ; Deep learning ; Design ; Distance learning ; Education ; Effectiveness ; Feedback ; Forecasts and trends ; Intervention ; Machine learning ; Neural networks ; Performance evaluation ; Personalized learning ; Personalized recommendation system ; Prediction models ; Recommender systems ; Recurrent neural network ; Recurrent neural networks ; S3VM ; School dropout programs ; Student dropout prediction ; Student retention ; Students ; Support vector machines</subject><ispartof>PeerJ. Computer science, 2024-11, Vol.10, p.e2572, Article e2572</ispartof><rights>2024 Nguyen Thi Cam et al.</rights><rights>COPYRIGHT 2024 PeerJ. Ltd.</rights><rights>2024 Nguyen Thi Cam et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c413t-a8c524a27243f56aed7d5c868a20658d12cccf07cdc89f8b9b6587d1c7ef365a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3134163564/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3134163564?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,25731,27901,27902,36989,36990,44566,74869</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39650364$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Nguyen Thi Cam, Huong</creatorcontrib><creatorcontrib>Sarlan, Aliza</creatorcontrib><creatorcontrib>Arshad, Noreen Izza</creatorcontrib><title>A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk</title><title>PeerJ. Computer science</title><addtitle>PeerJ Comput Sci</addtitle><description>Student dropout rates are one of the major concerns of educational institutions because they affect the success and efficacy of them. In order to help students continue their learning and achieve a better future, there is a need to identify the risk of student dropout. However, it is challenging to accurately identify the student dropout risk in the preliminary stages considering the complexities associated with it. This research develops an efficient prediction model using machine learning (ML) and deep learning (DL) techniques for identifying student dropouts in both small and big educational datasets. A hybrid prediction model DeepS3VM is designed by integrating a Semi-supervised support vector machine (S3VM) model with a recurrent neural network (RNN) to capture sequential patterns in student dropout prediction. In addition, a personalized recommendation system (PRS) is developed to recommend personalized learning paths for students who are at risk of dropping out. The potential of the DeepS3VM is evaluated with respect to various evaluation metrics and the results are compared with various existing models such as Random Forest (RF), decision tree (DT), XGBoost, artificial neural network (ANN) and convolutional neural network (CNN). The DeepS3VM model demonstrates outstanding accuracy at 92.54%, surpassing other current models. This confirms the model's effectiveness in precisely identifying the risk of student dropout. The dataset used for this analysis was obtained from the student management system of a private university in Vietnam and generated from an initial 243 records to a total of one hundred thousand records.</description><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>At risk students</subject><subject>Big data</subject><subject>Colleges &amp; universities</subject><subject>Customization</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Deep learning</subject><subject>Design</subject><subject>Distance learning</subject><subject>Education</subject><subject>Effectiveness</subject><subject>Feedback</subject><subject>Forecasts and trends</subject><subject>Intervention</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Performance evaluation</subject><subject>Personalized learning</subject><subject>Personalized recommendation system</subject><subject>Prediction models</subject><subject>Recommender systems</subject><subject>Recurrent neural network</subject><subject>Recurrent neural networks</subject><subject>S3VM</subject><subject>School dropout programs</subject><subject>Student dropout prediction</subject><subject>Student retention</subject><subject>Students</subject><subject>Support vector machines</subject><issn>2376-5992</issn><issn>2376-5992</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNptkk1v3CAQhq2qVROlOfZaIfXSHrwFY4M5rqJ-rBSpUj_OCMOwy8Y2LuC0-z_6g4uzadqtCgdGwzPvaOAtiucErzgn_M0EEPaljquq4dWj4ryinJWNENXjv-Kz4jLGPcaYNCQv8bQ4o4I1mLL6vPi5RrtDF5xBgzfQIzcm2AaV3LhFAfQcAowJjTAH1ecjfffhJiI1GpR2gCIMrozzBOHWRTAoh5MPCd2CTj6gQemdGwHZHDuThZx1Omv7EXmLQIX-gGKalxtkgp_8nFBw8eZZ8cSqPsLl_XlRfH339svVh_L64_vN1fq61DWhqVStbqpaVbyqqW2YAsNNo1vWqgqzpjWk0lpbzLXRrbBtJ7qc5YZoDpayRtGLYnPUNV7t5RTcoMJBeuXkXcKHrVQhOd2DFKbqqBWdsAbXpsGC1qLGxjLOgHDBs9aro9YU_LcZYpKDixr6Xo3g5ygpqRnDjLU4oy__Qfd-DmOeNFO0Jow2rP5DbVXu70brU1B6EZXrlrQM04otbVf_ofI2-Wu0H8G6nD8peH1SkJkEP9JWzTHKzedPp2x5ZHXwMQawD29EsFwMKO8MKHWUiwEz_-J-sLkbwDzQv-1GfwEWctg1</recordid><startdate>20241129</startdate><enddate>20241129</enddate><creator>Nguyen Thi Cam, Huong</creator><creator>Sarlan, Aliza</creator><creator>Arshad, Noreen Izza</creator><general>PeerJ. Ltd</general><general>PeerJ, Inc</general><general>PeerJ Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7XB</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>DOA</scope></search><sort><creationdate>20241129</creationdate><title>A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk</title><author>Nguyen Thi Cam, Huong ; Sarlan, Aliza ; Arshad, Noreen Izza</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c413t-a8c524a27243f56aed7d5c868a20658d12cccf07cdc89f8b9b6587d1c7ef365a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>At risk students</topic><topic>Big data</topic><topic>Colleges &amp; universities</topic><topic>Customization</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Deep learning</topic><topic>Design</topic><topic>Distance learning</topic><topic>Education</topic><topic>Effectiveness</topic><topic>Feedback</topic><topic>Forecasts and trends</topic><topic>Intervention</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Performance evaluation</topic><topic>Personalized learning</topic><topic>Personalized recommendation system</topic><topic>Prediction models</topic><topic>Recommender systems</topic><topic>Recurrent neural network</topic><topic>Recurrent neural networks</topic><topic>S3VM</topic><topic>School dropout programs</topic><topic>Student dropout prediction</topic><topic>Student retention</topic><topic>Students</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nguyen Thi Cam, Huong</creatorcontrib><creatorcontrib>Sarlan, Aliza</creatorcontrib><creatorcontrib>Arshad, Noreen Izza</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PeerJ. Computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nguyen Thi Cam, Huong</au><au>Sarlan, Aliza</au><au>Arshad, Noreen Izza</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk</atitle><jtitle>PeerJ. Computer science</jtitle><addtitle>PeerJ Comput Sci</addtitle><date>2024-11-29</date><risdate>2024</risdate><volume>10</volume><spage>e2572</spage><pages>e2572-</pages><artnum>e2572</artnum><issn>2376-5992</issn><eissn>2376-5992</eissn><abstract>Student dropout rates are one of the major concerns of educational institutions because they affect the success and efficacy of them. In order to help students continue their learning and achieve a better future, there is a need to identify the risk of student dropout. However, it is challenging to accurately identify the student dropout risk in the preliminary stages considering the complexities associated with it. This research develops an efficient prediction model using machine learning (ML) and deep learning (DL) techniques for identifying student dropouts in both small and big educational datasets. A hybrid prediction model DeepS3VM is designed by integrating a Semi-supervised support vector machine (S3VM) model with a recurrent neural network (RNN) to capture sequential patterns in student dropout prediction. In addition, a personalized recommendation system (PRS) is developed to recommend personalized learning paths for students who are at risk of dropping out. The potential of the DeepS3VM is evaluated with respect to various evaluation metrics and the results are compared with various existing models such as Random Forest (RF), decision tree (DT), XGBoost, artificial neural network (ANN) and convolutional neural network (CNN). The DeepS3VM model demonstrates outstanding accuracy at 92.54%, surpassing other current models. This confirms the model's effectiveness in precisely identifying the risk of student dropout. The dataset used for this analysis was obtained from the student management system of a private university in Vietnam and generated from an initial 243 records to a total of one hundred thousand records.</abstract><cop>United States</cop><pub>PeerJ. Ltd</pub><pmid>39650364</pmid><doi>10.7717/peerj-cs.2572</doi><tpages>e2572</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2376-5992
ispartof PeerJ. Computer science, 2024-11, Vol.10, p.e2572, Article e2572
issn 2376-5992
2376-5992
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_9d2b3f9b9fd04d50934940df676e1797
source Publicly Available Content Database; PubMed Central
subjects Artificial intelligence
Artificial neural networks
At risk students
Big data
Colleges & universities
Customization
Datasets
Decision trees
Deep learning
Design
Distance learning
Education
Effectiveness
Feedback
Forecasts and trends
Intervention
Machine learning
Neural networks
Performance evaluation
Personalized learning
Personalized recommendation system
Prediction models
Recommender systems
Recurrent neural network
Recurrent neural networks
S3VM
School dropout programs
Student dropout prediction
Student retention
Students
Support vector machines
title A hybrid model integrating recurrent neural networks and the semi-supervised support vector machine for identification of early student dropout risk
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T20%3A46%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20hybrid%20model%20integrating%20recurrent%20neural%20networks%20and%20the%20semi-supervised%20support%20vector%20machine%20for%20identification%20of%20early%20student%20dropout%20risk&rft.jtitle=PeerJ.%20Computer%20science&rft.au=Nguyen%20Thi%20Cam,%20Huong&rft.date=2024-11-29&rft.volume=10&rft.spage=e2572&rft.pages=e2572-&rft.artnum=e2572&rft.issn=2376-5992&rft.eissn=2376-5992&rft_id=info:doi/10.7717/peerj-cs.2572&rft_dat=%3Cgale_doaj_%3EA818603267%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c413t-a8c524a27243f56aed7d5c868a20658d12cccf07cdc89f8b9b6587d1c7ef365a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3134163564&rft_id=info:pmid/39650364&rft_galeid=A818603267&rfr_iscdi=true