Loading…
Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data
People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual in...
Saved in:
Published in: | New generation computing 2024-03, Vol.42 (1), p.135-155 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663 |
---|---|
cites | cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663 |
container_end_page | 155 |
container_issue | 1 |
container_start_page | 135 |
container_title | New generation computing |
container_volume | 42 |
creator | Dalal, Sumit Jain, Sarika Dave, Mayank |
description | People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled. |
doi_str_mv | 10.1007/s00354-023-00237-y |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3048755888</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3048755888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApwscQ44_kmcI0qBIhV6AM6WY5w2JbWD7bTK2-M2SNy47K52v5mVBoDrFN2mCOV3HiHCaIIwSVAseTKcgEnKOU5yxNgpmCDMeUIyws7BhfebiGeE4gnYldbsbNuHxhr4qnsn29jC3rovOJe7xqzgS9-Gpms1LNfSGN16uG_CGi73Bt6HoM1RupCDdrC2Ds5057T3h-VMB62O59rZLXyzqon2MxnkJTirZev11W-fgo_Hh_dyniyWT8_l_SJRJC1CojHL04owXKn8s8Y4xUpRKmnOMaeaI8aLmpJCoRplqWJ1lVeqIgWOM9Uqy8gU3Iy-nbPfvfZBbGzvTHwpCKI8Z4xzHik8UspZ752uReearXSDSJE45CvGfEWMVhzzFUMUkVHkI2xW2v1Z_6P6AUgdf0c</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3048755888</pqid></control><display><type>article</type><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><source>Springer Link</source><creator>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</creator><creatorcontrib>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</creatorcontrib><description>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</description><identifier>ISSN: 0288-3635</identifier><identifier>EISSN: 1882-7055</identifier><identifier>DOI: 10.1007/s00354-023-00237-y</identifier><language>eng</language><publisher>Tokyo: Springer Japan</publisher><subject>Artificial Intelligence ; Artificial neural networks ; Channels ; Classification ; Computer Hardware ; Computer Science ; Computer Systems Organization and Communication Networks ; Context ; Deep learning ; Machine learning ; Model accuracy ; Neural networks ; Recall ; Recurrent neural networks ; Software Engineering/Programming and Operating Systems</subject><ispartof>New generation computing, 2024-03, Vol.42 (1), p.135-155</ispartof><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.. corrected publication 2024</rights><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. corrected publication 2024.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</citedby><cites>FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</cites><orcidid>0000-0002-8736-2148</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Dalal, Sumit</creatorcontrib><creatorcontrib>Jain, Sarika</creatorcontrib><creatorcontrib>Dave, Mayank</creatorcontrib><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><title>New generation computing</title><addtitle>New Gener. Comput</addtitle><description>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</description><subject>Artificial Intelligence</subject><subject>Artificial neural networks</subject><subject>Channels</subject><subject>Classification</subject><subject>Computer Hardware</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Context</subject><subject>Deep learning</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Neural networks</subject><subject>Recall</subject><subject>Recurrent neural networks</subject><subject>Software Engineering/Programming and Operating Systems</subject><issn>0288-3635</issn><issn>1882-7055</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EEqXwApwscQ44_kmcI0qBIhV6AM6WY5w2JbWD7bTK2-M2SNy47K52v5mVBoDrFN2mCOV3HiHCaIIwSVAseTKcgEnKOU5yxNgpmCDMeUIyws7BhfebiGeE4gnYldbsbNuHxhr4qnsn29jC3rovOJe7xqzgS9-Gpms1LNfSGN16uG_CGi73Bt6HoM1RupCDdrC2Ds5057T3h-VMB62O59rZLXyzqon2MxnkJTirZev11W-fgo_Hh_dyniyWT8_l_SJRJC1CojHL04owXKn8s8Y4xUpRKmnOMaeaI8aLmpJCoRplqWJ1lVeqIgWOM9Uqy8gU3Iy-nbPfvfZBbGzvTHwpCKI8Z4xzHik8UspZ752uReearXSDSJE45CvGfEWMVhzzFUMUkVHkI2xW2v1Z_6P6AUgdf0c</recordid><startdate>20240301</startdate><enddate>20240301</enddate><creator>Dalal, Sumit</creator><creator>Jain, Sarika</creator><creator>Dave, Mayank</creator><general>Springer Japan</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-8736-2148</orcidid></search><sort><creationdate>20240301</creationdate><title>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</title><author>Dalal, Sumit ; Jain, Sarika ; Dave, Mayank</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial Intelligence</topic><topic>Artificial neural networks</topic><topic>Channels</topic><topic>Classification</topic><topic>Computer Hardware</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Context</topic><topic>Deep learning</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Neural networks</topic><topic>Recall</topic><topic>Recurrent neural networks</topic><topic>Software Engineering/Programming and Operating Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dalal, Sumit</creatorcontrib><creatorcontrib>Jain, Sarika</creatorcontrib><creatorcontrib>Dave, Mayank</creatorcontrib><collection>CrossRef</collection><jtitle>New generation computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dalal, Sumit</au><au>Jain, Sarika</au><au>Dave, Mayank</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data</atitle><jtitle>New generation computing</jtitle><stitle>New Gener. Comput</stitle><date>2024-03-01</date><risdate>2024</risdate><volume>42</volume><issue>1</issue><spage>135</spage><epage>155</epage><pages>135-155</pages><issn>0288-3635</issn><eissn>1882-7055</eissn><abstract>People share textual posts about their interests, routines, and moods on social platforms, which can be targeted to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Bigger grams (bi, tri, or quad) carry more contextual information than unigrams. However, most of the models used in the classification of depression include only unigrams. Moreover, the well-known depression classifiers, the recurrent neural networks (RNN), retain only the sequential information of the text and ignores the local features of postings. We suggest using a convolutional neural network of multiple channels (MCNN) to capture local features and larger context from user posts. Also, each channel has a dedicated dot-product attention layer to capture global features from local features of various context levels. The proposed model is tested on a depression dataset CLEF-eRisk 2018 with 214 depressed and 1493 non-depressed users’ posts. Experimental results show that our model achieved competitive accuracy, recall, and f-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than multi-channel CNN without an attention layer. Significant grams highlighted by the attention mechanism can be employed to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful as attention highlightings are dense and entangled.</abstract><cop>Tokyo</cop><pub>Springer Japan</pub><doi>10.1007/s00354-023-00237-y</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-8736-2148</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0288-3635 |
ispartof | New generation computing, 2024-03, Vol.42 (1), p.135-155 |
issn | 0288-3635 1882-7055 |
language | eng |
recordid | cdi_proquest_journals_3048755888 |
source | Springer Link |
subjects | Artificial Intelligence Artificial neural networks Channels Classification Computer Hardware Computer Science Computer Systems Organization and Communication Networks Context Deep learning Machine learning Model accuracy Neural networks Recall Recurrent neural networks Software Engineering/Programming and Operating Systems |
title | Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T16%3A05%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Convolution%20Neural%20Network%20Having%20Multiple%20Channels%20with%20Own%20Attention%20Layer%20for%20Depression%20Detection%20from%20Social%20Data&rft.jtitle=New%20generation%20computing&rft.au=Dalal,%20Sumit&rft.date=2024-03-01&rft.volume=42&rft.issue=1&rft.spage=135&rft.epage=155&rft.pages=135-155&rft.issn=0288-3635&rft.eissn=1882-7055&rft_id=info:doi/10.1007/s00354-023-00237-y&rft_dat=%3Cproquest_cross%3E3048755888%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-e2571b352bc7df2212cc44a478284e80589f439c0f061c5fb7bcb3921c54ec663%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3048755888&rft_id=info:pmid/&rfr_iscdi=true |