Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data

Bibliographic Details
Published in: New generation computing 2024-03, Vol.42 (1), p.135-155
Main Authors: Dalal, Sumit, Jain, Sarika, Dave, Mayank
Format: Article
Language: English
container_end_page 155
container_issue 1
container_start_page 135
container_title New generation computing
container_volume 42
creator Dalal, Sumit
Jain, Sarika
Dave, Mayank
description People share textual posts about their interests, routines, and moods on social platforms, and these posts can be analyzed to evaluate their mental state using diverse techniques such as lexical approaches, machine learning (ML), and deep learning (DL). Larger n-grams (bi-, tri-, or quad-grams) carry more contextual information than unigrams. However, most models used for depression classification include only unigrams. Moreover, recurrent neural networks (RNNs), the well-known depression classifiers, retain only the sequential information of the text and ignore the local features of postings. We suggest using a multi-channel convolutional neural network (MCNN) to capture local features and larger context from user posts. Each channel also has a dedicated dot-product attention layer that captures global features from the local features of its context level. The proposed model is tested on the CLEF-eRisk 2018 depression dataset, which contains posts from 214 depressed and 1493 non-depressed users. Experimental results show that our model achieved competitive accuracy, recall, and F-score of 91.00%, 76.50%, and 70.51%, respectively. Accuracy is up to 5.00% higher and recall is approximately 24% higher than for a multi-channel CNN without an attention layer. Significant n-grams highlighted by the attention mechanism can be used to provide a user-level explanation for the depression classification results. However, directly incorporating the attention weights might not be helpful, as the attention highlights are dense and entangled.
doi_str_mv 10.1007/s00354-023-00237-y
format article
publisher Tokyo: Springer Japan
rights The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Corrected publication 2024.
orcidid https://orcid.org/0000-0002-8736-2148
tpages 21
toplevel peer_reviewed
online_resources
fulltext fulltext
identifier ISSN: 0288-3635
ispartof New generation computing, 2024-03, Vol.42 (1), p.135-155
issn 0288-3635
1882-7055
language eng
recordid cdi_proquest_journals_3048755888
source Springer Link
subjects Artificial Intelligence
Artificial neural networks
Channels
Classification
Computer Hardware
Computer Science
Computer Systems Organization and Communication Networks
Context
Deep learning
Machine learning
Model accuracy
Neural networks
Recall
Recurrent neural networks
Software Engineering/Programming and Operating Systems
title Convolution Neural Network Having Multiple Channels with Own Attention Layer for Depression Detection from Social Data
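
The description above outlines the architecture only in words: several convolution channels, one per n-gram width, each followed by its own dot-product attention layer. The following is a minimal sketch of that idea in plain Python, not the authors' implementation. The n-gram widths (1 to 4), the embedding and filter sizes, the random weights, the learned-query form of the attention, the sample tokens, and every function name are assumptions made for illustration; the record does not specify them.

```python
import math
import random


def conv1d_ngram(embeddings, filters, n):
    """Slide a width-n window over token embeddings; one feature vector per n-gram."""
    features = []
    for start in range(len(embeddings) - n + 1):
        # Flatten the n consecutive token vectors into a single window vector.
        window = [x for vec in embeddings[start:start + n] for x in vec]
        # Each filter is a weight vector over the window, followed by ReLU.
        features.append(
            [max(0.0, sum(w * x for w, x in zip(f, window))) for f in filters]
        )
    return features


def dot_product_attention(features, query):
    """Score each n-gram by its dot product with a query vector (assumed form),
    softmax the scores, and return (weights, attention-weighted sum)."""
    scores = [sum(q * h for q, h in zip(query, feat)) for feat in features]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(features[0])
    pooled = [
        sum(w * feat[d] for w, feat in zip(weights, features)) for d in range(dim)
    ]
    return weights, pooled


def init_channels(ngram_sizes, embed_dim, num_filters, rng):
    """Create random (untrained) filters and a query vector for each channel."""
    channels = {}
    for n in ngram_sizes:
        channels[n] = {
            "filters": [
                [rng.uniform(-1, 1) for _ in range(n * embed_dim)]
                for _ in range(num_filters)
            ],
            "query": [rng.uniform(-1, 1) for _ in range(num_filters)],
        }
    return channels


def mcnn_forward(embeddings, channels):
    """Run every channel, attend within it, and concatenate the pooled vectors."""
    pooled_all, weights_all = [], {}
    for n, params in channels.items():
        feats = conv1d_ngram(embeddings, params["filters"], n)
        weights, pooled = dot_product_attention(feats, params["query"])
        weights_all[n] = weights  # per-n-gram weights, usable for inspection
        pooled_all.extend(pooled)
    return pooled_all, weights_all


rng = random.Random(0)
EMBED_DIM, NUM_FILTERS = 4, 3  # hypothetical sizes
tokens = ["i", "feel", "so", "tired", "lately"]  # hypothetical post
embeddings = [[rng.uniform(-1, 1) for _ in range(EMBED_DIM)] for _ in tokens]
channels = init_channels((1, 2, 3, 4), EMBED_DIM, NUM_FILTERS, rng)
document_vector, attention = mcnn_forward(embeddings, channels)
```

In a full model the concatenated `document_vector` would feed a trained classification layer, and `attention[n]` holds one weight per n-gram of width `n`, which is the kind of signal the description suggests could support user-level explanations.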