The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs
Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. word length, corpus frequency) are frequently coded, but...
Published in: | Behavior research methods 2021-08, Vol.53 (4), p.1799-1816 |
---|---|
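Main Authors: | Peti-Stantić, Anita; Anđel, Maja; Gnjidić, Vedrana; Keresteš, Gordana; Ljubešić, Nikola; Masnikosa, Irina; Tonković, Mirjana; Tušek, Jelena; Willer-Gold, Jana; Stanojević, Mateusz-Milan |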
Format: | Article |
Language: | English |
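Subjects: | Age; Behavioral Science and Psychology; Cognitive Psychology; Humans; Language; Psycholinguistics; Psychology; Ratings & rankings |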
cited_by | cdi_FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53 |
---|---|
cites | cdi_FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53 |
container_end_page | 1816 |
container_issue | 4 |
container_start_page | 1799 |
container_title | Behavior research methods |
container_volume | 53 |
creator | Peti-Stantić, Anita; Anđel, Maja; Gnjidić, Vedrana; Keresteš, Gordana; Ljubešić, Nikola; Masnikosa, Irina; Tonković, Mirjana; Tušek, Jelena; Willer-Gold, Jana; Stanojević, Mateusz-Milan |
description | Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. word length, corpus frequency) are frequently coded, but word class is seldom systematically treated, although there are indications of its significance for imageability and concreteness. This paper presents the Croatian Psycholinguistic Database (CPD; available at: https://doi.org/10.17234/megahr.2019.hpb), containing 6000 Croatian nouns, verbs, adjectives and adverbs, rated for concreteness, imageability, age of acquisition, and subjective frequency. Moreover, we present computationally obtained extrapolations of concreteness and imageability to the remainder of the Croatian lexicon (available at: https://github.com/megahr/lexicon/blob/master/predictions/hr_c_i.predictions.txt). In the two studies presented here, we explore the significance of word class for concreteness and imageability in human and computationally obtained ratings. The observed correlations in the CPD indicate correspondences between psycholinguistic measures expected from the literature. Word classes exhibit differences in subjective frequency, age of acquisition, concreteness and imageability, with significant differences between nouns, verbs, adjectives and adverbs. In the computational study which focused on concreteness and imageability, concreteness obtained higher correlations with human ratings than imageability, and the system underpredicted the concreteness of nouns, and overpredicted the concreteness of adjectives and adverbs. Overall, this suggests that word class contains schematic conceptual and distributional information. Schematic conceptual content seems to be more significant in human ratings of concreteness and less significant in computationally obtained ratings, where distributional information seems to play a more significant role. This suggests that word class differences should be theoretically explored. |
doi_str_mv | 10.3758/s13428-020-01533-x |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8367916</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2561940015</sourcerecordid><originalsourceid>FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53</originalsourceid><addsrcrecordid>eNp9kU1LAzEQhoMotlb_gAdZ8OLB1XzuhwdBil9Q8KLnkGSn7ZZtUpPdov_e1FatHjxNJvPMOzO8CB0TfMFyUVwGwjgtUkxxiolgLH3bQX0iBE-ZoMXu1ruHDkKYYcwKSvg-6jFWYk447SP5PIVk6J1qa2WTRXg3U9fUdtLVoa1NUqlWaRXgKrmN-Vy1EJKx80mGMU6s62w4T5bgdQyqmoFp62UklK1i-vl_iPbGqglwtIkD9HJ3-zx8SEdP94_Dm1FqBCdtmhsDmmoNpTGM80xoAIyBlMKMFcVZRQFYkTGuNa4ENxkBlvNcMZITI7RgA3S91l10eg6VAdt61ciFj0v7d-lULX9XbD2VE7eUBcvykmRR4Gwj4N1rB6GV8zoYaBplwXVBUkGKsijjchE9_YPOXOdtPC9SGSk5XtkxQHRNGe9C8DD-XoZgufJPrv2T0T_56Z98i00n22d8t3wZFgG2BkIs2Qn4n9n_yH4AdreneA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2561940015</pqid></control><display><type>article</type><title>The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs</title><source>Springer Nature</source><creator>Peti-Stantić, Anita ; Anđel, Maja ; Gnjidić, Vedrana ; Keresteš, Gordana ; Ljubešić, Nikola ; Masnikosa, Irina ; Tonković, Mirjana ; Tušek, Jelena ; Willer-Gold, Jana ; Stanojević, Mateusz-Milan</creator><creatorcontrib>Peti-Stantić, Anita ; Anđel, Maja ; Gnjidić, Vedrana ; Keresteš, Gordana ; Ljubešić, Nikola ; Masnikosa, Irina ; Tonković, Mirjana ; Tušek, Jelena ; Willer-Gold, Jana ; Stanojević, Mateusz-Milan</creatorcontrib><description>Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. 
word length, corpus frequency) are frequently coded, but word class is seldom systematically treated, although there are indications of its significance for imageability and concreteness. This paper presents the Croatian Psycholinguistic Database (CPD; available at:
https://doi.org/10.17234/megahr.2019.hpb
), containing 6000 Croatian nouns, verbs, adjectives and adverbs, rated for concreteness, imageability, age of acquisition, and subjective frequency. Moreover, we present computationally obtained extrapolations of concreteness and imageability to the remainder of the Croatian lexicon (available at:
https://github.com/megahr/lexicon/blob/master/predictions/hr_c_i.predictions.txt
). In the two studies presented here, we explore the significance of word class for concreteness and imageability in human and computationally obtained ratings. The observed correlations in the CPD indicate correspondences between psycholinguistic measures expected from the literature. Word classes exhibit differences in subjective frequency, age of acquisition, concreteness and imageability, with significant differences between nouns, verbs, adjectives and adverbs. In the computational study which focused on concreteness and imageability, concreteness obtained higher correlations with human ratings than imageability, and the system underpredicted the concreteness of nouns, and overpredicted the concreteness of adjectives and adverbs. Overall, this suggests that word class contains schematic conceptual and distributional information. Schematic conceptual content seems to be more significant in human ratings of concreteness and less significant in computationally obtained ratings, where distributional information seems to play a more significant role. This suggests that word class differences should be theoretically explored.</description><identifier>ISSN: 1554-3528</identifier><identifier>ISSN: 1554-351X</identifier><identifier>EISSN: 1554-3528</identifier><identifier>DOI: 10.3758/s13428-020-01533-x</identifier><identifier>PMID: 33904142</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Age ; Behavioral Science and Psychology ; Cognitive Psychology ; Humans ; Language ; Psycholinguistics ; Psychology ; Ratings & rankings</subject><ispartof>Behavior research methods, 2021-08, Vol.53 (4), p.1799-1816</ispartof><rights>The Author(s) 2021</rights><rights>2021. The Author(s).</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53</citedby><cites>FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,777,781,882,27905,27906</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33904142$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Peti-Stantić, Anita</creatorcontrib><creatorcontrib>Anđel, Maja</creatorcontrib><creatorcontrib>Gnjidić, Vedrana</creatorcontrib><creatorcontrib>Keresteš, Gordana</creatorcontrib><creatorcontrib>Ljubešić, Nikola</creatorcontrib><creatorcontrib>Masnikosa, Irina</creatorcontrib><creatorcontrib>Tonković, Mirjana</creatorcontrib><creatorcontrib>Tušek, Jelena</creatorcontrib><creatorcontrib>Willer-Gold, Jana</creatorcontrib><creatorcontrib>Stanojević, Mateusz-Milan</creatorcontrib><title>The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs</title><title>Behavior research methods</title><addtitle>Behav Res</addtitle><addtitle>Behav Res Methods</addtitle><description>Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. word length, corpus frequency) are frequently coded, but word class is seldom systematically treated, although there are indications of its significance for imageability and concreteness. 
This paper presents the Croatian Psycholinguistic Database (CPD; available at:
https://doi.org/10.17234/megahr.2019.hpb
), containing 6000 Croatian nouns, verbs, adjectives and adverbs, rated for concreteness, imageability, age of acquisition, and subjective frequency. Moreover, we present computationally obtained extrapolations of concreteness and imageability to the remainder of the Croatian lexicon (available at:
https://github.com/megahr/lexicon/blob/master/predictions/hr_c_i.predictions.txt
). In the two studies presented here, we explore the significance of word class for concreteness and imageability in human and computationally obtained ratings. The observed correlations in the CPD indicate correspondences between psycholinguistic measures expected from the literature. Word classes exhibit differences in subjective frequency, age of acquisition, concreteness and imageability, with significant differences between nouns, verbs, adjectives and adverbs. In the computational study which focused on concreteness and imageability, concreteness obtained higher correlations with human ratings than imageability, and the system underpredicted the concreteness of nouns, and overpredicted the concreteness of adjectives and adverbs. Overall, this suggests that word class contains schematic conceptual and distributional information. Schematic conceptual content seems to be more significant in human ratings of concreteness and less significant in computationally obtained ratings, where distributional information seems to play a more significant role. 
This suggests that word class differences should be theoretically explored.</description><subject>Age</subject><subject>Behavioral Science and Psychology</subject><subject>Cognitive Psychology</subject><subject>Humans</subject><subject>Language</subject><subject>Psycholinguistics</subject><subject>Psychology</subject><subject>Ratings & rankings</subject><issn>1554-3528</issn><issn>1554-351X</issn><issn>1554-3528</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kU1LAzEQhoMotlb_gAdZ8OLB1XzuhwdBil9Q8KLnkGSn7ZZtUpPdov_e1FatHjxNJvPMOzO8CB0TfMFyUVwGwjgtUkxxiolgLH3bQX0iBE-ZoMXu1ruHDkKYYcwKSvg-6jFWYk447SP5PIVk6J1qa2WTRXg3U9fUdtLVoa1NUqlWaRXgKrmN-Vy1EJKx80mGMU6s62w4T5bgdQyqmoFp62UklK1i-vl_iPbGqglwtIkD9HJ3-zx8SEdP94_Dm1FqBCdtmhsDmmoNpTGM80xoAIyBlMKMFcVZRQFYkTGuNa4ENxkBlvNcMZITI7RgA3S91l10eg6VAdt61ciFj0v7d-lULX9XbD2VE7eUBcvykmRR4Gwj4N1rB6GV8zoYaBplwXVBUkGKsijjchE9_YPOXOdtPC9SGSk5XtkxQHRNGe9C8DD-XoZgufJPrv2T0T_56Z98i00n22d8t3wZFgG2BkIs2Qn4n9n_yH4AdreneA</recordid><startdate>20210801</startdate><enddate>20210801</enddate><creator>Peti-Stantić, Anita</creator><creator>Anđel, Maja</creator><creator>Gnjidić, Vedrana</creator><creator>Keresteš, Gordana</creator><creator>Ljubešić, Nikola</creator><creator>Masnikosa, Irina</creator><creator>Tonković, Mirjana</creator><creator>Tušek, Jelena</creator><creator>Willer-Gold, Jana</creator><creator>Stanojević, Mateusz-Milan</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>4T-</scope><scope>7TK</scope><scope>K9.</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20210801</creationdate><title>The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs</title><author>Peti-Stantić, Anita ; Anđel, 
Maja ; Gnjidić, Vedrana ; Keresteš, Gordana ; Ljubešić, Nikola ; Masnikosa, Irina ; Tonković, Mirjana ; Tušek, Jelena ; Willer-Gold, Jana ; Stanojević, Mateusz-Milan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Age</topic><topic>Behavioral Science and Psychology</topic><topic>Cognitive Psychology</topic><topic>Humans</topic><topic>Language</topic><topic>Psycholinguistics</topic><topic>Psychology</topic><topic>Ratings & rankings</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Peti-Stantić, Anita</creatorcontrib><creatorcontrib>Anđel, Maja</creatorcontrib><creatorcontrib>Gnjidić, Vedrana</creatorcontrib><creatorcontrib>Keresteš, Gordana</creatorcontrib><creatorcontrib>Ljubešić, Nikola</creatorcontrib><creatorcontrib>Masnikosa, Irina</creatorcontrib><creatorcontrib>Tonković, Mirjana</creatorcontrib><creatorcontrib>Tušek, Jelena</creatorcontrib><creatorcontrib>Willer-Gold, Jana</creatorcontrib><creatorcontrib>Stanojević, Mateusz-Milan</creatorcontrib><collection>SpringerOpen</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Docstoc</collection><collection>Neurosciences Abstracts</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Behavior research methods</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Peti-Stantić, Anita</au><au>Anđel, Maja</au><au>Gnjidić, 
Vedrana</au><au>Keresteš, Gordana</au><au>Ljubešić, Nikola</au><au>Masnikosa, Irina</au><au>Tonković, Mirjana</au><au>Tušek, Jelena</au><au>Willer-Gold, Jana</au><au>Stanojević, Mateusz-Milan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs</atitle><jtitle>Behavior research methods</jtitle><stitle>Behav Res</stitle><addtitle>Behav Res Methods</addtitle><date>2021-08-01</date><risdate>2021</risdate><volume>53</volume><issue>4</issue><spage>1799</spage><epage>1816</epage><pages>1799-1816</pages><issn>1554-3528</issn><issn>1554-351X</issn><eissn>1554-3528</eissn><abstract>Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. word length, corpus frequency) are frequently coded, but word class is seldom systematically treated, although there are indications of its significance for imageability and concreteness. This paper presents the Croatian Psycholinguistic Database (CPD; available at:
https://doi.org/10.17234/megahr.2019.hpb
), containing 6000 Croatian nouns, verbs, adjectives and adverbs, rated for concreteness, imageability, age of acquisition, and subjective frequency. Moreover, we present computationally obtained extrapolations of concreteness and imageability to the remainder of the Croatian lexicon (available at:
https://github.com/megahr/lexicon/blob/master/predictions/hr_c_i.predictions.txt
). In the two studies presented here, we explore the significance of word class for concreteness and imageability in human and computationally obtained ratings. The observed correlations in the CPD indicate correspondences between psycholinguistic measures expected from the literature. Word classes exhibit differences in subjective frequency, age of acquisition, concreteness and imageability, with significant differences between nouns, verbs, adjectives and adverbs. In the computational study which focused on concreteness and imageability, concreteness obtained higher correlations with human ratings than imageability, and the system underpredicted the concreteness of nouns, and overpredicted the concreteness of adjectives and adverbs. Overall, this suggests that word class contains schematic conceptual and distributional information. Schematic conceptual content seems to be more significant in human ratings of concreteness and less significant in computationally obtained ratings, where distributional information seems to play a more significant role. This suggests that word class differences should be theoretically explored.</abstract><cop>New York</cop><pub>Springer US</pub><pmid>33904142</pmid><doi>10.3758/s13428-020-01533-x</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1554-3528 |
ispartof | Behavior research methods, 2021-08, Vol.53 (4), p.1799-1816 |
issn | 1554-3528; 1554-351X |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8367916 |
source | Springer Nature |
subjects | Age; Behavioral Science and Psychology; Cognitive Psychology; Humans; Language; Psycholinguistics; Psychology; Ratings & rankings |
title | The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T07%3A55%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Croatian%20psycholinguistic%20database:%20Estimates%20for%206000%20nouns,%20verbs,%20adjectives%20and%20adverbs&rft.jtitle=Behavior%20research%20methods&rft.au=Peti-Stanti%C4%87,%20Anita&rft.date=2021-08-01&rft.volume=53&rft.issue=4&rft.spage=1799&rft.epage=1816&rft.pages=1799-1816&rft.issn=1554-3528&rft.eissn=1554-3528&rft_id=info:doi/10.3758/s13428-020-01533-x&rft_dat=%3Cproquest_pubme%3E2561940015%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c541t-7cceb2bbe9cc34465bee00e195cfa206d2ee38634bb0d54c61e3747a3171c5b53%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2561940015&rft_id=info:pmid/33904142&rfr_iscdi=true |