Who Are We? Mining Institutional Identities Using n-grams
Main Authors: | , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Summary: | Disciplines and organizations alike can be defined by the text they produce, the topics they discuss, and the language they employ. Analyzing such large amounts of text is challenging, but it is nevertheless needed because it can help stakeholders understand key themes in, and the evolution of, their corporate or disciplinary identity. N-gram analysis is a leading text-mining technique that can be leveraged for this purpose. In this manuscript we present the development, and demonstrate the potential utility, of an n-gram analysis tool. We focus on revealing several aspects of the identity of an academic journal, namely Communications of the ACM (CACM), through the analysis of over 14 million unique n-grams and their relative frequencies. The results of the study imply that n-gram analyses may be a key tool in resolving the IS identity crisis. Implications for research and practice are discussed. |
---|---|
ISSN: | 1530-1605 2572-6862 |
DOI: | 10.1109/HICSS.2012.642 |
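
The summary refers to counting n-grams and comparing their relative frequencies. As a rough illustration of that general idea, and not the tool described in the paper, the following minimal Python sketch extracts word n-grams from a text and computes each n-gram's share of all n-grams found. The function names, the tokenization rule, and the sample sentence are all assumptions made for this example.

```python
import re
from collections import Counter


def ngrams(text, n):
    """Return the word n-grams of `text` as space-joined strings.

    Tokenization here is an assumption: lowercase runs of letters and digits.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def relative_frequencies(text, n):
    """Map each n-gram to its count divided by the total number of n-grams."""
    grams = ngrams(text, n)
    total = len(grams)
    if total == 0:
        return {}
    counts = Counter(grams)
    return {gram: count / total for gram, count in counts.items()}


# Hypothetical sample text, not taken from the CACM corpus.
sample = "the identity of the journal and the identity of the field"
bigram_freqs = relative_frequencies(sample, 2)

# List the most frequent bigrams first.
for gram, freq in sorted(bigram_freqs.items(), key=lambda kv: -kv[1])[:3]:
    print(f"{gram}: {freq:.2f}")
```

Relative rather than raw counts make frequencies comparable across corpora of different sizes, for example across different publication years of a journal.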