Loading…

Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation

Current research on RAGs is distributed across various disciplines, and since the technology is evolving very quickly, its unit of analysis is mostly on technological innovations, rather than applications in business contexts. Thus, in this research, we aim to create a taxonomy to conceptualize a co...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2024-08
Main Authors: Li, Mahei Manhai, Nikishina, Irina, Sevgili, Özge, Semmann, Martin
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Li, Mahei Manhai
Nikishina, Irina
Sevgili, Özge
Semmann, Martin
description Current research on RAGs is distributed across various disciplines, and since the technology is evolving very quickly, its unit of analysis is mostly on technological innovations, rather than applications in business contexts. Thus, in this research, we aim to create a taxonomy to conceptualize a comprehensive overview of the constituting characteristics that define RAG applications, facilitating the adoption of this technology in the IS community. To the best of our knowledge, no RAG application taxonomies have been developed so far. We describe our methodology for developing the taxonomy, which includes the criteria for selecting papers, an explanation of our rationale for employing a Large Language Model (LLM)-supported approach to extract and identify initial characteristics, and a concise overview of our systematic process for conceptualizing the taxonomy. Our systematic taxonomy development process includes four iterative phases designed to refine and enhance our understanding and presentation of RAG's core dimensions. We have developed a total of five meta-dimensions and sixteen dimensions to comprehensively capture the concept of Retrieval-Augmented Generation (RAG) applications. When discussing our findings, we also detail the specific research areas and pose key research questions to guide future information system researchers as they explore the emerging topics of RAG systems.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3090746306</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3090746306</sourcerecordid><originalsourceid>FETCH-proquest_journals_30907463063</originalsourceid><addsrcrecordid>eNqNjMsKwjAURIMgKOo_XHBdiEkfuiziY6EbEVxKoLc1Jc3VPET_3iJ-gJuZA3OYARsLKRfJMhVixGbet5xzkRciy-SYVRd917YBigHCDcHoTgcVNFkPVMNBuQb7tE1UPRypQuMhSaCEs3qRpe4NNTk4YXAan8pAGZsObcAKdmjRfa-mbFgr43H26wmbbzfn9T65O3pE9OHaUnS2n66Sr3iR5pLn8j_rA6EDRYY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3090746306</pqid></control><display><type>article</type><title>Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation</title><source>Publicly Available Content Database</source><creator>Li, Mahei Manhai ; Nikishina, Irina ; Sevgili, Özge ; Semmann, Martin</creator><creatorcontrib>Li, Mahei Manhai ; Nikishina, Irina ; Sevgili, Özge ; Semmann, Martin</creatorcontrib><description>Current research on RAGs is distributed across various disciplines, and since the technology is evolving very quickly, its unit of analysis is mostly on technological innovations, rather than applications in business contexts. Thus, in this research, we aim to create a taxonomy to conceptualize a comprehensive overview of the constituting characteristics that define RAG applications, facilitating the adoption of this technology in the IS community. To the best of our knowledge, no RAG application taxonomies have been developed so far. We describe our methodology for developing the taxonomy, which includes the criteria for selecting papers, an explanation of our rationale for employing a Large Language Model (LLM)-supported approach to extract and identify initial characteristics, and a concise overview of our systematic process for conceptualizing the taxonomy. Our systematic taxonomy development process includes four iterative phases designed to refine and enhance our understanding and presentation of RAG's core dimensions. We have developed a total of five meta-dimensions and sixteen dimensions to comprehensively capture the concept of Retrieval-Augmented Generation (RAG) applications. When discussing our findings, we also detail the specific research areas and pose key research questions to guide future information system researchers as they explore the emerging topics of RAG systems.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Large language models ; Retrieval ; Taxonomy ; Technology assessment</subject><ispartof>arXiv.org, 2024-08</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/3090746306?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>778,782,25736,36995,44573</link.rule.ids></links><search><creatorcontrib>Li, Mahei Manhai</creatorcontrib><creatorcontrib>Nikishina, Irina</creatorcontrib><creatorcontrib>Sevgili, Özge</creatorcontrib><creatorcontrib>Semmann, Martin</creatorcontrib><title>Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation</title><title>arXiv.org</title><description>Current research on RAGs is distributed across various disciplines, and since the technology is evolving very quickly, its unit of analysis is mostly on technological innovations, rather than applications in business contexts. Thus, in this research, we aim to create a taxonomy to conceptualize a comprehensive overview of the constituting characteristics that define RAG applications, facilitating the adoption of this technology in the IS community. To the best of our knowledge, no RAG application taxonomies have been developed so far. We describe our methodology for developing the taxonomy, which includes the criteria for selecting papers, an explanation of our rationale for employing a Large Language Model (LLM)-supported approach to extract and identify initial characteristics, and a concise overview of our systematic process for conceptualizing the taxonomy. Our systematic taxonomy development process includes four iterative phases designed to refine and enhance our understanding and presentation of RAG's core dimensions. We have developed a total of five meta-dimensions and sixteen dimensions to comprehensively capture the concept of Retrieval-Augmented Generation (RAG) applications. When discussing our findings, we also detail the specific research areas and pose key research questions to guide future information system researchers as they explore the emerging topics of RAG systems.</description><subject>Large language models</subject><subject>Retrieval</subject><subject>Taxonomy</subject><subject>Technology assessment</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjMsKwjAURIMgKOo_XHBdiEkfuiziY6EbEVxKoLc1Jc3VPET_3iJ-gJuZA3OYARsLKRfJMhVixGbet5xzkRciy-SYVRd917YBigHCDcHoTgcVNFkPVMNBuQb7tE1UPRypQuMhSaCEs3qRpe4NNTk4YXAan8pAGZsObcAKdmjRfa-mbFgr43H26wmbbzfn9T65O3pE9OHaUnS2n66Sr3iR5pLn8j_rA6EDRYY</recordid><startdate>20240812</startdate><enddate>20240812</enddate><creator>Li, Mahei Manhai</creator><creator>Nikishina, Irina</creator><creator>Sevgili, Özge</creator><creator>Semmann, Martin</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240812</creationdate><title>Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation</title><author>Li, Mahei Manhai ; Nikishina, Irina ; Sevgili, Özge ; Semmann, Martin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30907463063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Large language models</topic><topic>Retrieval</topic><topic>Taxonomy</topic><topic>Technology assessment</topic><toplevel>online_resources</toplevel><creatorcontrib>Li, Mahei Manhai</creatorcontrib><creatorcontrib>Nikishina, Irina</creatorcontrib><creatorcontrib>Sevgili, Özge</creatorcontrib><creatorcontrib>Semmann, Martin</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Mahei Manhai</au><au>Nikishina, Irina</au><au>Sevgili, Özge</au><au>Semmann, Martin</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation</atitle><jtitle>arXiv.org</jtitle><date>2024-08-12</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Current research on RAGs is distributed across various disciplines, and since the technology is evolving very quickly, its unit of analysis is mostly on technological innovations, rather than applications in business contexts. Thus, in this research, we aim to create a taxonomy to conceptualize a comprehensive overview of the constituting characteristics that define RAG applications, facilitating the adoption of this technology in the IS community. To the best of our knowledge, no RAG application taxonomies have been developed so far. We describe our methodology for developing the taxonomy, which includes the criteria for selecting papers, an explanation of our rationale for employing a Large Language Model (LLM)-supported approach to extract and identify initial characteristics, and a concise overview of our systematic process for conceptualizing the taxonomy. Our systematic taxonomy development process includes four iterative phases designed to refine and enhance our understanding and presentation of RAG's core dimensions. We have developed a total of five meta-dimensions and sixteen dimensions to comprehensively capture the concept of Retrieval-Augmented Generation (RAG) applications. When discussing our findings, we also detail the specific research areas and pose key research questions to guide future information system researchers as they explore the emerging topics of RAG systems.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-08
issn 2331-8422
language eng
recordid cdi_proquest_journals_3090746306
source Publicly Available Content Database
subjects Large language models
Retrieval
Taxonomy
Technology assessment
title Wiping out the limitations of Large Language Models -- A Taxonomy for Retrieval Augmented Generation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T17%3A19%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Wiping%20out%20the%20limitations%20of%20Large%20Language%20Models%20--%20A%20Taxonomy%20for%20Retrieval%20Augmented%20Generation&rft.jtitle=arXiv.org&rft.au=Li,%20Mahei%20Manhai&rft.date=2024-08-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3090746306%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_30907463063%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3090746306&rft_id=info:pmid/&rfr_iscdi=true