Loading…
MIDST: an enhanced development environment that improves the maintainability of a data science analysis
With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importanc...
Saved in:
Published in: | International journal of information systems and project management 2020-01, Vol.8 (3), p.5-22 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c370t-29b4cd65e284e46e4dca2e38ad1e4e1b24f2a5af18a0d3af9764cdf2b0955abe3 |
---|---|
cites | |
container_end_page | 22 |
container_issue | 3 |
container_start_page | 5 |
container_title | International journal of information systems and project management |
container_volume | 8 |
creator | Saltz, Jeffrey S. Crowston, Kevin Heckman, Robert Hegde, Yatish |
description | With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importance of being able to easily maintain and enhance the code required for an analysis will increase. However, to date, there has been minimal research on the maintainability of an analysis done by a data science team. To help address this gap, data science maintainability was explored by (1) creating a data science maintainability model, (2) creating a new tool, called MIDST (Modular Interactive Data Science Tool), that aims to improve data science maintainability, and then (3) conducting a mixed method experiment to evaluate MIDST. The new tool aims to improve the ability of a team member to update and rerun an existing data science analysis by providing a visual data flow view of the analysis within an integrated code and computational environment. Via an analysis of the quantitative and qualitative survey results, the experiment found that MIDST does help improve the maintainability of an analysis. Thus, this research demonstrates the importance of enhanced tools tohelp improve the maintainability of data science projects. |
doi_str_mv | 10.12821/ijispm080301 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_d8c9f2da83254bd5a58a9fb7be4c4400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_d8c9f2da83254bd5a58a9fb7be4c4400</doaj_id><sourcerecordid>2487473903</sourcerecordid><originalsourceid>FETCH-LOGICAL-c370t-29b4cd65e284e46e4dca2e38ad1e4e1b24f2a5af18a0d3af9764cdf2b0955abe3</originalsourceid><addsrcrecordid>eNpNkc1rGzEQxZfSQk3iY--CnLfVp1ebW3G_DAk5JD2LWWlky-yuXEk2-L-vsEPIYZg3w-M3DK9pvjD6lXHN2bewD_kwUU0FZR-aBWeat12n9cd3-nOzzHlPKWVKK6G7RbN93Px4frknMBOcdzBbdMThCcd4mHAudXkKKc4XXXZQSJgOKZ4w1wnJBGEutWAIYyhnEj0B4qAAyTZghVUujOcc8m3zycOYcfnab5q_v36-rP-0D0-_N-vvD60VHS0t7wdp3Uoh1xLlCqWzwFFocAwlsoFLz0GBZxqoE-D7blX9ng-0VwoGFDfN5sp1EfbmkMIE6WwiBHNZxLQ1kEqwIxqnbe-5Ay24koNToDT0fugGlFZKSivr7sqqH_87Yi5mH4-pPpQNl7qTneipqK726rIp5pzQv11l1FyiMe-jEf8BJwuELw</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2487473903</pqid></control><display><type>article</type><title>MIDST: an enhanced development environment that improves the maintainability of a data science analysis</title><source>ABI/INFORM Global</source><source>Publicly Available Content (ProQuest)</source><creator>Saltz, Jeffrey S. ; Crowston, Kevin ; Heckman, Robert ; Hegde, Yatish</creator><creatorcontrib>Saltz, Jeffrey S. ; Crowston, Kevin ; Heckman, Robert ; Hegde, Yatish</creatorcontrib><description>With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importance of being able to easily maintain and enhance the code required for an analysis will increase. However, to date, there has been minimal research on the maintainability of an analysis done by a data science team. To help address this gap, data science maintainability was explored by (1) creating a data science maintainability model, (2) creating a new tool, called MIDST (Modular Interactive Data Science Tool), that aims to improve data science maintainability, and then (3) conducting a mixed method experiment to evaluate MIDST. The new tool aims to improve the ability of a team member to update and rerun an existing data science analysis by providing a visual data flow view of the analysis within an integrated code and computational environment. Via an analysis of the quantitative and qualitative survey results, the experiment found that MIDST does help improve the maintainability of an analysis. Thus, this research demonstrates the importance of enhanced tools tohelp improve the maintainability of data science projects.</description><identifier>ISSN: 2182-7788</identifier><identifier>ISSN: 2182-7796</identifier><identifier>EISSN: 2182-7788</identifier><identifier>DOI: 10.12821/ijispm080301</identifier><language>eng</language><publisher>Lisbon: Association for Promotion and Dissemination of Scientific Knowledge (SciKA)</publisher><subject>Algorithms ; Data science ; data science development environment ; Experiments ; maintainability ; Project management ; Reproducibility ; Science ; Scientists ; Software development ; Software engineering ; Software upgrading ; visual programming</subject><ispartof>International journal of information systems and project management, 2020-01, Vol.8 (3), p.5-22</ispartof><rights>2020. This work is published under https://www.umes.edu/ajcjs/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c370t-29b4cd65e284e46e4dca2e38ad1e4e1b24f2a5af18a0d3af9764cdf2b0955abe3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2487473903/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2487473903?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,777,781,11669,25734,27905,27906,36041,36993,44344,44571,74644,74875</link.rule.ids></links><search><creatorcontrib>Saltz, Jeffrey S.</creatorcontrib><creatorcontrib>Crowston, Kevin</creatorcontrib><creatorcontrib>Heckman, Robert</creatorcontrib><creatorcontrib>Hegde, Yatish</creatorcontrib><title>MIDST: an enhanced development environment that improves the maintainability of a data science analysis</title><title>International journal of information systems and project management</title><description>With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importance of being able to easily maintain and enhance the code required for an analysis will increase. However, to date, there has been minimal research on the maintainability of an analysis done by a data science team. To help address this gap, data science maintainability was explored by (1) creating a data science maintainability model, (2) creating a new tool, called MIDST (Modular Interactive Data Science Tool), that aims to improve data science maintainability, and then (3) conducting a mixed method experiment to evaluate MIDST. The new tool aims to improve the ability of a team member to update and rerun an existing data science analysis by providing a visual data flow view of the analysis within an integrated code and computational environment. Via an analysis of the quantitative and qualitative survey results, the experiment found that MIDST does help improve the maintainability of an analysis. Thus, this research demonstrates the importance of enhanced tools tohelp improve the maintainability of data science projects.</description><subject>Algorithms</subject><subject>Data science</subject><subject>data science development environment</subject><subject>Experiments</subject><subject>maintainability</subject><subject>Project management</subject><subject>Reproducibility</subject><subject>Science</subject><subject>Scientists</subject><subject>Software development</subject><subject>Software engineering</subject><subject>Software upgrading</subject><subject>visual programming</subject><issn>2182-7788</issn><issn>2182-7796</issn><issn>2182-7788</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpNkc1rGzEQxZfSQk3iY--CnLfVp1ebW3G_DAk5JD2LWWlky-yuXEk2-L-vsEPIYZg3w-M3DK9pvjD6lXHN2bewD_kwUU0FZR-aBWeat12n9cd3-nOzzHlPKWVKK6G7RbN93Px4frknMBOcdzBbdMThCcd4mHAudXkKKc4XXXZQSJgOKZ4w1wnJBGEutWAIYyhnEj0B4qAAyTZghVUujOcc8m3zycOYcfnab5q_v36-rP-0D0-_N-vvD60VHS0t7wdp3Uoh1xLlCqWzwFFocAwlsoFLz0GBZxqoE-D7blX9ng-0VwoGFDfN5sp1EfbmkMIE6WwiBHNZxLQ1kEqwIxqnbe-5Ay24koNToDT0fugGlFZKSivr7sqqH_87Yi5mH4-pPpQNl7qTneipqK726rIp5pzQv11l1FyiMe-jEf8BJwuELw</recordid><startdate>20200101</startdate><enddate>20200101</enddate><creator>Saltz, Jeffrey S.</creator><creator>Crowston, Kevin</creator><creator>Heckman, Robert</creator><creator>Hegde, Yatish</creator><general>Association for Promotion and Dissemination of Scientific Knowledge (SciKA)</general><general>UMinho Editora</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8FK</scope><scope>8FL</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>K60</scope><scope>K6~</scope><scope>L.-</scope><scope>M0C</scope><scope>PIMPY</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>DOA</scope></search><sort><creationdate>20200101</creationdate><title>MIDST: an enhanced development environment that improves the maintainability of a data science analysis</title><author>Saltz, Jeffrey S. ; Crowston, Kevin ; Heckman, Robert ; Hegde, Yatish</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c370t-29b4cd65e284e46e4dca2e38ad1e4e1b24f2a5af18a0d3af9764cdf2b0955abe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Data science</topic><topic>data science development environment</topic><topic>Experiments</topic><topic>maintainability</topic><topic>Project management</topic><topic>Reproducibility</topic><topic>Science</topic><topic>Scientists</topic><topic>Software development</topic><topic>Software engineering</topic><topic>Software upgrading</topic><topic>visual programming</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Saltz, Jeffrey S.</creatorcontrib><creatorcontrib>Crowston, Kevin</creatorcontrib><creatorcontrib>Heckman, Robert</creatorcontrib><creatorcontrib>Hegde, Yatish</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ABI/INFORM Global</collection><collection>Publicly Available Content (ProQuest)</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>International journal of information systems and project management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Saltz, Jeffrey S.</au><au>Crowston, Kevin</au><au>Heckman, Robert</au><au>Hegde, Yatish</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MIDST: an enhanced development environment that improves the maintainability of a data science analysis</atitle><jtitle>International journal of information systems and project management</jtitle><date>2020-01-01</date><risdate>2020</risdate><volume>8</volume><issue>3</issue><spage>5</spage><epage>22</epage><pages>5-22</pages><issn>2182-7788</issn><issn>2182-7796</issn><eissn>2182-7788</eissn><abstract>With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importance of being able to easily maintain and enhance the code required for an analysis will increase. However, to date, there has been minimal research on the maintainability of an analysis done by a data science team. To help address this gap, data science maintainability was explored by (1) creating a data science maintainability model, (2) creating a new tool, called MIDST (Modular Interactive Data Science Tool), that aims to improve data science maintainability, and then (3) conducting a mixed method experiment to evaluate MIDST. The new tool aims to improve the ability of a team member to update and rerun an existing data science analysis by providing a visual data flow view of the analysis within an integrated code and computational environment. Via an analysis of the quantitative and qualitative survey results, the experiment found that MIDST does help improve the maintainability of an analysis. Thus, this research demonstrates the importance of enhanced tools tohelp improve the maintainability of data science projects.</abstract><cop>Lisbon</cop><pub>Association for Promotion and Dissemination of Scientific Knowledge (SciKA)</pub><doi>10.12821/ijispm080301</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2182-7788 |
ispartof | International journal of information systems and project management, 2020-01, Vol.8 (3), p.5-22 |
issn | 2182-7788 2182-7796 2182-7788 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_d8c9f2da83254bd5a58a9fb7be4c4400 |
source | ABI/INFORM Global; Publicly Available Content (ProQuest) |
subjects | Algorithms Data science data science development environment Experiments maintainability Project management Reproducibility Science Scientists Software development Software engineering Software upgrading visual programming |
title | MIDST: an enhanced development environment that improves the maintainability of a data science analysis |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T04%3A52%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MIDST:%20an%20enhanced%20development%20environment%20that%20improves%20the%20maintainability%20of%20a%20data%20science%20analysis&rft.jtitle=International%20journal%20of%20information%20systems%20and%20project%20management&rft.au=Saltz,%20Jeffrey%20S.&rft.date=2020-01-01&rft.volume=8&rft.issue=3&rft.spage=5&rft.epage=22&rft.pages=5-22&rft.issn=2182-7788&rft.eissn=2182-7788&rft_id=info:doi/10.12821/ijispm080301&rft_dat=%3Cproquest_doaj_%3E2487473903%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c370t-29b4cd65e284e46e4dca2e38ad1e4e1b24f2a5af18a0d3af9764cdf2b0955abe3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2487473903&rft_id=info:pmid/&rfr_iscdi=true |