
Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration

Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for operating room (OR) planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction...

Bibliographic Details
Published in:	arXiv.org 2020-02
Main Authors:	Rivoir, Dominik; Bodenstedt, Sebastian; Felix von Bechtolsheim; Distler, Marius; Weitz, Jürgen; Speidel, Stefanie
Format:	Article
Language:	English
Subjects:	Anesthesia; Annotations; Cognitive tasks; Computer vision; Feature extraction; Ground truth; Machine learning; Neural networks; Regularization; Segmentation; Surgery; Training

container_title	arXiv.org
creator	Rivoir, Dominik; Bodenstedt, Sebastian; Felix von Bechtolsheim; Distler, Marius; Weitz, Jürgen; Speidel, Stefanie
description	Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for OR planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction based solely on visual data from the endoscopic camera. We investigate whether RSD prediction can be improved using unsupervised temporal video segmentation as an auxiliary learning task. As opposed to previous work, which presented supervised surgical phase recognition as an auxiliary task, we avoid the need for manual annotations by proposing a similar but unsupervised learning objective which clusters video sequences into temporally coherent segments. In multiple experimental setups, results obtained by learning the auxiliary task are incorporated into a deep RSD model through feature extraction, pretraining or regularization. Further, we propose a novel loss function for RSD training which attempts to counteract unfavorable characteristics of the RSD ground truth. Using our unsupervised method as an auxiliary task for RSD training, we outperform other self-supervised methods and are comparable to the supervised state-of-the-art. Combined with the novel RSD loss, we slightly outperform the supervised approach.
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2365913221</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2365913221</sourcerecordid><originalsourceid>FETCH-proquest_journals_23659132213</originalsourceid><addsrcrecordid>eNqNjMsKwjAQRYMgKNp_GHBdaBNbdSk-cCla3ZZgxxptkzpJRP_eKn6Aq8vhHG6H9bkQcTgdc95jgbXXKIp4OuFJIvrsdtDWN0gPZbGADOvGkKzgqAo0sMeyRu2kU0aDtCA1zP1TVUrSCzJpb3A2BFvCQp2c0iW4C8IOa6n0h_aeSmzLpafvxZB1z7KyGPx2wEbrVbbYhA2Zu0fr8qvxpFuVc5Ems1hwHov_qjeONkla</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2365913221</pqid></control><display><type>article</type><title>Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration</title><source>Publicly Available Content Database</source><creator>Rivoir, Dominik ; Bodenstedt, Sebastian ; Felix von Bechtolsheim ; Distler, Marius ; Weitz, Jürgen ; Speidel, Stefanie</creator><creatorcontrib>Rivoir, Dominik ; Bodenstedt, Sebastian ; Felix von Bechtolsheim ; Distler, Marius ; Weitz, Jürgen ; Speidel, Stefanie</creatorcontrib><description>Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for OR planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction based solely on visual data from the endoscopic camera. We investigate whether RSD prediction can be improved using unsupervised temporal video segmentation as an auxiliary learning task. As opposed to previous work, which presented supervised surgical phase recognition as auxiliary task, we avoid the need for manual annotations by proposing a similar but unsupervised learning objective which clusters video sequences into temporally coherent segments. 
In multiple experimental setups, results obtained by learning the auxiliary task are incorporated into a deep RSD model through feature extraction, pretraining or regularization. Further, we propose a novel loss function for RSD training which attempts to counteract unfavorable characteristics of the RSD ground truth. Using our unsupervised method as an auxiliary task for RSD training, we outperform other self-supervised methods and are comparable to the supervised state-of-the-art. Combined with the novel RSD loss, we slightly outperform the supervised approach.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Anesthesia ; Annotations ; Cognitive tasks ; Computer vision ; Feature extraction ; Ground truth ; Machine learning ; Neural networks ; Regularization ; Segmentation ; Surgery ; Training</subject><ispartof>arXiv.org, 2020-02</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2365913221?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Rivoir, Dominik</creatorcontrib><creatorcontrib>Bodenstedt, Sebastian</creatorcontrib><creatorcontrib>Felix von Bechtolsheim</creatorcontrib><creatorcontrib>Distler, Marius</creatorcontrib><creatorcontrib>Weitz, Jürgen</creatorcontrib><creatorcontrib>Speidel, Stefanie</creatorcontrib><title>Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration</title><title>arXiv.org</title><description>Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for OR planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction based solely on visual data from the endoscopic camera. We investigate whether RSD prediction can be improved using unsupervised temporal video segmentation as an auxiliary learning task. As opposed to previous work, which presented supervised surgical phase recognition as auxiliary task, we avoid the need for manual annotations by proposing a similar but unsupervised learning objective which clusters video sequences into temporally coherent segments. In multiple experimental setups, results obtained by learning the auxiliary task are incorporated into a deep RSD model through feature extraction, pretraining or regularization. 
Further, we propose a novel loss function for RSD training which attempts to counteract unfavorable characteristics of the RSD ground truth. Using our unsupervised method as an auxiliary task for RSD training, we outperform other self-supervised methods and are comparable to the supervised state-of-the-art. Combined with the novel RSD loss, we slightly outperform the supervised approach.</description><subject>Anesthesia</subject><subject>Annotations</subject><subject>Cognitive tasks</subject><subject>Computer vision</subject><subject>Feature extraction</subject><subject>Ground truth</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Regularization</subject><subject>Segmentation</subject><subject>Surgery</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjMsKwjAQRYMgKNp_GHBdaBNbdSk-cCla3ZZgxxptkzpJRP_eKn6Aq8vhHG6H9bkQcTgdc95jgbXXKIp4OuFJIvrsdtDWN0gPZbGADOvGkKzgqAo0sMeyRu2kU0aDtCA1zP1TVUrSCzJpb3A2BFvCQp2c0iW4C8IOa6n0h_aeSmzLpafvxZB1z7KyGPx2wEbrVbbYhA2Zu0fr8qvxpFuVc5Ems1hwHov_qjeONkla</recordid><startdate>20200226</startdate><enddate>20200226</enddate><creator>Rivoir, Dominik</creator><creator>Bodenstedt, Sebastian</creator><creator>Felix von Bechtolsheim</creator><creator>Distler, Marius</creator><creator>Weitz, Jürgen</creator><creator>Speidel, Stefanie</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope></search><sort><creationdate>20200226</creationdate><title>Unsupervised Temporal Video Segmentation as an 
Auxiliary Task for Predicting the Remaining Surgery Duration</title><author>Rivoir, Dominik ; Bodenstedt, Sebastian ; Felix von Bechtolsheim ; Distler, Marius ; Weitz, Jürgen ; Speidel, Stefanie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_23659132213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Anesthesia</topic><topic>Annotations</topic><topic>Cognitive tasks</topic><topic>Computer vision</topic><topic>Feature extraction</topic><topic>Ground truth</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Regularization</topic><topic>Segmentation</topic><topic>Surgery</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Rivoir, Dominik</creatorcontrib><creatorcontrib>Bodenstedt, Sebastian</creatorcontrib><creatorcontrib>Felix von Bechtolsheim</creatorcontrib><creatorcontrib>Distler, Marius</creatorcontrib><creatorcontrib>Weitz, Jürgen</creatorcontrib><creatorcontrib>Speidel, Stefanie</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Databases</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI 
Edition</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rivoir, Dominik</au><au>Bodenstedt, Sebastian</au><au>Felix von Bechtolsheim</au><au>Distler, Marius</au><au>Weitz, Jürgen</au><au>Speidel, Stefanie</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration</atitle><jtitle>arXiv.org</jtitle><date>2020-02-26</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for OR planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction based solely on visual data from the endoscopic camera. We investigate whether RSD prediction can be improved using unsupervised temporal video segmentation as an auxiliary learning task. As opposed to previous work, which presented supervised surgical phase recognition as auxiliary task, we avoid the need for manual annotations by proposing a similar but unsupervised learning objective which clusters video sequences into temporally coherent segments. In multiple experimental setups, results obtained by learning the auxiliary task are incorporated into a deep RSD model through feature extraction, pretraining or regularization. Further, we propose a novel loss function for RSD training which attempts to counteract unfavorable characteristics of the RSD ground truth. Using our unsupervised method as an auxiliary task for RSD training, we outperform other self-supervised methods and are comparable to the supervised state-of-the-art. 
Combined with the novel RSD loss, we slightly outperform the supervised approach.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-02
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2365913221
source	Publicly Available Content Database
subjects	Anesthesia; Annotations; Cognitive tasks; Computer vision; Feature extraction; Ground truth; Machine learning; Neural networks; Regularization; Segmentation; Surgery; Training
title	Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T04%3A44%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Unsupervised%20Temporal%20Video%20Segmentation%20as%20an%20Auxiliary%20Task%20for%20Predicting%20the%20Remaining%20Surgery%20Duration&rft.jtitle=arXiv.org&rft.au=Rivoir,%20Dominik&rft.date=2020-02-26&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2365913221%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_23659132213%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2365913221&rft_id=info:pmid/&rfr_iscdi=true