Loading…

Self-supervised Training of Proposal-based Segmentation via Background Prediction

While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is prohibitively expensive, we introduce a self-supervised approa...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2019-07
Main Authors: Katircioglu, Isinsu, Rhodin, Helge, Constantin, Victor, Spörri, Jörg, Salzmann, Mathieu, Fua, Pascal
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Katircioglu, Isinsu
Rhodin, Helge
Constantin, Victor
Spörri, Jörg
Salzmann, Mathieu
Fua, Pascal
description While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is prohibitively expensive, we introduce a self-supervised approach to object detection and segmentation, able to work with monocular images captured with a moving camera. At the heart of our approach lies the observation that segmentation and background reconstruction are linked tasks, and the idea that, because we observe a structured scene, background regions can be re-synthesized from their surroundings, whereas regions depicting the object cannot. We therefore encode this intuition as a self-supervised loss function that we exploit to train a proposal-based segmentation network. To account for the discrete nature of object proposals, we develop a Monte Carlo-based training strategy that allows us to explore the large space of object proposals. Our experiments demonstrate that our approach yields accurate detections and segmentations in images that visually depart from those of standard benchmarks, outperforming existing self-supervised methods and approaching weakly supervised ones that exploit large annotated datasets.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2260225310</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2260225310</sourcerecordid><originalsourceid>FETCH-proquest_journals_22602253103</originalsourceid><addsrcrecordid>eNqNytEKgjAUgOERBEn5DoOuB_Msreui6LLQ-zjpUWa22eZ8_hR6gK7-i-9fsAiUSsRhB7BisfetlBKyPaSpitg9p64WPvTkRu2p4oVDbbRpuK35zdneeuzEE2fKqXmTGXDQ1vBRIz9i-WqcDaaaVqp0OcuGLWvsPMW_rtn2ci5OV9E7-wnkh0drgzMTPQAyCZCqRKr_ri_lcD-x</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2260225310</pqid></control><display><type>article</type><title>Self-supervised Training of Proposal-based Segmentation via Background Prediction</title><source>Publicly Available Content (ProQuest)</source><creator>Katircioglu, Isinsu ; Rhodin, Helge ; Constantin, Victor ; Spörri, Jörg ; Salzmann, Mathieu ; Fua, Pascal</creator><creatorcontrib>Katircioglu, Isinsu ; Rhodin, Helge ; Constantin, Victor ; Spörri, Jörg ; Salzmann, Mathieu ; Fua, Pascal</creatorcontrib><description>While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is prohibitively expensive, we introduce a self-supervised approach to object detection and segmentation, able to work with monocular images captured with a moving camera. At the heart of our approach lies the observation that segmentation and background reconstruction are linked tasks, and the idea that, because we observe a structured scene, background regions can be re-synthesized from their surroundings, whereas regions depicting the object cannot. We therefore encode this intuition as a self-supervised loss function that we exploit to train a proposal-based segmentation network. To account for the discrete nature of object proposals, we develop a Monte Carlo-based training strategy that allows us to explore the large space of object proposals. Our experiments demonstrate that our approach yields accurate detections and segmentations in images that visually depart from those of standard benchmarks, outperforming existing self-supervised methods and approaching weakly supervised ones that exploit large annotated datasets.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Image detection ; Image segmentation ; Object recognition ; Proposals ; Training</subject><ispartof>arXiv.org, 2019-07</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2260225310?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Katircioglu, Isinsu</creatorcontrib><creatorcontrib>Rhodin, Helge</creatorcontrib><creatorcontrib>Constantin, Victor</creatorcontrib><creatorcontrib>Spörri, Jörg</creatorcontrib><creatorcontrib>Salzmann, Mathieu</creatorcontrib><creatorcontrib>Fua, Pascal</creatorcontrib><title>Self-supervised Training of Proposal-based Segmentation via Background Prediction</title><title>arXiv.org</title><description>While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is prohibitively expensive, we introduce a self-supervised approach to object detection and segmentation, able to work with monocular images captured with a moving camera. At the heart of our approach lies the observation that segmentation and background reconstruction are linked tasks, and the idea that, because we observe a structured scene, background regions can be re-synthesized from their surroundings, whereas regions depicting the object cannot. We therefore encode this intuition as a self-supervised loss function that we exploit to train a proposal-based segmentation network. To account for the discrete nature of object proposals, we develop a Monte Carlo-based training strategy that allows us to explore the large space of object proposals. Our experiments demonstrate that our approach yields accurate detections and segmentations in images that visually depart from those of standard benchmarks, outperforming existing self-supervised methods and approaching weakly supervised ones that exploit large annotated datasets.</description><subject>Image detection</subject><subject>Image segmentation</subject><subject>Object recognition</subject><subject>Proposals</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNytEKgjAUgOERBEn5DoOuB_Msreui6LLQ-zjpUWa22eZ8_hR6gK7-i-9fsAiUSsRhB7BisfetlBKyPaSpitg9p64WPvTkRu2p4oVDbbRpuK35zdneeuzEE2fKqXmTGXDQ1vBRIz9i-WqcDaaaVqp0OcuGLWvsPMW_rtn2ci5OV9E7-wnkh0drgzMTPQAyCZCqRKr_ri_lcD-x</recordid><startdate>20190718</startdate><enddate>20190718</enddate><creator>Katircioglu, Isinsu</creator><creator>Rhodin, Helge</creator><creator>Constantin, Victor</creator><creator>Spörri, Jörg</creator><creator>Salzmann, Mathieu</creator><creator>Fua, Pascal</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190718</creationdate><title>Self-supervised Training of Proposal-based Segmentation via Background Prediction</title><author>Katircioglu, Isinsu ; Rhodin, Helge ; Constantin, Victor ; Spörri, Jörg ; Salzmann, Mathieu ; Fua, Pascal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_22602253103</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Image detection</topic><topic>Image segmentation</topic><topic>Object recognition</topic><topic>Proposals</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Katircioglu, Isinsu</creatorcontrib><creatorcontrib>Rhodin, Helge</creatorcontrib><creatorcontrib>Constantin, Victor</creatorcontrib><creatorcontrib>Spörri, Jörg</creatorcontrib><creatorcontrib>Salzmann, Mathieu</creatorcontrib><creatorcontrib>Fua, Pascal</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Katircioglu, Isinsu</au><au>Rhodin, Helge</au><au>Constantin, Victor</au><au>Spörri, Jörg</au><au>Salzmann, Mathieu</au><au>Fua, Pascal</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Self-supervised Training of Proposal-based Segmentation via Background Prediction</atitle><jtitle>arXiv.org</jtitle><date>2019-07-18</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is prohibitively expensive, we introduce a self-supervised approach to object detection and segmentation, able to work with monocular images captured with a moving camera. At the heart of our approach lies the observation that segmentation and background reconstruction are linked tasks, and the idea that, because we observe a structured scene, background regions can be re-synthesized from their surroundings, whereas regions depicting the object cannot. We therefore encode this intuition as a self-supervised loss function that we exploit to train a proposal-based segmentation network. To account for the discrete nature of object proposals, we develop a Monte Carlo-based training strategy that allows us to explore the large space of object proposals. Our experiments demonstrate that our approach yields accurate detections and segmentations in images that visually depart from those of standard benchmarks, outperforming existing self-supervised methods and approaching weakly supervised ones that exploit large annotated datasets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2019-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_2260225310
source Publicly Available Content (ProQuest)
subjects Image detection
Image segmentation
Object recognition
Proposals
Training
title Self-supervised Training of Proposal-based Segmentation via Background Prediction
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T20%3A00%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Self-supervised%20Training%20of%20Proposal-based%20Segmentation%20via%20Background%20Prediction&rft.jtitle=arXiv.org&rft.au=Katircioglu,%20Isinsu&rft.date=2019-07-18&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2260225310%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_22602253103%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2260225310&rft_id=info:pmid/&rfr_iscdi=true