Loading…

HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects

Among the most important prerequisites for creating and evaluating 6D object pose detectors are datasets with labeled 6D poses. With the advent of deep learning, demand for such datasets is growing continuously. Despite the fact that some of exist, they are scarce and typically have restricted setup...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2019-09
Main Authors: Kaskman, Roman, Zakharov, Sergey, Shugurov, Ivan, Ilic, Slobodan
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Kaskman, Roman
Zakharov, Sergey
Shugurov, Ivan
Ilic, Slobodan
description Among the most important prerequisites for creating and evaluating 6D object pose detectors are datasets with labeled 6D poses. With the advent of deep learning, demand for such datasets is growing continuously. Despite the fact that some of exist, they are scarce and typically have restricted setups, such as a single object per sequence, or they focus on specific object types, such as textureless industrial parts. Besides, two significant components are often ignored: training using only available 3D models instead of real data and scalability, i.e. training one method to detect all objects rather than training one detector per object. Other challenges, such as occlusions, changing light conditions and changes in object appearance, as well precisely defined benchmarks are either not present or are scattered among different datasets. In this paper we present a dataset for 6D pose estimation that covers the above-mentioned challenges, mainly targeting training from 3D models (both textured and textureless), scalability, occlusions, and changes in light conditions and object appearance. The dataset features 33 objects (17 toy, 8 household and 8 industry-relevant objects) over 13 scenes of various difficulty. We also present a set of benchmarks to test various desired detector properties, particularly focusing on scalability with respect to the number of objects and resistance to changing light conditions, occlusions and clutter. We also set a baseline for the presented benchmarks using a state-of-the-art DPOD detector. Considering the difficulty of making such datasets, we plan to release the code allowing other researchers to extend this dataset or make their own datasets in the future.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2205103560</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2205103560</sourcerecordid><originalsourceid>FETCH-proquest_journals_22051035603</originalsourceid><addsrcrecordid>eNqNyrEKwjAQgOEgCBbtOxw4F9LEVHGSmmo3RdxLqldosT3Npfj6OvgATv_w_RMRKa3TZLNSaiZi5k5KqbK1MkZHYldSj7XHN95tvoXLMU8sWBccY4CGPGQWzsQIBYe2d6GlAagBbeFUd3gLvBDTxj0Y41_nYnkorvsyeXp6jcih6mj0w5cqpaRJpTaZ1P9dH_9ENnQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2205103560</pqid></control><display><type>article</type><title>HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects</title><source>Publicly Available Content Database</source><creator>Kaskman, Roman ; Zakharov, Sergey ; Shugurov, Ivan ; Ilic, Slobodan</creator><creatorcontrib>Kaskman, Roman ; Zakharov, Sergey ; Shugurov, Ivan ; Ilic, Slobodan</creatorcontrib><description>Among the most important prerequisites for creating and evaluating 6D object pose detectors are datasets with labeled 6D poses. With the advent of deep learning, demand for such datasets is growing continuously. Despite the fact that some of exist, they are scarce and typically have restricted setups, such as a single object per sequence, or they focus on specific object types, such as textureless industrial parts. Besides, two significant components are often ignored: training using only available 3D models instead of real data and scalability, i.e. training one method to detect all objects rather than training one detector per object. Other challenges, such as occlusions, changing light conditions and changes in object appearance, as well precisely defined benchmarks are either not present or are scattered among different datasets. In this paper we present a dataset for 6D pose estimation that covers the above-mentioned challenges, mainly targeting training from 3D models (both textured and textureless), scalability, occlusions, and changes in light conditions and object appearance. The dataset features 33 objects (17 toy, 8 household and 8 industry-relevant objects) over 13 scenes of various difficulty. We also present a set of benchmarks to test various desired detector properties, particularly focusing on scalability with respect to the number of objects and resistance to changing light conditions, occlusions and clutter. We also set a baseline for the presented benchmarks using a state-of-the-art DPOD detector. Considering the difficulty of making such datasets, we plan to release the code allowing other researchers to extend this dataset or make their own datasets in the future.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Benchmarks ; Clutter ; Datasets ; Detectors ; Machine learning ; Object recognition ; Three dimensional models ; Training</subject><ispartof>arXiv.org, 2019-09</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2205103560?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Kaskman, Roman</creatorcontrib><creatorcontrib>Zakharov, Sergey</creatorcontrib><creatorcontrib>Shugurov, Ivan</creatorcontrib><creatorcontrib>Ilic, Slobodan</creatorcontrib><title>HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects</title><title>arXiv.org</title><description>Among the most important prerequisites for creating and evaluating 6D object pose detectors are datasets with labeled 6D poses. With the advent of deep learning, demand for such datasets is growing continuously. Despite the fact that some of exist, they are scarce and typically have restricted setups, such as a single object per sequence, or they focus on specific object types, such as textureless industrial parts. Besides, two significant components are often ignored: training using only available 3D models instead of real data and scalability, i.e. training one method to detect all objects rather than training one detector per object. Other challenges, such as occlusions, changing light conditions and changes in object appearance, as well precisely defined benchmarks are either not present or are scattered among different datasets. In this paper we present a dataset for 6D pose estimation that covers the above-mentioned challenges, mainly targeting training from 3D models (both textured and textureless), scalability, occlusions, and changes in light conditions and object appearance. The dataset features 33 objects (17 toy, 8 household and 8 industry-relevant objects) over 13 scenes of various difficulty. We also present a set of benchmarks to test various desired detector properties, particularly focusing on scalability with respect to the number of objects and resistance to changing light conditions, occlusions and clutter. We also set a baseline for the presented benchmarks using a state-of-the-art DPOD detector. Considering the difficulty of making such datasets, we plan to release the code allowing other researchers to extend this dataset or make their own datasets in the future.</description><subject>Benchmarks</subject><subject>Clutter</subject><subject>Datasets</subject><subject>Detectors</subject><subject>Machine learning</subject><subject>Object recognition</subject><subject>Three dimensional models</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNyrEKwjAQgOEgCBbtOxw4F9LEVHGSmmo3RdxLqldosT3Npfj6OvgATv_w_RMRKa3TZLNSaiZi5k5KqbK1MkZHYldSj7XHN95tvoXLMU8sWBccY4CGPGQWzsQIBYe2d6GlAagBbeFUd3gLvBDTxj0Y41_nYnkorvsyeXp6jcih6mj0w5cqpaRJpTaZ1P9dH_9ENnQ</recordid><startdate>20190930</startdate><enddate>20190930</enddate><creator>Kaskman, Roman</creator><creator>Zakharov, Sergey</creator><creator>Shugurov, Ivan</creator><creator>Ilic, Slobodan</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190930</creationdate><title>HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects</title><author>Kaskman, Roman ; Zakharov, Sergey ; Shugurov, Ivan ; Ilic, Slobodan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_22051035603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Benchmarks</topic><topic>Clutter</topic><topic>Datasets</topic><topic>Detectors</topic><topic>Machine learning</topic><topic>Object recognition</topic><topic>Three dimensional models</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Kaskman, Roman</creatorcontrib><creatorcontrib>Zakharov, Sergey</creatorcontrib><creatorcontrib>Shugurov, Ivan</creatorcontrib><creatorcontrib>Ilic, Slobodan</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Databases</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kaskman, Roman</au><au>Zakharov, Sergey</au><au>Shugurov, Ivan</au><au>Ilic, Slobodan</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects</atitle><jtitle>arXiv.org</jtitle><date>2019-09-30</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>Among the most important prerequisites for creating and evaluating 6D object pose detectors are datasets with labeled 6D poses. With the advent of deep learning, demand for such datasets is growing continuously. Despite the fact that some of exist, they are scarce and typically have restricted setups, such as a single object per sequence, or they focus on specific object types, such as textureless industrial parts. Besides, two significant components are often ignored: training using only available 3D models instead of real data and scalability, i.e. training one method to detect all objects rather than training one detector per object. Other challenges, such as occlusions, changing light conditions and changes in object appearance, as well precisely defined benchmarks are either not present or are scattered among different datasets. In this paper we present a dataset for 6D pose estimation that covers the above-mentioned challenges, mainly targeting training from 3D models (both textured and textureless), scalability, occlusions, and changes in light conditions and object appearance. The dataset features 33 objects (17 toy, 8 household and 8 industry-relevant objects) over 13 scenes of various difficulty. We also present a set of benchmarks to test various desired detector properties, particularly focusing on scalability with respect to the number of objects and resistance to changing light conditions, occlusions and clutter. We also set a baseline for the presented benchmarks using a state-of-the-art DPOD detector. Considering the difficulty of making such datasets, we plan to release the code allowing other researchers to extend this dataset or make their own datasets in the future.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2019-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_2205103560
source Publicly Available Content Database
subjects Benchmarks
Clutter
Datasets
Detectors
Machine learning
Object recognition
Three dimensional models
Training
title HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T23%3A15%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=HomebrewedDB:%20RGB-D%20Dataset%20for%206D%20Pose%20Estimation%20of%203D%20Objects&rft.jtitle=arXiv.org&rft.au=Kaskman,%20Roman&rft.date=2019-09-30&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2205103560%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_22051035603%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2205103560&rft_id=info:pmid/&rfr_iscdi=true