
Robust Data Association Against Detection Deficiency for Semantic SLAM

Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc...

Bibliographic Details
Published in:	IEEE transactions on automation science and engineering 2024-01, Vol.21 (1), p.868-880
Main Authors:	Lin, Xubin; Ruan, Jiahao; Yang, Yirui; He, Li; Guan, Yisheng; Zhang, Hong
Format:	Article
Language:	English
Subjects:	Algorithms; Boxes; Cameras; Collision avoidance; Data association; False alarms; Feature extraction; Image manipulation; Image reconstruction; Inference; local homography; multiple object tracking; Object motion; Object recognition; Object tracking; Occlusion; Robustness; semantic SLAM; Semantics; Simultaneous localization and mapping; Target tracking; Task space; Three-dimensional displays; Trajectories; Trajectory

cites	cdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3
container_end_page	880
container_issue	1
container_start_page	868
container_title	IEEE transactions on automation science and engineering
container_volume	21
creator	Lin, Xubin; Ruan, Jiahao; Yang, Yirui; He, Li; Guan, Yisheng; Zhang, Hong
description	Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. 
Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.
doi_str_mv	10.1109/TASE.2022.3233662
format	article
fullrecord	<record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_journals_2911488626</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10011152</ieee_id><sourcerecordid>2911488626</sourcerecordid><originalsourceid>FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</originalsourceid><addsrcrecordid>eNpNkEtLAzEQx4MoWKsfQPCw4HlrHptsclz6UKEi2HoO2exEUuymJruHfnt3bQ-eZpj5P-CH0D3BM0KwetpWm-WMYkpnjDImBL1AE8K5zFkp2eW4FzznivNrdJPSDmNaSIUnaPUR6j512cJ0JqtSCtabzoc2q76Mb8cHdGD_Lgtw3npo7TFzIWYb2Ju28zbbrKu3W3TlzHeCu_Ocos_Vcjt_ydfvz6_zap1bWoguZ03DuMLYFrXjBoQiRBS4ETWUTLHamrIpQdbgnASrQArMLMe1kSUUgjTApujxlHuI4aeH1Old6GM7VGo6hBVSCioGFTmpbAwpRXD6EP3exKMmWI-49IhLj7j0GdfgeTh5PAD802NCCKfsF4AwZec</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2911488626</pqid></control><display><type>article</type><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</creator><creatorcontrib>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</creatorcontrib><description>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. 
In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. 
Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</description><identifier>ISSN: 1545-5955</identifier><identifier>EISSN: 1558-3783</identifier><identifier>DOI: 10.1109/TASE.2022.3233662</identifier><identifier>CODEN: ITASC7</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Boxes ; Cameras ; Collision avoidance ; Data association ; False alarms ; Feature extraction ; Image manipulation ; Image reconstruction ; Inference ; local homography ; multiple object tracking ; Object motion ; Object recognition ; Object tracking ; Occlusion ; Robustness ; semantic SLAM ; Semantics ; Simultaneous localization and mapping ; Target tracking ; Task space ; Three-dimensional displays ; Trajectories ; Trajectory</subject><ispartof>IEEE transactions on automation science and engineering, 2024-01, Vol.21 (1), p.868-880</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</cites><orcidid>0000-0002-7011-0331 ; 0000-0002-4721-0019 ; 0000-0002-4123-5530 ; 0000-0003-0261-4068 ; 0000-0002-1677-6132</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10011152$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Lin, Xubin</creatorcontrib><creatorcontrib>Ruan, Jiahao</creatorcontrib><creatorcontrib>Yang, Yirui</creatorcontrib><creatorcontrib>He, Li</creatorcontrib><creatorcontrib>Guan, Yisheng</creatorcontrib><creatorcontrib>Zhang, Hong</creatorcontrib><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><title>IEEE transactions on automation science and engineering</title><addtitle>TASE</addtitle><description>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. 
By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. 
Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</description><subject>Algorithms</subject><subject>Boxes</subject><subject>Cameras</subject><subject>Collision avoidance</subject><subject>Data association</subject><subject>False alarms</subject><subject>Feature extraction</subject><subject>Image manipulation</subject><subject>Image reconstruction</subject><subject>Inference</subject><subject>local homography</subject><subject>multiple object tracking</subject><subject>Object motion</subject><subject>Object recognition</subject><subject>Object tracking</subject><subject>Occlusion</subject><subject>Robustness</subject><subject>semantic SLAM</subject><subject>Semantics</subject><subject>Simultaneous localization and mapping</subject><subject>Target tracking</subject><subject>Task space</subject><subject>Three-dimensional displays</subject><subject>Trajectories</subject><subject>Trajectory</subject><issn>1545-5955</issn><issn>1558-3783</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkEtLAzEQx4MoWKsfQPCw4HlrHptsclz6UKEi2HoO2exEUuymJruHfnt3bQ-eZpj5P-CH0D3BM0KwetpWm-WMYkpnjDImBL1AE8K5zFkp2eW4FzznivNrdJPSDmNaSIUnaPUR6j512cJ0JqtSCtabzoc2q76Mb8cHdGD_Lgtw3npo7TFzIWYb2Ju28zbbrKu3W3TlzHeCu_Ocos_Vcjt_ydfvz6_zap1bWoguZ03DuMLYFrXjBoQiRBS4ETWUTLHamrIpQdbgnASrQArMLMe1kSUUgjTApujxlHuI4aeH1Old6GM7VGo6hBVSCioGFTmpbAwpRXD6EP3exKMmWI-49IhLj7j0GdfgeTh5PAD802NCCKfsF4AwZec</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Lin, Xubin</creator><creator>Ruan, Jiahao</creator><creator>Yang, Yirui</creator><creator>He, Li</creator><creator>Guan, Yisheng</creator><creator>Zhang, Hong</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-7011-0331</orcidid><orcidid>https://orcid.org/0000-0002-4721-0019</orcidid><orcidid>https://orcid.org/0000-0002-4123-5530</orcidid><orcidid>https://orcid.org/0000-0003-0261-4068</orcidid><orcidid>https://orcid.org/0000-0002-1677-6132</orcidid></search><sort><creationdate>202401</creationdate><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><author>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Boxes</topic><topic>Cameras</topic><topic>Collision avoidance</topic><topic>Data association</topic><topic>False alarms</topic><topic>Feature extraction</topic><topic>Image manipulation</topic><topic>Image reconstruction</topic><topic>Inference</topic><topic>local homography</topic><topic>multiple object tracking</topic><topic>Object motion</topic><topic>Object recognition</topic><topic>Object tracking</topic><topic>Occlusion</topic><topic>Robustness</topic><topic>semantic SLAM</topic><topic>Semantics</topic><topic>Simultaneous localization and mapping</topic><topic>Target tracking</topic><topic>Task space</topic><topic>Three-dimensional displays</topic><topic>Trajectories</topic><topic>Trajectory</topic><toplevel>online_resources</toplevel><creatorcontrib>Lin, Xubin</creatorcontrib><creatorcontrib>Ruan, Jiahao</creatorcontrib><creatorcontrib>Yang, Yirui</creatorcontrib><creatorcontrib>He, 
Li</creatorcontrib><creatorcontrib>Guan, Yisheng</creatorcontrib><creatorcontrib>Zhang, Hong</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on automation science and engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lin, Xubin</au><au>Ruan, Jiahao</au><au>Yang, Yirui</au><au>He, Li</au><au>Guan, Yisheng</au><au>Zhang, Hong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Robust Data Association Against Detection Deficiency for Semantic SLAM</atitle><jtitle>IEEE transactions on automation science and engineering</jtitle><stitle>TASE</stitle><date>2024-01</date><risdate>2024</risdate><volume>21</volume><issue>1</issue><spage>868</spage><epage>880</epage><pages>868-880</pages><issn>1545-5955</issn><eissn>1558-3783</eissn><coden>ITASC7</coden><abstract>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and 
limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. 
Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TASE.2022.3233662</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-7011-0331</orcidid><orcidid>https://orcid.org/0000-0002-4721-0019</orcidid><orcidid>https://orcid.org/0000-0002-4123-5530</orcidid><orcidid>https://orcid.org/0000-0003-0261-4068</orcidid><orcidid>https://orcid.org/0000-0002-1677-6132</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1545-5955
ispartof	IEEE transactions on automation science and engineering, 2024-01, Vol.21 (1), p.868-880
issn	1545-5955 1558-3783
language	eng
recordid	cdi_proquest_journals_2911488626
source	IEEE Electronic Library (IEL) Journals
subjects	Algorithms; Boxes; Cameras; Collision avoidance; Data association; False alarms; Feature extraction; Image manipulation; Image reconstruction; Inference; local homography; multiple object tracking; Object motion; Object recognition; Object tracking; Occlusion; Robustness; semantic SLAM; Semantics; Simultaneous localization and mapping; Target tracking; Task space; Three-dimensional displays; Trajectories; Trajectory
title	Robust Data Association Against Detection Deficiency for Semantic SLAM
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T03%3A54%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Robust%20Data%20Association%20Against%20Detection%20Deficiency%20for%20Semantic%20SLAM&rft.jtitle=IEEE%20transactions%20on%20automation%20science%20and%20engineering&rft.au=Lin,%20Xubin&rft.date=2024-01&rft.volume=21&rft.issue=1&rft.spage=868&rft.epage=880&rft.pages=868-880&rft.issn=1545-5955&rft.eissn=1558-3783&rft.coden=ITASC7&rft_id=info:doi/10.1109/TASE.2022.3233662&rft_dat=%3Cproquest_ieee_%3E2911488626%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2911488626&rft_id=info:pmid/&rft_ieee_id=10011152&rfr_iscdi=true