Loading…

Robust Data Association Against Detection Deficiency for Semantic SLAM

Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on automation science and engineering 2024-01, Vol.21 (1), p.868-880
Main Authors: Lin, Xubin, Ruan, Jiahao, Yang, Yirui, He, Li, Guan, Yisheng, Zhang, Hong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3
container_end_page 880
container_issue 1
container_start_page 868
container_title IEEE transactions on automation science and engineering
container_volume 21
creator Lin, Xubin
Ruan, Jiahao
Yang, Yirui
He, Li
Guan, Yisheng
Zhang, Hong
description Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.
doi_str_mv 10.1109/TASE.2022.3233662
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_journals_2911488626</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10011152</ieee_id><sourcerecordid>2911488626</sourcerecordid><originalsourceid>FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</originalsourceid><addsrcrecordid>eNpNkEtLAzEQx4MoWKsfQPCw4HlrHptsclz6UKEi2HoO2exEUuymJruHfnt3bQ-eZpj5P-CH0D3BM0KwetpWm-WMYkpnjDImBL1AE8K5zFkp2eW4FzznivNrdJPSDmNaSIUnaPUR6j512cJ0JqtSCtabzoc2q76Mb8cHdGD_Lgtw3npo7TFzIWYb2Ju28zbbrKu3W3TlzHeCu_Ocos_Vcjt_ydfvz6_zap1bWoguZ03DuMLYFrXjBoQiRBS4ETWUTLHamrIpQdbgnASrQArMLMe1kSUUgjTApujxlHuI4aeH1Old6GM7VGo6hBVSCioGFTmpbAwpRXD6EP3exKMmWI-49IhLj7j0GdfgeTh5PAD802NCCKfsF4AwZec</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2911488626</pqid></control><display><type>article</type><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</creator><creatorcontrib>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</creatorcontrib><description>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</description><identifier>ISSN: 1545-5955</identifier><identifier>EISSN: 1558-3783</identifier><identifier>DOI: 10.1109/TASE.2022.3233662</identifier><identifier>CODEN: ITASC7</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Boxes ; Cameras ; Collision avoidance ; Data association ; False alarms ; Feature extraction ; Image manipulation ; Image reconstruction ; Inference ; local homography ; multiple object tracking ; Object motion ; Object recognition ; Object tracking ; Occlusion ; Robustness ; semantic SLAM ; Semantics ; Simultaneous localization and mapping ; Target tracking ; Task space ; Three-dimensional displays ; Trajectories ; Trajectory</subject><ispartof>IEEE transactions on automation science and engineering, 2024-01, Vol.21 (1), p.868-880</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</cites><orcidid>0000-0002-7011-0331 ; 0000-0002-4721-0019 ; 0000-0002-4123-5530 ; 0000-0003-0261-4068 ; 0000-0002-1677-6132</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10011152$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Lin, Xubin</creatorcontrib><creatorcontrib>Ruan, Jiahao</creatorcontrib><creatorcontrib>Yang, Yirui</creatorcontrib><creatorcontrib>He, Li</creatorcontrib><creatorcontrib>Guan, Yisheng</creatorcontrib><creatorcontrib>Zhang, Hong</creatorcontrib><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><title>IEEE transactions on automation science and engineering</title><addtitle>TASE</addtitle><description>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</description><subject>Algorithms</subject><subject>Boxes</subject><subject>Cameras</subject><subject>Collision avoidance</subject><subject>Data association</subject><subject>False alarms</subject><subject>Feature extraction</subject><subject>Image manipulation</subject><subject>Image reconstruction</subject><subject>Inference</subject><subject>local homography</subject><subject>multiple object tracking</subject><subject>Object motion</subject><subject>Object recognition</subject><subject>Object tracking</subject><subject>Occlusion</subject><subject>Robustness</subject><subject>semantic SLAM</subject><subject>Semantics</subject><subject>Simultaneous localization and mapping</subject><subject>Target tracking</subject><subject>Task space</subject><subject>Three-dimensional displays</subject><subject>Trajectories</subject><subject>Trajectory</subject><issn>1545-5955</issn><issn>1558-3783</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkEtLAzEQx4MoWKsfQPCw4HlrHptsclz6UKEi2HoO2exEUuymJruHfnt3bQ-eZpj5P-CH0D3BM0KwetpWm-WMYkpnjDImBL1AE8K5zFkp2eW4FzznivNrdJPSDmNaSIUnaPUR6j512cJ0JqtSCtabzoc2q76Mb8cHdGD_Lgtw3npo7TFzIWYb2Ju28zbbrKu3W3TlzHeCu_Ocos_Vcjt_ydfvz6_zap1bWoguZ03DuMLYFrXjBoQiRBS4ETWUTLHamrIpQdbgnASrQArMLMe1kSUUgjTApujxlHuI4aeH1Old6GM7VGo6hBVSCioGFTmpbAwpRXD6EP3exKMmWI-49IhLj7j0GdfgeTh5PAD802NCCKfsF4AwZec</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Lin, Xubin</creator><creator>Ruan, Jiahao</creator><creator>Yang, Yirui</creator><creator>He, Li</creator><creator>Guan, Yisheng</creator><creator>Zhang, Hong</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-7011-0331</orcidid><orcidid>https://orcid.org/0000-0002-4721-0019</orcidid><orcidid>https://orcid.org/0000-0002-4123-5530</orcidid><orcidid>https://orcid.org/0000-0003-0261-4068</orcidid><orcidid>https://orcid.org/0000-0002-1677-6132</orcidid></search><sort><creationdate>202401</creationdate><title>Robust Data Association Against Detection Deficiency for Semantic SLAM</title><author>Lin, Xubin ; Ruan, Jiahao ; Yang, Yirui ; He, Li ; Guan, Yisheng ; Zhang, Hong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Boxes</topic><topic>Cameras</topic><topic>Collision avoidance</topic><topic>Data association</topic><topic>False alarms</topic><topic>Feature extraction</topic><topic>Image manipulation</topic><topic>Image reconstruction</topic><topic>Inference</topic><topic>local homography</topic><topic>multiple object tracking</topic><topic>Object motion</topic><topic>Object recognition</topic><topic>Object tracking</topic><topic>Occlusion</topic><topic>Robustness</topic><topic>semantic SLAM</topic><topic>Semantics</topic><topic>Simultaneous localization and mapping</topic><topic>Target tracking</topic><topic>Task space</topic><topic>Three-dimensional displays</topic><topic>Trajectories</topic><topic>Trajectory</topic><toplevel>online_resources</toplevel><creatorcontrib>Lin, Xubin</creatorcontrib><creatorcontrib>Ruan, Jiahao</creatorcontrib><creatorcontrib>Yang, Yirui</creatorcontrib><creatorcontrib>He, Li</creatorcontrib><creatorcontrib>Guan, Yisheng</creatorcontrib><creatorcontrib>Zhang, Hong</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on automation science and engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lin, Xubin</au><au>Ruan, Jiahao</au><au>Yang, Yirui</au><au>He, Li</au><au>Guan, Yisheng</au><au>Zhang, Hong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Robust Data Association Against Detection Deficiency for Semantic SLAM</atitle><jtitle>IEEE transactions on automation science and engineering</jtitle><stitle>TASE</stitle><date>2024-01</date><risdate>2024</risdate><volume>21</volume><issue>1</issue><spage>868</spage><epage>880</epage><pages>868-880</pages><issn>1545-5955</issn><eissn>1558-3783</eissn><coden>ITASC7</coden><abstract>Robust and accurate object association is essential for precise 3D object landmark inference in semantic Simultaneous Localization and Mapping (SLAM), and yet remains challenging due to the detection deficiency caused by high miss detection rate, false alarm, occlusion and limited field-of-view, etc. The 2D location of an object is a crucial complementary cue to the appearance feature, especially in the case of associating objects across frames under large viewpoint changes. However, motion model or trajectory pattern based methods struggle to infer object motion reliably with a moving camera. In this paper, by exploiting the local projective warping consistency, a local homography based 2D motion inference method is proposed to sequentially estimate the object location along with uncertainty. By integrating the deep appearance feature and semantic information, an object association method, named HOA, which is robust to detection deficiency is proposed. Experimental evaluations suggest that the proposed motion prediction method is capable of maintaining a low cumulative error over a long duration, which enhances the object association performance in both accuracy and robustness. Note to Practitioners-This work aims to consistently associate 2D detection boxes corresponding to the same 3D object across images. In tasks of landmark-based navigation, collision avoidance, grasping and manipulation, objects in the task space are commonly simplified into 3D enveloping surfaces (e.g. cuboid or ellipsoid) by using 2D object detection boxes from multiple image views, and accurate data association is a prerequisite for precise enveloping surface reconstruction. This problem remains challenging considering the imperfect object detections, the appearance similarity of objects and the unpredictable trajectory of the moving camera. This work proposes a long-term reliable 2D location prediction algorithm that is capable of handling the complex motion of the target. Along with the appearance feature extracted by a retrain-free deep learning based model, this work proposes an object association method that can simultaneously deal with multiple objects with unknown object categories under the moving camera scenario.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TASE.2022.3233662</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-7011-0331</orcidid><orcidid>https://orcid.org/0000-0002-4721-0019</orcidid><orcidid>https://orcid.org/0000-0002-4123-5530</orcidid><orcidid>https://orcid.org/0000-0003-0261-4068</orcidid><orcidid>https://orcid.org/0000-0002-1677-6132</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1545-5955
ispartof IEEE transactions on automation science and engineering, 2024-01, Vol.21 (1), p.868-880
issn 1545-5955
1558-3783
language eng
recordid cdi_proquest_journals_2911488626
source IEEE Electronic Library (IEL) Journals
subjects Algorithms
Boxes
Cameras
Collision avoidance
Data association
False alarms
Feature extraction
Image manipulation
Image reconstruction
Inference
local homography
multiple object tracking
Object motion
Object recognition
Object tracking
Occlusion
Robustness
semantic SLAM
Semantics
Simultaneous localization and mapping
Target tracking
Task space
Three-dimensional displays
Trajectories
Trajectory
title Robust Data Association Against Detection Deficiency for Semantic SLAM
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T03%3A54%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Robust%20Data%20Association%20Against%20Detection%20Deficiency%20for%20Semantic%20SLAM&rft.jtitle=IEEE%20transactions%20on%20automation%20science%20and%20engineering&rft.au=Lin,%20Xubin&rft.date=2024-01&rft.volume=21&rft.issue=1&rft.spage=868&rft.epage=880&rft.pages=868-880&rft.issn=1545-5955&rft.eissn=1558-3783&rft.coden=ITASC7&rft_id=info:doi/10.1109/TASE.2022.3233662&rft_dat=%3Cproquest_ieee_%3E2911488626%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c246t-3dd35900c4bf5ae6911640d6be7393bca7d7e8beff8ec9e8603c50ba87e461de3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2911488626&rft_id=info:pmid/&rft_ieee_id=10011152&rfr_iscdi=true