Loading…

Why ORB-SLAM is missing commonly occurring loop closures?

We analyse, for the first time, the popular loop closing module of a well known and widely used open-source visual SLAM (ORB-SLAM) pipeline. Investigating failures in the loop closure module of visual SLAM is challenging since it consists of multiple building blocks. Our meticulous investigations ha...

Full description

Saved in:
Bibliographic Details
Published in:Autonomous robots 2023-12, Vol.47 (8), p.1519-1535
Main Authors: Khaliq, Saran, Anjum, Muhammad Latif, Hussain, Wajahat, Khattak, Muhammad Uzair, Rasool, Momen
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3
cites cdi_FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3
container_end_page 1535
container_issue 8
container_start_page 1519
container_title Autonomous robots
container_volume 47
creator Khaliq, Saran
Anjum, Muhammad Latif
Hussain, Wajahat
Khattak, Muhammad Uzair
Rasool, Momen
description We analyse, for the first time, the popular loop closing module of a well known and widely used open-source visual SLAM (ORB-SLAM) pipeline. Investigating failures in the loop closure module of visual SLAM is challenging since it consists of multiple building blocks. Our meticulous investigations have revealed a few interesting findings. Contrary to reported results, ORB-SLAM frequently misses large fraction of loop closures on public (KITTI, TUM RGB-D) datasets. One common assumption is, in such scenarios, the visual place recognition (vPR) block of the loop closure module is unable to find a suitable match due to extreme conditions (dynamic scene, viewpoint/scale changes). We report that native vPR of ORB-SLAM is not the sole reason for these failures. Although recent deep vPR alternatives achieve impressive matching performance, replacing native vPR with these deep alternatives will only partially improve loop closure performance of visual SLAM. Our findings suggest that the problem lies with the subsequent relative pose estimation module between the matching pair. ORB-SLAM3 has improved the recall of the original loop closing module. However, even in ORB-SLAM3, the loop closing module is the major reason behind loop closing failures. Surprisingly, using off-the-shelf ORB and SIFT based relative pose estimators (non real-time) manages to close most of the loops missed by ORB-SLAM. This significant performance gap between the two available methods suggests that ORB-SLAM’s pipeline can be further matured by focusing on the relative pose estimators, to improve loop closure performance, rather than investing more resources on improving vPR. We also evaluate deep alternatives for relative pose estimation in the context of loop closures. Interestingly, the performance of deep relocalization methods (e.g. MapNet) is worse than classic methods even in loop closures scenarios. This finding further supports the fundamental limitation of deep relocalization methods recently diagnosed. Finally, we expose bias in well-known public dataset (KITTI) due to which these commonly occurring failures have eluded the community. We augment the KITTI dataset with detailed loop closing labels. In order to compensate for the bias in the public datasets, we provide a challenging loop closure dataset which contains challenging yet commonly occurring indoor navigation scenarios with loop closures. We hope our findings and the accompanying dataset will help the community in further
doi_str_mv 10.1007/s10514-023-10149-x
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2894579777</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2894579777</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3</originalsourceid><addsrcrecordid>eNp9kFtLxDAQhYMoWFf_gE8Fn6OT26Z5knXxBpUFL_gY2jTVLu2mJlvY_nuzVvDNp2GGc87MfAidE7gkAPIqEBCEY6AMEyBc4d0BSoiQDEtB5SFKQFGFhVDsGJ2EsAYAJQESpN4_x3T1fINf8sVT2oS0a0JoNh-pcV3nNu2YOmMG7_ej1rk-Na0Lg7fh-hQd1UUb7NlvnaG3u9vX5QPOV_ePy0WODSNqi42ag8rKCjihNZg5L8uyEFJKURUAhhteZoJyC6UlGY1nsWwuiapiY1glazZDF1Nu793XYMNWr93gN3GlppniQqoYFlV0UhnvQvC21r1vusKPmoDeI9ITIh0R6R9EehdNbDKFfv-g9X_R_7i-AX3VZ9w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2894579777</pqid></control><display><type>article</type><title>Why ORB-SLAM is missing commonly occurring loop closures?</title><source>Springer Link</source><creator>Khaliq, Saran ; Anjum, Muhammad Latif ; Hussain, Wajahat ; Khattak, Muhammad Uzair ; Rasool, Momen</creator><creatorcontrib>Khaliq, Saran ; Anjum, Muhammad Latif ; Hussain, Wajahat ; Khattak, Muhammad Uzair ; Rasool, Momen</creatorcontrib><description>We analyse, for the first time, the popular loop closing module of a well known and widely used open-source visual SLAM (ORB-SLAM) pipeline. Investigating failures in the loop closure module of visual SLAM is challenging since it consists of multiple building blocks. Our meticulous investigations have revealed a few interesting findings. Contrary to reported results, ORB-SLAM frequently misses large fraction of loop closures on public (KITTI, TUM RGB-D) datasets. One common assumption is, in such scenarios, the visual place recognition (vPR) block of the loop closure module is unable to find a suitable match due to extreme conditions (dynamic scene, viewpoint/scale changes). We report that native vPR of ORB-SLAM is not the sole reason for these failures. Although recent deep vPR alternatives achieve impressive matching performance, replacing native vPR with these deep alternatives will only partially improve loop closure performance of visual SLAM. Our findings suggest that the problem lies with the subsequent relative pose estimation module between the matching pair. ORB-SLAM3 has improved the recall of the original loop closing module. However, even in ORB-SLAM3, the loop closing module is the major reason behind loop closing failures. Surprisingly, using off-the-shelf ORB and SIFT based relative pose estimators (non real-time) manages to close most of the loops missed by ORB-SLAM. This significant performance gap between the two available methods suggests that ORB-SLAM’s pipeline can be further matured by focusing on the relative pose estimators, to improve loop closure performance, rather than investing more resources on improving vPR. We also evaluate deep alternatives for relative pose estimation in the context of loop closures. Interestingly, the performance of deep relocalization methods (e.g. MapNet) is worse than classic methods even in loop closures scenarios. This finding further supports the fundamental limitation of deep relocalization methods recently diagnosed. Finally, we expose bias in well-known public dataset (KITTI) due to which these commonly occurring failures have eluded the community. We augment the KITTI dataset with detailed loop closing labels. In order to compensate for the bias in the public datasets, we provide a challenging loop closure dataset which contains challenging yet commonly occurring indoor navigation scenarios with loop closures. We hope our findings and the accompanying dataset will help the community in further improving the popular ORB-SLAM’s pipeline.</description><identifier>ISSN: 0929-5593</identifier><identifier>EISSN: 1573-7527</identifier><identifier>DOI: 10.1007/s10514-023-10149-x</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial Intelligence ; Bias ; Closures ; Computer Imaging ; Control ; Datasets ; Engineering ; Estimators ; Indoor navigation ; Matching ; Mechatronics ; Modules ; Pattern Recognition and Graphics ; Pose estimation ; Robotics ; Robotics and Automation ; Simultaneous localization and mapping ; Vision</subject><ispartof>Autonomous robots, 2023-12, Vol.47 (8), p.1519-1535</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3</citedby><cites>FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Khaliq, Saran</creatorcontrib><creatorcontrib>Anjum, Muhammad Latif</creatorcontrib><creatorcontrib>Hussain, Wajahat</creatorcontrib><creatorcontrib>Khattak, Muhammad Uzair</creatorcontrib><creatorcontrib>Rasool, Momen</creatorcontrib><title>Why ORB-SLAM is missing commonly occurring loop closures?</title><title>Autonomous robots</title><addtitle>Auton Robot</addtitle><description>We analyse, for the first time, the popular loop closing module of a well known and widely used open-source visual SLAM (ORB-SLAM) pipeline. Investigating failures in the loop closure module of visual SLAM is challenging since it consists of multiple building blocks. Our meticulous investigations have revealed a few interesting findings. Contrary to reported results, ORB-SLAM frequently misses large fraction of loop closures on public (KITTI, TUM RGB-D) datasets. One common assumption is, in such scenarios, the visual place recognition (vPR) block of the loop closure module is unable to find a suitable match due to extreme conditions (dynamic scene, viewpoint/scale changes). We report that native vPR of ORB-SLAM is not the sole reason for these failures. Although recent deep vPR alternatives achieve impressive matching performance, replacing native vPR with these deep alternatives will only partially improve loop closure performance of visual SLAM. Our findings suggest that the problem lies with the subsequent relative pose estimation module between the matching pair. ORB-SLAM3 has improved the recall of the original loop closing module. However, even in ORB-SLAM3, the loop closing module is the major reason behind loop closing failures. Surprisingly, using off-the-shelf ORB and SIFT based relative pose estimators (non real-time) manages to close most of the loops missed by ORB-SLAM. This significant performance gap between the two available methods suggests that ORB-SLAM’s pipeline can be further matured by focusing on the relative pose estimators, to improve loop closure performance, rather than investing more resources on improving vPR. We also evaluate deep alternatives for relative pose estimation in the context of loop closures. Interestingly, the performance of deep relocalization methods (e.g. MapNet) is worse than classic methods even in loop closures scenarios. This finding further supports the fundamental limitation of deep relocalization methods recently diagnosed. Finally, we expose bias in well-known public dataset (KITTI) due to which these commonly occurring failures have eluded the community. We augment the KITTI dataset with detailed loop closing labels. In order to compensate for the bias in the public datasets, we provide a challenging loop closure dataset which contains challenging yet commonly occurring indoor navigation scenarios with loop closures. We hope our findings and the accompanying dataset will help the community in further improving the popular ORB-SLAM’s pipeline.</description><subject>Artificial Intelligence</subject><subject>Bias</subject><subject>Closures</subject><subject>Computer Imaging</subject><subject>Control</subject><subject>Datasets</subject><subject>Engineering</subject><subject>Estimators</subject><subject>Indoor navigation</subject><subject>Matching</subject><subject>Mechatronics</subject><subject>Modules</subject><subject>Pattern Recognition and Graphics</subject><subject>Pose estimation</subject><subject>Robotics</subject><subject>Robotics and Automation</subject><subject>Simultaneous localization and mapping</subject><subject>Vision</subject><issn>0929-5593</issn><issn>1573-7527</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kFtLxDAQhYMoWFf_gE8Fn6OT26Z5knXxBpUFL_gY2jTVLu2mJlvY_nuzVvDNp2GGc87MfAidE7gkAPIqEBCEY6AMEyBc4d0BSoiQDEtB5SFKQFGFhVDsGJ2EsAYAJQESpN4_x3T1fINf8sVT2oS0a0JoNh-pcV3nNu2YOmMG7_ej1rk-Na0Lg7fh-hQd1UUb7NlvnaG3u9vX5QPOV_ePy0WODSNqi42ag8rKCjihNZg5L8uyEFJKURUAhhteZoJyC6UlGY1nsWwuiapiY1glazZDF1Nu793XYMNWr93gN3GlppniQqoYFlV0UhnvQvC21r1vusKPmoDeI9ITIh0R6R9EehdNbDKFfv-g9X_R_7i-AX3VZ9w</recordid><startdate>20231201</startdate><enddate>20231201</enddate><creator>Khaliq, Saran</creator><creator>Anjum, Muhammad Latif</creator><creator>Hussain, Wajahat</creator><creator>Khattak, Muhammad Uzair</creator><creator>Rasool, Momen</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>F28</scope><scope>FR3</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>S0W</scope></search><sort><creationdate>20231201</creationdate><title>Why ORB-SLAM is missing commonly occurring loop closures?</title><author>Khaliq, Saran ; Anjum, Muhammad Latif ; Hussain, Wajahat ; Khattak, Muhammad Uzair ; Rasool, Momen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial Intelligence</topic><topic>Bias</topic><topic>Closures</topic><topic>Computer Imaging</topic><topic>Control</topic><topic>Datasets</topic><topic>Engineering</topic><topic>Estimators</topic><topic>Indoor navigation</topic><topic>Matching</topic><topic>Mechatronics</topic><topic>Modules</topic><topic>Pattern Recognition and Graphics</topic><topic>Pose estimation</topic><topic>Robotics</topic><topic>Robotics and Automation</topic><topic>Simultaneous localization and mapping</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khaliq, Saran</creatorcontrib><creatorcontrib>Anjum, Muhammad Latif</creatorcontrib><creatorcontrib>Hussain, Wajahat</creatorcontrib><creatorcontrib>Khattak, Muhammad Uzair</creatorcontrib><creatorcontrib>Rasool, Momen</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>DELNET Engineering &amp; Technology Collection</collection><jtitle>Autonomous robots</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khaliq, Saran</au><au>Anjum, Muhammad Latif</au><au>Hussain, Wajahat</au><au>Khattak, Muhammad Uzair</au><au>Rasool, Momen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Why ORB-SLAM is missing commonly occurring loop closures?</atitle><jtitle>Autonomous robots</jtitle><stitle>Auton Robot</stitle><date>2023-12-01</date><risdate>2023</risdate><volume>47</volume><issue>8</issue><spage>1519</spage><epage>1535</epage><pages>1519-1535</pages><issn>0929-5593</issn><eissn>1573-7527</eissn><abstract>We analyse, for the first time, the popular loop closing module of a well known and widely used open-source visual SLAM (ORB-SLAM) pipeline. Investigating failures in the loop closure module of visual SLAM is challenging since it consists of multiple building blocks. Our meticulous investigations have revealed a few interesting findings. Contrary to reported results, ORB-SLAM frequently misses large fraction of loop closures on public (KITTI, TUM RGB-D) datasets. One common assumption is, in such scenarios, the visual place recognition (vPR) block of the loop closure module is unable to find a suitable match due to extreme conditions (dynamic scene, viewpoint/scale changes). We report that native vPR of ORB-SLAM is not the sole reason for these failures. Although recent deep vPR alternatives achieve impressive matching performance, replacing native vPR with these deep alternatives will only partially improve loop closure performance of visual SLAM. Our findings suggest that the problem lies with the subsequent relative pose estimation module between the matching pair. ORB-SLAM3 has improved the recall of the original loop closing module. However, even in ORB-SLAM3, the loop closing module is the major reason behind loop closing failures. Surprisingly, using off-the-shelf ORB and SIFT based relative pose estimators (non real-time) manages to close most of the loops missed by ORB-SLAM. This significant performance gap between the two available methods suggests that ORB-SLAM’s pipeline can be further matured by focusing on the relative pose estimators, to improve loop closure performance, rather than investing more resources on improving vPR. We also evaluate deep alternatives for relative pose estimation in the context of loop closures. Interestingly, the performance of deep relocalization methods (e.g. MapNet) is worse than classic methods even in loop closures scenarios. This finding further supports the fundamental limitation of deep relocalization methods recently diagnosed. Finally, we expose bias in well-known public dataset (KITTI) due to which these commonly occurring failures have eluded the community. We augment the KITTI dataset with detailed loop closing labels. In order to compensate for the bias in the public datasets, we provide a challenging loop closure dataset which contains challenging yet commonly occurring indoor navigation scenarios with loop closures. We hope our findings and the accompanying dataset will help the community in further improving the popular ORB-SLAM’s pipeline.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10514-023-10149-x</doi><tpages>17</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0929-5593
ispartof Autonomous robots, 2023-12, Vol.47 (8), p.1519-1535
issn 0929-5593
1573-7527
language eng
recordid cdi_proquest_journals_2894579777
source Springer Link
subjects Artificial Intelligence
Bias
Closures
Computer Imaging
Control
Datasets
Engineering
Estimators
Indoor navigation
Matching
Mechatronics
Modules
Pattern Recognition and Graphics
Pose estimation
Robotics
Robotics and Automation
Simultaneous localization and mapping
Vision
title Why ORB-SLAM is missing commonly occurring loop closures?
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T08%3A04%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Why%20ORB-SLAM%20is%20missing%20commonly%20occurring%20loop%20closures?&rft.jtitle=Autonomous%20robots&rft.au=Khaliq,%20Saran&rft.date=2023-12-01&rft.volume=47&rft.issue=8&rft.spage=1519&rft.epage=1535&rft.pages=1519-1535&rft.issn=0929-5593&rft.eissn=1573-7527&rft_id=info:doi/10.1007/s10514-023-10149-x&rft_dat=%3Cproquest_cross%3E2894579777%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-c96098bd0412f0c64bbba57775da00c4c4b8524e0be182009386719d182c3d7f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2894579777&rft_id=info:pmid/&rfr_iscdi=true