Loading…

A Survey of Object Goal Navigation

Object Goal Navigation (ObjectNav) refers to an agent navigating to an object in an unseen environment, which is an ability often required in the accomplishment of complex tasks. Though it has drawn increasing attention from researchers in the Embodied AI community, there has not been a contemporary...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on automation science and engineering 2024-03, p.1-17
Main Authors: Sun, Jingwen, Wu, Jing, Ji, Ze, Lai, Yu-Kun
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 17
container_issue
container_start_page 1
container_title IEEE transactions on automation science and engineering
container_volume
creator Sun, Jingwen
Wu, Jing
Ji, Ze
Lai, Yu-Kun
description Object Goal Navigation (ObjectNav) refers to an agent navigating to an object in an unseen environment, which is an ability often required in the accomplishment of complex tasks. Though it has drawn increasing attention from researchers in the Embodied AI community, there has not been a contemporary and comprehensive survey of ObjectNav. In this survey, we give an overview of this field by summarizing more than 70 recent papers. First, we give the preliminaries of the ObjectNav: the definition, the simulator, and the metrics. Then, we group the existing works into three categories: 1) end-to-end methods that directly map the observations to actions, 2) modular methods that consist of a mapping module, a policy module, and a path planning module, and 3) zero-shot methods that use zero-shot learning to do navigation. Finally, we summarize the performance of existing works and the main failure modes and discuss the challenges of ObjectNav. This survey would provide comprehensive information for researchers in this field to have a better understanding of ObjectNav. Note to Practitioners -This work was motivated by the increased interest in real-world applications of mobile robots. Object Goal Navigation (ObjectNav), which is an important task in these applications, requires an agent to find an object in an unseen environment. To accomplish that, the agent needs to be equipped with the capability to move in the environment, decide where to go, and recognize the object categories. So far, most works on ObjectNav have been done in a simulation environment. We present an overview of the existing works in ObjectNav and introduce them in three categories. Additionally, we analyze the current performance of ObjectNav and the challenges for future research. This paper provides researchers and practitioners with a comprehensive overview of the developed methods in ObjectNav, which can help them to have a good understanding of this task and develop suitable solutions for applications in the real world.
doi_str_mv 10.1109/TASE.2024.3378010
format article
fullrecord <record><control><sourceid>crossref_ieee_</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASE_2024_3378010</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10475904</ieee_id><sourcerecordid>10_1109_TASE_2024_3378010</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-1b05e933331ef9f86c854a2da1812db5e579e00bd475599927c76bc788d9f96f3</originalsourceid><addsrcrecordid>eNpNj01Lw0AQhhdRsFZ_gOAheE-c2c1kd46l1CoUe2g9L5vNrqRUI0ks9N-b0B46l5mB94NHiEeEDBH4ZTvbLDIJMs-U0gYQrsQEiUw6fOp6vHNKiYluxV3X7WBQGoaJeJ4lm7_2EI5JE5N1uQu-T5aN2ycf7lB_ub5ufu7FTXT7Ljyc91R8vi6287d0tV6-z2er1MsC-xRLoMBqGAyRoym8odzJyqFBWZUUSHMAKKtcEzGz1F4XpdfGVBy5iGoq8JTr26br2hDtb1t_u_ZoEewIaUdIO0LaM-TgeTp56hDChX7oYMjVPx34S6A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Survey of Object Goal Navigation</title><source>IEEE Xplore (Online service)</source><creator>Sun, Jingwen ; Wu, Jing ; Ji, Ze ; Lai, Yu-Kun</creator><creatorcontrib>Sun, Jingwen ; Wu, Jing ; Ji, Ze ; Lai, Yu-Kun</creatorcontrib><description>Object Goal Navigation (ObjectNav) refers to an agent navigating to an object in an unseen environment, which is an ability often required in the accomplishment of complex tasks. Though it has drawn increasing attention from researchers in the Embodied AI community, there has not been a contemporary and comprehensive survey of ObjectNav. In this survey, we give an overview of this field by summarizing more than 70 recent papers. First, we give the preliminaries of the ObjectNav: the definition, the simulator, and the metrics. Then, we group the existing works into three categories: 1) end-to-end methods that directly map the observations to actions, 2) modular methods that consist of a mapping module, a policy module, and a path planning module, and 3) zero-shot methods that use zero-shot learning to do navigation. Finally, we summarize the performance of existing works and the main failure modes and discuss the challenges of ObjectNav. This survey would provide comprehensive information for researchers in this field to have a better understanding of ObjectNav. Note to Practitioners -This work was motivated by the increased interest in real-world applications of mobile robots. Object Goal Navigation (ObjectNav), which is an important task in these applications, requires an agent to find an object in an unseen environment. To accomplish that, the agent needs to be equipped with the capability to move in the environment, decide where to go, and recognize the object categories. So far, most works on ObjectNav have been done in a simulation environment. We present an overview of the existing works in ObjectNav and introduce them in three categories. Additionally, we analyze the current performance of ObjectNav and the challenges for future research. This paper provides researchers and practitioners with a comprehensive overview of the developed methods in ObjectNav, which can help them to have a good understanding of this task and develop suitable solutions for applications in the real world.</description><identifier>ISSN: 1545-5955</identifier><identifier>EISSN: 1558-3783</identifier><identifier>DOI: 10.1109/TASE.2024.3378010</identifier><identifier>CODEN: ITASC7</identifier><language>eng</language><publisher>IEEE</publisher><subject>Artificial intelligence ; embodied AI ; Habitats ; Measurement ; Navigation ; Object goal navigation ; Robots ; semantic navigation ; Task analysis ; Three-dimensional displays</subject><ispartof>IEEE transactions on automation science and engineering, 2024-03, p.1-17</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-2094-5680 ; 0000-0002-3644-7424 ; 0000-0002-8968-9902 ; 0000-0001-5123-9861</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10475904$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,54774</link.rule.ids></links><search><creatorcontrib>Sun, Jingwen</creatorcontrib><creatorcontrib>Wu, Jing</creatorcontrib><creatorcontrib>Ji, Ze</creatorcontrib><creatorcontrib>Lai, Yu-Kun</creatorcontrib><title>A Survey of Object Goal Navigation</title><title>IEEE transactions on automation science and engineering</title><addtitle>TASE</addtitle><description>Object Goal Navigation (ObjectNav) refers to an agent navigating to an object in an unseen environment, which is an ability often required in the accomplishment of complex tasks. Though it has drawn increasing attention from researchers in the Embodied AI community, there has not been a contemporary and comprehensive survey of ObjectNav. In this survey, we give an overview of this field by summarizing more than 70 recent papers. First, we give the preliminaries of the ObjectNav: the definition, the simulator, and the metrics. Then, we group the existing works into three categories: 1) end-to-end methods that directly map the observations to actions, 2) modular methods that consist of a mapping module, a policy module, and a path planning module, and 3) zero-shot methods that use zero-shot learning to do navigation. Finally, we summarize the performance of existing works and the main failure modes and discuss the challenges of ObjectNav. This survey would provide comprehensive information for researchers in this field to have a better understanding of ObjectNav. Note to Practitioners -This work was motivated by the increased interest in real-world applications of mobile robots. Object Goal Navigation (ObjectNav), which is an important task in these applications, requires an agent to find an object in an unseen environment. To accomplish that, the agent needs to be equipped with the capability to move in the environment, decide where to go, and recognize the object categories. So far, most works on ObjectNav have been done in a simulation environment. We present an overview of the existing works in ObjectNav and introduce them in three categories. Additionally, we analyze the current performance of ObjectNav and the challenges for future research. This paper provides researchers and practitioners with a comprehensive overview of the developed methods in ObjectNav, which can help them to have a good understanding of this task and develop suitable solutions for applications in the real world.</description><subject>Artificial intelligence</subject><subject>embodied AI</subject><subject>Habitats</subject><subject>Measurement</subject><subject>Navigation</subject><subject>Object goal navigation</subject><subject>Robots</subject><subject>semantic navigation</subject><subject>Task analysis</subject><subject>Three-dimensional displays</subject><issn>1545-5955</issn><issn>1558-3783</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNj01Lw0AQhhdRsFZ_gOAheE-c2c1kd46l1CoUe2g9L5vNrqRUI0ks9N-b0B46l5mB94NHiEeEDBH4ZTvbLDIJMs-U0gYQrsQEiUw6fOp6vHNKiYluxV3X7WBQGoaJeJ4lm7_2EI5JE5N1uQu-T5aN2ycf7lB_ub5ufu7FTXT7Ljyc91R8vi6287d0tV6-z2er1MsC-xRLoMBqGAyRoym8odzJyqFBWZUUSHMAKKtcEzGz1F4XpdfGVBy5iGoq8JTr26br2hDtb1t_u_ZoEewIaUdIO0LaM-TgeTp56hDChX7oYMjVPx34S6A</recordid><startdate>20240319</startdate><enddate>20240319</enddate><creator>Sun, Jingwen</creator><creator>Wu, Jing</creator><creator>Ji, Ze</creator><creator>Lai, Yu-Kun</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-2094-5680</orcidid><orcidid>https://orcid.org/0000-0002-3644-7424</orcidid><orcidid>https://orcid.org/0000-0002-8968-9902</orcidid><orcidid>https://orcid.org/0000-0001-5123-9861</orcidid></search><sort><creationdate>20240319</creationdate><title>A Survey of Object Goal Navigation</title><author>Sun, Jingwen ; Wu, Jing ; Ji, Ze ; Lai, Yu-Kun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-1b05e933331ef9f86c854a2da1812db5e579e00bd475599927c76bc788d9f96f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial intelligence</topic><topic>embodied AI</topic><topic>Habitats</topic><topic>Measurement</topic><topic>Navigation</topic><topic>Object goal navigation</topic><topic>Robots</topic><topic>semantic navigation</topic><topic>Task analysis</topic><topic>Three-dimensional displays</topic><toplevel>online_resources</toplevel><creatorcontrib>Sun, Jingwen</creatorcontrib><creatorcontrib>Wu, Jing</creatorcontrib><creatorcontrib>Ji, Ze</creatorcontrib><creatorcontrib>Lai, Yu-Kun</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)【Remote access available】</collection><collection>CrossRef</collection><jtitle>IEEE transactions on automation science and engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sun, Jingwen</au><au>Wu, Jing</au><au>Ji, Ze</au><au>Lai, Yu-Kun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Survey of Object Goal Navigation</atitle><jtitle>IEEE transactions on automation science and engineering</jtitle><stitle>TASE</stitle><date>2024-03-19</date><risdate>2024</risdate><spage>1</spage><epage>17</epage><pages>1-17</pages><issn>1545-5955</issn><eissn>1558-3783</eissn><coden>ITASC7</coden><abstract>Object Goal Navigation (ObjectNav) refers to an agent navigating to an object in an unseen environment, which is an ability often required in the accomplishment of complex tasks. Though it has drawn increasing attention from researchers in the Embodied AI community, there has not been a contemporary and comprehensive survey of ObjectNav. In this survey, we give an overview of this field by summarizing more than 70 recent papers. First, we give the preliminaries of the ObjectNav: the definition, the simulator, and the metrics. Then, we group the existing works into three categories: 1) end-to-end methods that directly map the observations to actions, 2) modular methods that consist of a mapping module, a policy module, and a path planning module, and 3) zero-shot methods that use zero-shot learning to do navigation. Finally, we summarize the performance of existing works and the main failure modes and discuss the challenges of ObjectNav. This survey would provide comprehensive information for researchers in this field to have a better understanding of ObjectNav. Note to Practitioners -This work was motivated by the increased interest in real-world applications of mobile robots. Object Goal Navigation (ObjectNav), which is an important task in these applications, requires an agent to find an object in an unseen environment. To accomplish that, the agent needs to be equipped with the capability to move in the environment, decide where to go, and recognize the object categories. So far, most works on ObjectNav have been done in a simulation environment. We present an overview of the existing works in ObjectNav and introduce them in three categories. Additionally, we analyze the current performance of ObjectNav and the challenges for future research. This paper provides researchers and practitioners with a comprehensive overview of the developed methods in ObjectNav, which can help them to have a good understanding of this task and develop suitable solutions for applications in the real world.</abstract><pub>IEEE</pub><doi>10.1109/TASE.2024.3378010</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-2094-5680</orcidid><orcidid>https://orcid.org/0000-0002-3644-7424</orcidid><orcidid>https://orcid.org/0000-0002-8968-9902</orcidid><orcidid>https://orcid.org/0000-0001-5123-9861</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1545-5955
ispartof IEEE transactions on automation science and engineering, 2024-03, p.1-17
issn 1545-5955
1558-3783
language eng
recordid cdi_crossref_primary_10_1109_TASE_2024_3378010
source IEEE Xplore (Online service)
subjects Artificial intelligence
embodied AI
Habitats
Measurement
Navigation
Object goal navigation
Robots
semantic navigation
Task analysis
Three-dimensional displays
title A Survey of Object Goal Navigation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T09%3A43%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Survey%20of%20Object%20Goal%20Navigation&rft.jtitle=IEEE%20transactions%20on%20automation%20science%20and%20engineering&rft.au=Sun,%20Jingwen&rft.date=2024-03-19&rft.spage=1&rft.epage=17&rft.pages=1-17&rft.issn=1545-5955&rft.eissn=1558-3783&rft.coden=ITASC7&rft_id=info:doi/10.1109/TASE.2024.3378010&rft_dat=%3Ccrossref_ieee_%3E10_1109_TASE_2024_3378010%3C/crossref_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c261t-1b05e933331ef9f86c854a2da1812db5e579e00bd475599927c76bc788d9f96f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10475904&rfr_iscdi=true