Loading…

Learning stratified 3D reconstruction

Stratified 3 D reconstruction, or a layer-by-layer 3 D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3 D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetv...

Full description

Saved in:

Bibliographic Details
Published in:	Science China. Information sciences 2018-02, Vol.61 (2), p.220-235, Article 023101
Main Authors:	Dong, Qiulei, Shu, Mao, Cui, Hainan, Xu, Huarong, Hu, Zhanyi
Format:	Article
Language:	English
Subjects:	Computer Science Computer vision Deep learning Geometry Image reconstruction Information Systems and Communication Service Neural networks Photogrammetry Position Paper Robustness 学习方法计算机视觉 3D RANSAC 空间视觉摄影测量学几何学孤立点
Citations:	Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c343t-5875ae85270efd2dd40e85d63afec41f8a2a383098aae4edf90d4f69f9b2e0c23
cites
container_end_page	235
container_issue	2
container_start_page	220
container_title	Science China. Information sciences
container_volume	61
creator	Dong, Qiulei Shu, Mao Cui, Hainan Xu, Huarong Hu, Zhanyi
description	Stratified 3 D reconstruction, or a layer-by-layer 3 D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3 D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetview, smart3 D, oblique photogrammetry. Generally speaking, the existing computer vision methods in the literature can be roughly classified into either the geometry-based approaches for spatial vision or the learning-based approaches for object vision. Although deep learning has demonstrated tremendous success in object vision in recent years,learning 3 D scene reconstruction from multiple images is still rare, even not existent, except for those on depth learning from single images. This study is to explore the feasibility of learning the stratified 3 D reconstruction from putative point correspondences across images, and to assess whether it could also be as robust to matching outliers as the traditional geometry-based methods do. In this study, a special parsimonious neural network is designed for the learning. Our results show that it is indeed possible to learn a stratified 3 D reconstruction from noisy image point correspondences, and the learnt reconstruction results appear satisfactory although they are still not on a par with the state-of-the-arts in the structurefrom-motion community due to largely its lack of an explicit robust outlier detector such as random sample consensus（RANSAC）. To the best of our knowledge, our study is the first attempt in the literature to learn3 D scene reconstruction from multiple images. Our results also show that how to implicitly or explicitly integrate an outlier detector in learning methods is a key problem to solve in order to learn comparable3 D scene structures to those by the current geometry-based state-of-the-arts. Otherwise any significant advancement of learning 3 D structures from multiple images seems difficult, if not impossible. Besides, we even speculate that deep learning might be, in nature, not suitable for learning 3 D structure from multiple images, or more generally, for solving spatial vision problems.
doi_str_mv	10.1007/s11432-017-9234-7
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918546312</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>674674249</cqvip_id><sourcerecordid>2918546312</sourcerecordid><originalsourceid>FETCH-LOGICAL-c343t-5875ae85270efd2dd40e85d63afec41f8a2a383098aae4edf90d4f69f9b2e0c23</originalsourceid><addsrcrecordid>eNp9kEFLAzEQhYMoWGp_gLeieIwmmewmOUq1Kix4UfAW4mZSt2i2TbYH_70pW_RmGMjM8N48-Ag55-yaM6ZuMucSBGVcUSNAUnVEJlzXhnLDzXHpa1WWAG-nZJbzmpUHwITSE3LVoEuxi6t5HpIbutChn8PdPGHbx7LatUPXxzNyEtxnxtnhn5LX5f3L4pE2zw9Pi9uGtiBhoJVWlUNdCcUweOG9ZGXyNbiAreRBO-FAAzPaOZTog2FehtoE8y6QtQKm5HK8u0n9dod5sOt-l2KJtMJwXcka-F7FR1Wb-pwTBrtJ3ZdL35YzuwdiRyC2ALF7IFYVjxg9uWjjCtPf5f9MF4egjz6utsX3m1SIlhLSwA9UxG1T</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918546312</pqid></control><display><type>article</type><title>Learning stratified 3D reconstruction</title><source>Springer Link</source><creator>Dong, Qiulei ; Shu, Mao ; Cui, Hainan ; Xu, Huarong ; Hu, Zhanyi</creator><creatorcontrib>Dong, Qiulei ; Shu, Mao ; Cui, Hainan ; Xu, Huarong ; Hu, Zhanyi</creatorcontrib><description>Stratified 3 D reconstruction, or a layer-by-layer 3 D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3 D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetview, smart3 D, oblique photogrammetry. Generally speaking, the existing computer vision methods in the literature can be roughly classified into either the geometry-based approaches for spatial vision or the learning-based approaches for object vision. Although deep learning has demonstrated tremendous success in object vision in recent years,learning 3 D scene reconstruction from multiple images is still rare, even not existent, except for those on depth learning from single images. This study is to explore the feasibility of learning the stratified 3 D reconstruction from putative point correspondences across images, and to assess whether it could also be as robust to matching outliers as the traditional geometry-based methods do. In this study, a special parsimonious neural network is designed for the learning. Our results show that it is indeed possible to learn a stratified 3 D reconstruction from noisy image point correspondences, and the learnt reconstruction results appear satisfactory although they are still not on a par with the state-of-the-arts in the structurefrom-motion community due to largely its lack of an explicit robust outlier detector such as random sample consensus（RANSAC）. To the best of our knowledge, our study is the first attempt in the literature to learn3 D scene reconstruction from multiple images. Our results also show that how to implicitly or explicitly integrate an outlier detector in learning methods is a key problem to solve in order to learn comparable3 D scene structures to those by the current geometry-based state-of-the-arts. Otherwise any significant advancement of learning 3 D structures from multiple images seems difficult, if not impossible. Besides, we even speculate that deep learning might be, in nature, not suitable for learning 3 D structure from multiple images, or more generally, for solving spatial vision problems.</description><identifier>ISSN: 1674-733X</identifier><identifier>EISSN: 1869-1919</identifier><identifier>DOI: 10.1007/s11432-017-9234-7</identifier><language>eng</language><publisher>Beijing: Science China Press</publisher><subject>Computer Science ; Computer vision ; Deep learning ; Geometry ; Image reconstruction ; Information Systems and Communication Service ; Neural networks ; Photogrammetry ; Position Paper ; Robustness ; 学习方法;计算机视觉;3D;RANSAC;空间视觉;摄影测量学;几何学;孤立点</subject><ispartof>Science China. Information sciences, 2018-02, Vol.61 (2), p.220-235, Article 023101</ispartof><rights>Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature 2017</rights><rights>Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature 2017.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c343t-5875ae85270efd2dd40e85d63afec41f8a2a383098aae4edf90d4f69f9b2e0c23</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84009A/84009A.jpg</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Dong, Qiulei</creatorcontrib><creatorcontrib>Shu, Mao</creatorcontrib><creatorcontrib>Cui, Hainan</creatorcontrib><creatorcontrib>Xu, Huarong</creatorcontrib><creatorcontrib>Hu, Zhanyi</creatorcontrib><title>Learning stratified 3D reconstruction</title><title>Science China. Information sciences</title><addtitle>Sci. China Inf. Sci</addtitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><description>Stratified 3 D reconstruction, or a layer-by-layer 3 D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3 D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetview, smart3 D, oblique photogrammetry. Generally speaking, the existing computer vision methods in the literature can be roughly classified into either the geometry-based approaches for spatial vision or the learning-based approaches for object vision. Although deep learning has demonstrated tremendous success in object vision in recent years,learning 3 D scene reconstruction from multiple images is still rare, even not existent, except for those on depth learning from single images. This study is to explore the feasibility of learning the stratified 3 D reconstruction from putative point correspondences across images, and to assess whether it could also be as robust to matching outliers as the traditional geometry-based methods do. In this study, a special parsimonious neural network is designed for the learning. Our results show that it is indeed possible to learn a stratified 3 D reconstruction from noisy image point correspondences, and the learnt reconstruction results appear satisfactory although they are still not on a par with the state-of-the-arts in the structurefrom-motion community due to largely its lack of an explicit robust outlier detector such as random sample consensus（RANSAC）. To the best of our knowledge, our study is the first attempt in the literature to learn3 D scene reconstruction from multiple images. Our results also show that how to implicitly or explicitly integrate an outlier detector in learning methods is a key problem to solve in order to learn comparable3 D scene structures to those by the current geometry-based state-of-the-arts. Otherwise any significant advancement of learning 3 D structures from multiple images seems difficult, if not impossible. Besides, we even speculate that deep learning might be, in nature, not suitable for learning 3 D structure from multiple images, or more generally, for solving spatial vision problems.</description><subject>Computer Science</subject><subject>Computer vision</subject><subject>Deep learning</subject><subject>Geometry</subject><subject>Image reconstruction</subject><subject>Information Systems and Communication Service</subject><subject>Neural networks</subject><subject>Photogrammetry</subject><subject>Position Paper</subject><subject>Robustness</subject><subject>学习方法;计算机视觉;3D;RANSAC;空间视觉;摄影测量学;几何学;孤立点</subject><issn>1674-733X</issn><issn>1869-1919</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kEFLAzEQhYMoWGp_gLeieIwmmewmOUq1Kix4UfAW4mZSt2i2TbYH_70pW_RmGMjM8N48-Ag55-yaM6ZuMucSBGVcUSNAUnVEJlzXhnLDzXHpa1WWAG-nZJbzmpUHwITSE3LVoEuxi6t5HpIbutChn8PdPGHbx7LatUPXxzNyEtxnxtnhn5LX5f3L4pE2zw9Pi9uGtiBhoJVWlUNdCcUweOG9ZGXyNbiAreRBO-FAAzPaOZTog2FehtoE8y6QtQKm5HK8u0n9dod5sOt-l2KJtMJwXcka-F7FR1Wb-pwTBrtJ3ZdL35YzuwdiRyC2ALF7IFYVjxg9uWjjCtPf5f9MF4egjz6utsX3m1SIlhLSwA9UxG1T</recordid><startdate>20180201</startdate><enddate>20180201</enddate><creator>Dong, Qiulei</creator><creator>Shu, Mao</creator><creator>Cui, Hainan</creator><creator>Xu, Huarong</creator><creator>Hu, Zhanyi</creator><general>Science China Press</general><general>Springer Nature B.V</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20180201</creationdate><title>Learning stratified 3D reconstruction</title><author>Dong, Qiulei ; Shu, Mao ; Cui, Hainan ; Xu, Huarong ; Hu, Zhanyi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c343t-5875ae85270efd2dd40e85d63afec41f8a2a383098aae4edf90d4f69f9b2e0c23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science</topic><topic>Computer vision</topic><topic>Deep learning</topic><topic>Geometry</topic><topic>Image reconstruction</topic><topic>Information Systems and Communication Service</topic><topic>Neural networks</topic><topic>Photogrammetry</topic><topic>Position Paper</topic><topic>Robustness</topic><topic>学习方法;计算机视觉;3D;RANSAC;空间视觉;摄影测量学;几何学;孤立点</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dong, Qiulei</creatorcontrib><creatorcontrib>Shu, Mao</creatorcontrib><creatorcontrib>Cui, Hainan</creatorcontrib><creatorcontrib>Xu, Huarong</creatorcontrib><creatorcontrib>Hu, Zhanyi</creatorcontrib><collection>维普_期刊</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>维普中文期刊数据库</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer science database</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Science China. Information sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dong, Qiulei</au><au>Shu, Mao</au><au>Cui, Hainan</au><au>Xu, Huarong</au><au>Hu, Zhanyi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Learning stratified 3D reconstruction</atitle><jtitle>Science China. Information sciences</jtitle><stitle>Sci. China Inf. Sci</stitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><date>2018-02-01</date><risdate>2018</risdate><volume>61</volume><issue>2</issue><spage>220</spage><epage>235</epage><pages>220-235</pages><artnum>023101</artnum><issn>1674-733X</issn><eissn>1869-1919</eissn><abstract>Stratified 3 D reconstruction, or a layer-by-layer 3 D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3 D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetview, smart3 D, oblique photogrammetry. Generally speaking, the existing computer vision methods in the literature can be roughly classified into either the geometry-based approaches for spatial vision or the learning-based approaches for object vision. Although deep learning has demonstrated tremendous success in object vision in recent years,learning 3 D scene reconstruction from multiple images is still rare, even not existent, except for those on depth learning from single images. This study is to explore the feasibility of learning the stratified 3 D reconstruction from putative point correspondences across images, and to assess whether it could also be as robust to matching outliers as the traditional geometry-based methods do. In this study, a special parsimonious neural network is designed for the learning. Our results show that it is indeed possible to learn a stratified 3 D reconstruction from noisy image point correspondences, and the learnt reconstruction results appear satisfactory although they are still not on a par with the state-of-the-arts in the structurefrom-motion community due to largely its lack of an explicit robust outlier detector such as random sample consensus（RANSAC）. To the best of our knowledge, our study is the first attempt in the literature to learn3 D scene reconstruction from multiple images. Our results also show that how to implicitly or explicitly integrate an outlier detector in learning methods is a key problem to solve in order to learn comparable3 D scene structures to those by the current geometry-based state-of-the-arts. Otherwise any significant advancement of learning 3 D structures from multiple images seems difficult, if not impossible. Besides, we even speculate that deep learning might be, in nature, not suitable for learning 3 D structure from multiple images, or more generally, for solving spatial vision problems.</abstract><cop>Beijing</cop><pub>Science China Press</pub><doi>10.1007/s11432-017-9234-7</doi><tpages>16</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1674-733X
ispartof	Science China. Information sciences, 2018-02, Vol.61 (2), p.220-235, Article 023101
issn	1674-733X 1869-1919
language	eng
recordid	cdi_proquest_journals_2918546312
source	Springer Link
subjects	Computer Science Computer vision Deep learning Geometry Image reconstruction Information Systems and Communication Service Neural networks Photogrammetry Position Paper Robustness 学习方法计算机视觉 3D RANSAC 空间视觉摄影测量学几何学孤立点
title	Learning stratified 3D reconstruction
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T13%3A23%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Learning%20stratified%203D%20reconstruction&rft.jtitle=Science%20China.%20Information%20sciences&rft.au=Dong,%20Qiulei&rft.date=2018-02-01&rft.volume=61&rft.issue=2&rft.spage=220&rft.epage=235&rft.pages=220-235&rft.artnum=023101&rft.issn=1674-733X&rft.eissn=1869-1919&rft_id=info:doi/10.1007/s11432-017-9234-7&rft_dat=%3Cproquest_cross%3E2918546312%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c343t-5875ae85270efd2dd40e85d63afec41f8a2a383098aae4edf90d4f69f9b2e0c23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2918546312&rft_id=info:pmid/&rft_cqvip_id=674674249&rfr_iscdi=true