Loading…

SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method

Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the ex...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on geoscience and remote sensing 2023, Vol.61, p.1-17
Main Authors: Li, Zhenghong, Chen, Hao, Wu, Jiangjiang, Li, Jun, Jing, Ning
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613
cites cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613
container_end_page 17
container_issue
container_start_page 1
container_title IEEE transactions on geoscience and remote sensing
container_volume 61
creator Li, Zhenghong
Chen, Hao
Wu, Jiangjiang
Li, Jun
Jing, Ning
description Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .
doi_str_mv 10.1109/TGRS.2023.3321041
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TGRS_2023_3321041</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10268382</ieee_id><sourcerecordid>2879377641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</originalsourceid><addsrcrecordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2879377641</pqid></control><display><type>article</type><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creator><creatorcontrib>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creatorcontrib><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><identifier>ISSN: 0196-2892</identifier><identifier>EISSN: 1558-0644</identifier><identifier>DOI: 10.1109/TGRS.2023.3321041</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Annotations ; Contrastive learning (CL) ; Deep learning ; Image enhancement ; Image processing ; Image segmentation ; Learning systems ; masked image modeling (MIM) ; Modelling ; Remote sensing ; remote sensing (RS) image semantic segmentation ; Roads ; Semantic segmentation ; Semantics ; semisupervised learning ; Training ; Transformers</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</citedby><cites>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</cites><orcidid>0000-0001-8111-6803 ; 0000-0002-7880-3394</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10268382$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4023,27922,27923,27924,54795</link.rule.ids></links><search><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><subject>Annotations</subject><subject>Contrastive learning (CL)</subject><subject>Deep learning</subject><subject>Image enhancement</subject><subject>Image processing</subject><subject>Image segmentation</subject><subject>Learning systems</subject><subject>masked image modeling (MIM)</subject><subject>Modelling</subject><subject>Remote sensing</subject><subject>remote sensing (RS) image semantic segmentation</subject><subject>Roads</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>semisupervised learning</subject><subject>Training</subject><subject>Transformers</subject><issn>0196-2892</issn><issn>1558-0644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Li, Zhenghong</creator><creator>Chen, Hao</creator><creator>Wu, Jiangjiang</creator><creator>Li, Jun</creator><creator>Jing, Ning</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>8FD</scope><scope>C1K</scope><scope>F1W</scope><scope>FR3</scope><scope>H8D</scope><scope>H96</scope><scope>KR7</scope><scope>L.G</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></search><sort><creationdate>2023</creationdate><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><author>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Annotations</topic><topic>Contrastive learning (CL)</topic><topic>Deep learning</topic><topic>Image enhancement</topic><topic>Image processing</topic><topic>Image segmentation</topic><topic>Learning systems</topic><topic>masked image modeling (MIM)</topic><topic>Modelling</topic><topic>Remote sensing</topic><topic>remote sensing (RS) image semantic segmentation</topic><topic>Roads</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>semisupervised learning</topic><topic>Training</topic><topic>Transformers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Civil Engineering Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Zhenghong</au><au>Chen, Hao</au><au>Wu, Jiangjiang</au><au>Li, Jun</au><au>Jing, Ning</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2023</date><risdate>2023</risdate><volume>61</volume><spage>1</spage><epage>17</epage><pages>1-17</pages><issn>0196-2892</issn><eissn>1558-0644</eissn><coden>IGRSD2</coden><abstract>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TGRS.2023.3321041</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0196-2892
ispartof IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17
issn 0196-2892
1558-0644
language eng
recordid cdi_crossref_primary_10_1109_TGRS_2023_3321041
source IEEE Electronic Library (IEL) Journals
subjects Annotations
Contrastive learning (CL)
Deep learning
Image enhancement
Image processing
Image segmentation
Learning systems
masked image modeling (MIM)
Modelling
Remote sensing
remote sensing (RS) image semantic segmentation
Roads
Semantic segmentation
Semantics
semisupervised learning
Training
Transformers
title SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T20%3A24%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SegMind:%20Semisupervised%20Remote%20Sensing%20Image%20Semantic%20Segmentation%20With%20Masked%20Image%20Modeling%20and%20Contrastive%20Learning%20Method&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Li,%20Zhenghong&rft.date=2023&rft.volume=61&rft.spage=1&rft.epage=17&rft.pages=1-17&rft.issn=0196-2892&rft.eissn=1558-0644&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2023.3321041&rft_dat=%3Cproquest_cross%3E2879377641%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2879377641&rft_id=info:pmid/&rft_ieee_id=10268382&rfr_iscdi=true