Loading…

SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method

Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the ex...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on geoscience and remote sensing 2023, Vol.61, p.1-17
Main Authors:	Li, Zhenghong, Chen, Hao, Wu, Jiangjiang, Li, Jun, Jing, Ning
Format:	Article
Language:	English
Subjects:	Annotations Contrastive learning (CL) Deep learning Image enhancement Image processing Image segmentation Learning systems masked image modeling (MIM) Modelling Remote sensing remote sensing (RS) image semantic segmentation Roads Semantic segmentation Semantics semisupervised learning Training Transformers
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613
cites	cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613
container_end_page	17
container_issue
container_start_page	1
container_title	IEEE transactions on geoscience and remote sensing
container_volume	61
creator	Li, Zhenghong Chen, Hao Wu, Jiangjiang Li, Jun Jing, Ning
description	Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .
doi_str_mv	10.1109/TGRS.2023.3321041
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TGRS_2023_3321041</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10268382</ieee_id><sourcerecordid>2879377641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</originalsourceid><addsrcrecordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2879377641</pqid></control><display><type>article</type><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creator><creatorcontrib>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</creatorcontrib><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><identifier>ISSN: 0196-2892</identifier><identifier>EISSN: 1558-0644</identifier><identifier>DOI: 10.1109/TGRS.2023.3321041</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Annotations ; Contrastive learning (CL) ; Deep learning ; Image enhancement ; Image processing ; Image segmentation ; Learning systems ; masked image modeling (MIM) ; Modelling ; Remote sensing ; remote sensing (RS) image semantic segmentation ; Roads ; Semantic segmentation ; Semantics ; semisupervised learning ; Training ; Transformers</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</citedby><cites>FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</cites><orcidid>0000-0001-8111-6803 ; 0000-0002-7880-3394</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10268382$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4023,27922,27923,27924,54795</link.rule.ids></links><search><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</description><subject>Annotations</subject><subject>Contrastive learning (CL)</subject><subject>Deep learning</subject><subject>Image enhancement</subject><subject>Image processing</subject><subject>Image segmentation</subject><subject>Learning systems</subject><subject>masked image modeling (MIM)</subject><subject>Modelling</subject><subject>Remote sensing</subject><subject>remote sensing (RS) image semantic segmentation</subject><subject>Roads</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>semisupervised learning</subject><subject>Training</subject><subject>Transformers</subject><issn>0196-2892</issn><issn>1558-0644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpNkMtKw0AUhgdRsFYfQHARcJ0611zcSdFaaBDaisthOjlJp5pJnZkWXPjuJqQLV-f2f-dwfoRuCZ4QgvOH9Wy5mlBM2YQxSjAnZ2hEhMhinHB-jkaY5ElMs5xeoivvdxgTLkg6Qr8rqAtjy8doBY3xhz24o_FQRkto2gBd13pj62jeqLqvGmWD0V1SN2CDCqa10YcJ26hQ_rPDBl3RlvDVY8qW0bS1wSkfzBGiBShn-0EBYduW1-iiUl8ebk5xjN5fntfT13jxNptPnxaxpjkPMae5AMUJz3SaKF1qAL3ZUFopmhDFKU9Bi4xotkmZSLrv84wKjTFUIDqCsDG6H_buXft9AB_krj04252UNEtzlqYJ71VkUGnXeu-gkntnGuV-JMGyd1n2LsveZXlyuWPuBsYAwD89TTKWUfYH5dB6Bg</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Li, Zhenghong</creator><creator>Chen, Hao</creator><creator>Wu, Jiangjiang</creator><creator>Li, Jun</creator><creator>Jing, Ning</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>8FD</scope><scope>C1K</scope><scope>F1W</scope><scope>FR3</scope><scope>H8D</scope><scope>H96</scope><scope>KR7</scope><scope>L.G</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></search><sort><creationdate>2023</creationdate><title>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</title><author>Li, Zhenghong ; Chen, Hao ; Wu, Jiangjiang ; Li, Jun ; Jing, Ning</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Annotations</topic><topic>Contrastive learning (CL)</topic><topic>Deep learning</topic><topic>Image enhancement</topic><topic>Image processing</topic><topic>Image segmentation</topic><topic>Learning systems</topic><topic>masked image modeling (MIM)</topic><topic>Modelling</topic><topic>Remote sensing</topic><topic>remote sensing (RS) image semantic segmentation</topic><topic>Roads</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>semisupervised learning</topic><topic>Training</topic><topic>Transformers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Zhenghong</creatorcontrib><creatorcontrib>Chen, Hao</creatorcontrib><creatorcontrib>Wu, Jiangjiang</creatorcontrib><creatorcontrib>Li, Jun</creatorcontrib><creatorcontrib>Jing, Ning</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Civil Engineering Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Zhenghong</au><au>Chen, Hao</au><au>Wu, Jiangjiang</au><au>Li, Jun</au><au>Jing, Ning</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2023</date><risdate>2023</risdate><volume>61</volume><spage>1</spage><epage>17</epage><pages>1-17</pages><issn>0196-2892</issn><eissn>1558-0644</eissn><coden>IGRSD2</coden><abstract>Remote sensing (RS) image semantic segmentation has attracted much attention due to its wide applications. However, deep learning-based RS image semantic segmentation methods usually require substantial manual pixelwise annotations, which are expensive and hard to obtain in practice. Although the existing semisupervised RS semantic segmentation methods effectively reduce dependence on labeled data, they generally focus on information consistency between labeled and unlabeled images, but ignore the potential context information between different areas of the RS image. In fact, the objects contained in an RS image usually have some long-range dependence between each other, since trees are usually on both sides of a road, and the middle of two rows of houses is commonly a road. Therefore, we believe that the potential dependencies between different areas of the RS image should be beneficial to reduce the label dependence of RS semantic segmentation. Based on this point, we propose a novel semisupervised RS image semantic segmentation network named SegMind, which is based on mean-teacher (MT) architecture and adopts masked image modeling (MIM) to enhance information interactions of different areas. Moreover, contrastive learning (CL) and entropy loss are introduced to SegMind framework to further improve the linear separability and prediction confidence of the proposed model. Experiments on three datasets have demonstrated the superiority of the proposed method over the state-of-the-art methods. The code is available at https://github.com/lzh-ggs-ddu/SegMind .</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TGRS.2023.3321041</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0001-8111-6803</orcidid><orcidid>https://orcid.org/0000-0002-7880-3394</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0196-2892
ispartof	IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-17
issn	0196-2892 1558-0644
language	eng
recordid	cdi_crossref_primary_10_1109_TGRS_2023_3321041
source	IEEE Electronic Library (IEL) Journals
subjects	Annotations Contrastive learning (CL) Deep learning Image enhancement Image processing Image segmentation Learning systems masked image modeling (MIM) Modelling Remote sensing remote sensing (RS) image semantic segmentation Roads Semantic segmentation Semantics semisupervised learning Training Transformers
title	SegMind: Semisupervised Remote Sensing Image Semantic Segmentation With Masked Image Modeling and Contrastive Learning Method
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T20%3A24%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SegMind:%20Semisupervised%20Remote%20Sensing%20Image%20Semantic%20Segmentation%20With%20Masked%20Image%20Modeling%20and%20Contrastive%20Learning%20Method&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Li,%20Zhenghong&rft.date=2023&rft.volume=61&rft.spage=1&rft.epage=17&rft.pages=1-17&rft.issn=0196-2892&rft.eissn=1558-0644&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2023.3321041&rft_dat=%3Cproquest_cross%3E2879377641%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c294t-4295ea4148c76acdceecbb22fa261a4247ec581c3b73561049825c00efe5c7613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2879377641&rft_id=info:pmid/&rft_ieee_id=10268382&rfr_iscdi=true