Loading…

Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm

The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achiev...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of field robotics 2024-03, Vol.41 (2), p.273-287
Main Authors:	Zhu, Jiajun, Iida, Michihisa, Chen, Sikai, Cheng, Shijing, Suguri, Masahiko, Masuda, Ryohei
Format:	Article
Language:	English
Subjects:	Acceleration Algorithms Artificial neural networks convolutional neural networks deep learning Frames per second Harvesting Machine learning Microprocessors object detection Object recognition paddy field real‐time robotic combine Robotics Semantic segmentation Semantics Sensors
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites	cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443
container_end_page	287
container_issue	2
container_start_page	273
container_title	Journal of field robotics
container_volume	41
creator	Zhu, Jiajun Iida, Michihisa Chen, Sikai Cheng, Shijing Suguri, Masahiko Masuda, Ryohei
description	The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.
doi_str_mv	10.1002/rob.22260
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920920753</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920920753</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</originalsourceid><addsrcrecordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920920753</pqid></control><display><type>article</type><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><source>Wiley</source><creator>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creator><creatorcontrib>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creatorcontrib><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><identifier>ISSN: 1556-4959</identifier><identifier>EISSN: 1556-4967</identifier><identifier>DOI: 10.1002/rob.22260</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc</publisher><subject>Acceleration ; Algorithms ; Artificial neural networks ; convolutional neural networks ; deep learning ; Frames per second ; Harvesting ; Machine learning ; Microprocessors ; object detection ; Object recognition ; paddy field ; real‐time ; robotic combine ; Robotics ; Semantic segmentation ; Semantics ; Sensors</subject><ispartof>Journal of field robotics, 2024-03, Vol.41 (2), p.273-287</ispartof><rights>2023 Wiley Periodicals LLC.</rights><rights>2024 Wiley Periodicals LLC.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</cites><orcidid>0000-0002-2814-9080</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><title>Journal of field robotics</title><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><subject>Acceleration</subject><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>convolutional neural networks</subject><subject>deep learning</subject><subject>Frames per second</subject><subject>Harvesting</subject><subject>Machine learning</subject><subject>Microprocessors</subject><subject>object detection</subject><subject>Object recognition</subject><subject>paddy field</subject><subject>real‐time</subject><subject>robotic combine</subject><subject>Robotics</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>Sensors</subject><issn>1556-4959</issn><issn>1556-4967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</recordid><startdate>202403</startdate><enddate>202403</enddate><creator>Zhu, Jiajun</creator><creator>Iida, Michihisa</creator><creator>Chen, Sikai</creator><creator>Cheng, Shijing</creator><creator>Suguri, Masahiko</creator><creator>Masuda, Ryohei</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></search><sort><creationdate>202403</creationdate><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><author>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Acceleration</topic><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>convolutional neural networks</topic><topic>deep learning</topic><topic>Frames per second</topic><topic>Harvesting</topic><topic>Machine learning</topic><topic>Microprocessors</topic><topic>object detection</topic><topic>Object recognition</topic><topic>paddy field</topic><topic>real‐time</topic><topic>robotic combine</topic><topic>Robotics</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>Sensors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of field robotics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Jiajun</au><au>Iida, Michihisa</au><au>Chen, Sikai</au><au>Cheng, Shijing</au><au>Suguri, Masahiko</au><au>Masuda, Ryohei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</atitle><jtitle>Journal of field robotics</jtitle><date>2024-03</date><risdate>2024</risdate><volume>41</volume><issue>2</issue><spage>273</spage><epage>287</epage><pages>273-287</pages><issn>1556-4959</issn><eissn>1556-4967</eissn><abstract>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/rob.22260</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1556-4959
ispartof	Journal of field robotics, 2024-03, Vol.41 (2), p.273-287
issn	1556-4959 1556-4967
language	eng
recordid	cdi_proquest_journals_2920920753
source	Wiley
subjects	Acceleration Algorithms Artificial neural networks convolutional neural networks deep learning Frames per second Harvesting Machine learning Microprocessors object detection Object recognition paddy field real‐time robotic combine Robotics Semantic segmentation Semantics Sensors
title	Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T19%3A59%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Paddy%20field%20object%20detection%20for%20robotic%20combine%20based%20on%20real%E2%80%90time%20semantic%20segmentation%20algorithm&rft.jtitle=Journal%20of%20field%20robotics&rft.au=Zhu,%20Jiajun&rft.date=2024-03&rft.volume=41&rft.issue=2&rft.spage=273&rft.epage=287&rft.pages=273-287&rft.issn=1556-4959&rft.eissn=1556-4967&rft_id=info:doi/10.1002/rob.22260&rft_dat=%3Cproquest_cross%3E2920920753%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920920753&rft_id=info:pmid/&rfr_iscdi=true