Loading…
Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm
The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achiev...
Saved in:
Published in: | Journal of field robotics 2024-03, Vol.41 (2), p.273-287 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443 |
container_end_page | 287 |
container_issue | 2 |
container_start_page | 273 |
container_title | Journal of field robotics |
container_volume | 41 |
creator | Zhu, Jiajun Iida, Michihisa Chen, Sikai Cheng, Shijing Suguri, Masahiko Masuda, Ryohei |
description | The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine. |
doi_str_mv | 10.1002/rob.22260 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2920920753</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920920753</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</originalsourceid><addsrcrecordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920920753</pqid></control><display><type>article</type><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><source>Wiley</source><creator>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creator><creatorcontrib>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</creatorcontrib><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><identifier>ISSN: 1556-4959</identifier><identifier>EISSN: 1556-4967</identifier><identifier>DOI: 10.1002/rob.22260</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc</publisher><subject>Acceleration ; Algorithms ; Artificial neural networks ; convolutional neural networks ; deep learning ; Frames per second ; Harvesting ; Machine learning ; Microprocessors ; object detection ; Object recognition ; paddy field ; real‐time ; robotic combine ; Robotics ; Semantic segmentation ; Semantics ; Sensors</subject><ispartof>Journal of field robotics, 2024-03, Vol.41 (2), p.273-287</ispartof><rights>2023 Wiley Periodicals LLC.</rights><rights>2024 Wiley Periodicals LLC.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</cites><orcidid>0000-0002-2814-9080</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><title>Journal of field robotics</title><description>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</description><subject>Acceleration</subject><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>convolutional neural networks</subject><subject>deep learning</subject><subject>Frames per second</subject><subject>Harvesting</subject><subject>Machine learning</subject><subject>Microprocessors</subject><subject>object detection</subject><subject>Object recognition</subject><subject>paddy field</subject><subject>real‐time</subject><subject>robotic combine</subject><subject>Robotics</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>Sensors</subject><issn>1556-4959</issn><issn>1556-4967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp1kM1KAzEUhYMoWKsL3yDgysW0mWQy6Sy1-AeFiug65OdOTZmZ1GSKdOcj-Iw-iakj7oQL5y6-cy73IHSek0lOCJ0GryeU0pIcoFHOeZkVVSkO_3ZeHaOTGNeEFGxW8RGCR2XtDtcOGou9XoPpsYU-ifMdrn3AKdH3zmDjW-06wFpFSGiHA6jm6-Ozdy3gCK3q9lSEVQtdr37sqln54PrX9hQd1aqJcParY_Rye_M8v88Wy7uH-dUiM5QLkjFtuAKwgtq6MpZXQmue20JQM7OMcDtTalZAbTgxomSalVobIyrCrDZFUbAxuhhyN8G_bSH2cu23oUsnJa0oSSM4S9TlQJngYwxQy01wrQo7mRO5b1Gmn-VPi4mdDuy7a2D3PyiflteD4xtow3ah</recordid><startdate>202403</startdate><enddate>202403</enddate><creator>Zhu, Jiajun</creator><creator>Iida, Michihisa</creator><creator>Chen, Sikai</creator><creator>Cheng, Shijing</creator><creator>Suguri, Masahiko</creator><creator>Masuda, Ryohei</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></search><sort><creationdate>202403</creationdate><title>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</title><author>Zhu, Jiajun ; Iida, Michihisa ; Chen, Sikai ; Cheng, Shijing ; Suguri, Masahiko ; Masuda, Ryohei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Acceleration</topic><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>convolutional neural networks</topic><topic>deep learning</topic><topic>Frames per second</topic><topic>Harvesting</topic><topic>Machine learning</topic><topic>Microprocessors</topic><topic>object detection</topic><topic>Object recognition</topic><topic>paddy field</topic><topic>real‐time</topic><topic>robotic combine</topic><topic>Robotics</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>Sensors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Jiajun</creatorcontrib><creatorcontrib>Iida, Michihisa</creatorcontrib><creatorcontrib>Chen, Sikai</creatorcontrib><creatorcontrib>Cheng, Shijing</creatorcontrib><creatorcontrib>Suguri, Masahiko</creatorcontrib><creatorcontrib>Masuda, Ryohei</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of field robotics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Jiajun</au><au>Iida, Michihisa</au><au>Chen, Sikai</au><au>Cheng, Shijing</au><au>Suguri, Masahiko</au><au>Masuda, Ryohei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm</atitle><jtitle>Journal of field robotics</jtitle><date>2024-03</date><risdate>2024</risdate><volume>41</volume><issue>2</issue><spage>273</spage><epage>287</epage><pages>273-287</pages><issn>1556-4959</issn><eissn>1556-4967</eissn><abstract>The development of robotic combine for rice harvesting has garnered worldwide attention in recent years. The robotic combine is capable of running along a designated path; however, it still requires human operator supervision due to the lack of object detection sensors for safety purposes. To achieve a fully unmanned robotic combine, a real‐time paddy field object detection method is necessary. Typically, all paddy field objects are detected individually using multiple algorithms and sensors, which significantly increases the complexity and cost of the detection process. In this study, the deep learning (DL) based semantic segmentation (SS) method was employed to detect all paddy field objects simultaneously using only an RGB camera. Considering the environment of the paddy field, a new SS model called “The Robotic Combine Network (TRCNet)” was specifically designed for the robotic combine. And four state‐of‐the‐art lightweight convolutional neural networks were applied as the backbones of the TRCNet. To achieve real‐time detection, TensorRT (NVIDIA) was utilized for speeding up the prediction process. All models were trained and evaluated using paddy field images captured during the robotic combine's harvesting process. The results showed that the TRCNet can successfully detect all paddy field objects. The mean intersection over union, and frames per second (FPS) of the best two SS models were 0.823, 47.48, and 0.834, 32.44, respectively. The FPS values were obtained after speed acceleration and tested with an image size of 640 × 480 pixels on an embedded processor (Jetson TX2), enabling real‐time object detection in paddy fields for the robotic combine.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/rob.22260</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-2814-9080</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1556-4959 |
ispartof | Journal of field robotics, 2024-03, Vol.41 (2), p.273-287 |
issn | 1556-4959 1556-4967 |
language | eng |
recordid | cdi_proquest_journals_2920920753 |
source | Wiley |
subjects | Acceleration Algorithms Artificial neural networks convolutional neural networks deep learning Frames per second Harvesting Machine learning Microprocessors object detection Object recognition paddy field real‐time robotic combine Robotics Semantic segmentation Semantics Sensors |
title | Paddy field object detection for robotic combine based on real‐time semantic segmentation algorithm |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T19%3A59%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Paddy%20field%20object%20detection%20for%20robotic%20combine%20based%20on%20real%E2%80%90time%20semantic%20segmentation%20algorithm&rft.jtitle=Journal%20of%20field%20robotics&rft.au=Zhu,%20Jiajun&rft.date=2024-03&rft.volume=41&rft.issue=2&rft.spage=273&rft.epage=287&rft.pages=273-287&rft.issn=1556-4959&rft.eissn=1556-4967&rft_id=info:doi/10.1002/rob.22260&rft_dat=%3Cproquest_cross%3E2920920753%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2570-3bc5aeed72df9cd597bb51d472c8d305d8aa84efc50c763b36bbcc7903dbc4443%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2920920753&rft_id=info:pmid/&rfr_iscdi=true |