This study addresses the challenge of accurate object detection under highly variable lighting conditions (ambient and artificial). We introduce a novel architecture, Hort-YOLO, which features a custom backbone, DeepD424v1, and a redesigned YOLOv4 head. The DeepD424v1 backbone is built on a modular, asymmetric structure that extracts discriminative, multi-scale global–local spatial features. The design fuses features from different depths to preserve feature information while improving recognition speed and accuracy. The network’s asymmetric branches, with multi-scale and parallel downsampling layers, gradually reduce the spatial size of the feature maps, extracting fine-to-coarse details with richer feature information and generating diverse contextual information in both the spatial and channel dimensions. This reduces computational complexity and enhances the representation-learning capability of the convolutional neural network (CNN). The model size is approximately 2.6 × 10² MB. The improved Spatial Pyramid Pooling Module (SPPM) of the Hort-YOLO detector can accurately locate a target object even when its pixel size is less than 5% of the input image. A comparative performance evaluation was conducted on a class-imbalanced, dynamic, and noisy horticultural dataset. Despite low-to-moderate class imbalance, Models 1, 2, and 4 achieved the highest F1 score (0.68) on the validation dataset. Compared with other object detectors, including YOLOv10 (n, s, m, l, x, b), YOLOv11 (n, s, m, l, x), YOLOv12 (n, s, m, l, x), YOLOX (medium, COCO), and both the standard and a modified YOLOv4, Hort-YOLO achieved a mAP@0.5 of 0.77 and a recall of 0.80. The study also demonstrates the efficiency of a semi-automatic annotation process, which reduces annotation time by a factor of 5 to 6; this annotation framework helps scale up supervised learning by efficiently processing large datasets. Hort-YOLO is also robust under varying lighting, occlusion, and background complexity, detecting objects at 15 to 30 frames per second (FPS) in real-world scenarios.
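The asymmetric, parallel-downsampling idea described above can be pictured with a minimal PyTorch sketch. The block below is a hypothetical illustration, not the published DeepD424v1 code: one branch downsamples with a strided 3×3 convolution, the other with max pooling followed by a 1×1 projection, and the outputs are concatenated so the spatial size halves while channel-wise context diversifies. All names and layer choices here are assumptions for illustration.

```python
import torch
import torch.nn as nn

class AsymmetricDownsample(nn.Module):
    """Hypothetical two-branch downsampling block in the spirit of
    DeepD424v1: parallel branches halve the spatial size and are
    fused by concatenation to mix fine and coarse context."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        half = out_ch // 2
        # Branch A: learned downsampling via a strided 3x3 convolution.
        self.conv_branch = nn.Sequential(
            nn.Conv2d(in_ch, half, kernel_size=3, stride=2, padding=1, bias=False),
            nn.BatchNorm2d(half),
            nn.LeakyReLU(0.1, inplace=True),
        )
        # Branch B: parameter-free max pooling, then a 1x1 projection.
        self.pool_branch = nn.Sequential(
            nn.MaxPool2d(kernel_size=2, stride=2),
            nn.Conv2d(in_ch, half, kernel_size=1, bias=False),
            nn.BatchNorm2d(half),
            nn.LeakyReLU(0.1, inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Concatenate the two halved feature maps along the channel axis.
        return torch.cat([self.conv_branch(x), self.pool_branch(x)], dim=1)

# Quick shape check: a 416x416 map halves to 208x208 with doubled channels.
x = torch.randn(1, 64, 416, 416)
print(AsymmetricDownsample(64, 128)(x).shape)  # torch.Size([1, 128, 208, 208])
```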
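The abstract credits small-object localization to an improved Spatial Pyramid Pooling Module, but its exact modifications are not specified here. For orientation, the sketch below shows the standard YOLOv4-style SPP block that such a module typically extends: parallel stride-1 max pools with 5, 9, and 13 kernels are concatenated with the identity path, so a single feature map carries receptive fields of several sizes.

```python
import torch
import torch.nn as nn

class SPP(nn.Module):
    """Standard YOLOv4-style spatial pyramid pooling; the improved
    SPPM presumably builds on this pattern (assumption)."""
    def __init__(self, in_ch: int, out_ch: int, pool_sizes=(5, 9, 13)):
        super().__init__()
        # Stride-1 max pools with 'same' padding keep the spatial size fixed.
        self.pools = nn.ModuleList(
            nn.MaxPool2d(kernel_size=k, stride=1, padding=k // 2)
            for k in pool_sizes
        )
        # A 1x1 conv fuses the identity + pooled maps back to out_ch channels.
        self.fuse = nn.Conv2d(in_ch * (len(pool_sizes) + 1), out_ch, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        feats = [x] + [pool(x) for pool in self.pools]
        return self.fuse(torch.cat(feats, dim=1))

# Multi-receptive-field features at unchanged resolution.
y = SPP(512, 512)(torch.randn(1, 512, 13, 13))
print(y.shape)  # torch.Size([1, 512, 13, 13])
```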
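The reported 5- to 6-fold annotation speed-up comes from a semi-automatic loop in which a partially trained detector pre-labels new images and annotators only correct its mistakes rather than drawing every box from scratch. The outline below is a hypothetical sketch of that workflow; the function names, `Detection` type, and confidence threshold are all illustrative assumptions, not the paper's tooling.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple

@dataclass
class Detection:
    box: Tuple[float, float, float, float]  # x1, y1, x2, y2 in pixels
    label: str
    score: float

def semi_automatic_annotation(
    images: Iterable,
    predict: Callable[[object], List[Detection]],
    review: Callable[[object, List[Detection]], List[Detection]],
    conf_threshold: float = 0.5,
):
    """Hypothetical pre-labeling loop: the detector proposes boxes and a
    human corrects them, which is where the 5-6x time saving comes from."""
    labeled = []
    for img in images:
        # Keep only confident proposals so the reviewer sees fewer false positives.
        proposals = [d for d in predict(img) if d.score >= conf_threshold]
        # 'review' stands in for an interactive correction tool.
        labeled.append((img, review(img, proposals)))
    return labeled

# Stub usage with dummy callables standing in for the detector and the reviewer.
dummy = semi_automatic_annotation(
    images=["frame_001.jpg"],
    predict=lambda img: [Detection((10, 10, 50, 50), "fruit", 0.9)],
    review=lambda img, props: props,  # accept proposals unchanged
)
print(dummy)
```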