Sharper AI eyes for maritime safety
en-GBde-DEes-ESfr-FR

Sharper AI eyes for maritime safety

27/04/2026 TranSpread

Ship detection from remote sensing images is essential for maritime transportation, naval awareness, emergency rescue, and port logistics. Optical imagery provides rich texture and structural information, but it also creates major challenges for automated detection. Ships may appear extremely small, densely packed, partly hidden, or visually confused with wakes, coastlines, clouds, and sea clutter. Conventional single-stage detectors often struggle to balance sensitivity and precision under these conditions, especially when object scales vary sharply across the same image. Based on these challenges, in-depth research is needed on more robust ship detection frameworks for optical remote sensing imagery.

Researchers from Yan’an University and Northwestern Polytechnical University reported (DOI: 10.34133/remotesensing.1038) this advance in Journal of Remote Sensing, published on March 25, 2026. Their study introduces a coarse-to-fine saliency-driven maritime ship detection network called C2FSMSDet, developed to improve detection accuracy in complex optical remote sensing scenes. The system addresses a practical problem faced by current methods: how to reliably identify ships when image backgrounds are noisy and object boundaries are weak or overlapping.

The core innovation is a two-stage design. In the coarse detection stage (CoarseDet), the model first generates a pixel-level saliency map to highlight likely ship regions while suppressing wakes, sea clutter, and coastal interference. In the fine detection stage (FineDet), those saliency cues guide instance-level segmentation so that ship boundaries can be separated more precisely. This design differs from standard single-stage detectors because it decouples rough localization from detailed delineation. On a mixed test set, the framework achieved an F1 score of 0.912 and a mean average precision at an intersection over union threshold of 0.5 (mAP0.5) of 0.953, outperforming strong baseline detectors such as YOLOv7, which reached an mAP0.5 of 0.893.

C2FSMSDet combines transformer-based global reasoning with multiscale feature extraction. In CoarseDet, a Fully Convolutional Transformer (FCT) captures long-range contextual dependencies, while a Wide Focus Block (WFB) uses parallel dilated convolutions to analyze targets at different scales. A Criss-Cross Attention Module (CCAM) further strengthens pixel-level contextual modeling across rows and columns, helping the network distinguish ships from confusing background structures. The resulting saliency map is then passed to FineDet. This second stage is built on an optimized Mask Region-Based Convolutional Neural Network (Mask R-CNN) with a Swin Transformer backbone, mosaic data augmentation, and a Context Enhancement Module (CEM). Together, these components improve boundary recovery, object separation, and robustness in dense or partially occluded scenes. The model was evaluated on three public datasets: Airbus Ship Detection, HRSC2016, and DOTA. In saliency evaluation, the CoarseDet configuration with CCAM reached the best values among FCN-based variants, including a root mean square error (RMSE) of 8.40 and mean absolute error (MAE) of 0.072.

According to the authors, the strength of the framework lies in its coarse-to-fine strategy: the first stage prioritizes recall by finding possible ship regions, while the second stage improves precision through detailed instance segmentation. This coordinated design allows the model to better separate closely packed ships and reduce confusion from surrounding maritime backgrounds.

The researchers trained and tested the system using three publicly available ship-detection datasets covering open sea, ports, nearshore scenes, and aerial views. Training included standard augmentation such as rotation, scaling, flipping, and color jittering, while the FineDet stage also used mosaic augmentation. CoarseDet was optimized with Adam, an initial learning rate of 1 × 10−4, and cosine annealing. Experiments were run with an NVIDIA GeForce RTX 3090 Graphics Processing Unit (GPU), Intel Xeon E5-2699C v4 Central Processing Unit (CPU), PyTorch 1.12.1, and CUDA 11.7.

This study points to a promising future for high-accuracy maritime monitoring based on optical remote sensing. A more reliable ship detector could support maritime traffic management, anomaly warning, port operations, and rescue response, while also contributing to sustainable ocean economic development. The coarse-to-fine design may also be adapted to other remote sensing tasks involving small, dense, or weak-boundary targets, such as vehicles, infrastructure, or environmental hazards in complex scenes.

###

References

DOI

10.34133/remotesensing.1038

Original Source URL

https://doi.org/10.34133/remotesensing.1038

Funding information

This work was supported in part by the National Natural Science Foundation of China under grants 62171381 and 62366053, the Youth Project of the Natural Science Basic Research Program of Shaanxi Province under grant 2025JC-YBQN-923, and General Project of Yan’an City Key R&D Program (Industrial Sector) no. 2025SLGYGG-004.

About Journal of Remote Sensing

The Journal of Remote Sensing, an online-only Open Access journal published in association with AIR-CAS, promotes the theory, science, and technology of remote sensing, as well as interdisciplinary research within earth and information science.

Paper title: Rethinking Coarse-to-Fine Fully Convolutional Transformers for Salient Maritime Ship Detection in Optical Remote Sensing Imagery
Fichiers joints
  • Comparative performance radar charts for different detection stages and methods.
27/04/2026 TranSpread
Regions: Asia, China, North America, United States
Keywords: Applied science, Technology

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Témoignages

We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet
AlphaGalileo is a great source of global research news. I use it regularly.
Robert Lee Hotz, LA Times

Nous travaillons en étroite collaboration avec...


  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2026 by DNN Corp Terms Of Use Privacy Statement