China has opened access to what it describes as the world’s largest ship detection data set built using both visible light and infrared images. The resource, known as the Dual-Modal Ship Detection (DMSD) data set, is designed to improve how machines identify vessels in complex ocean environments.
According to a study published in the Journal of Radars, the data set includes more than 2,000 paired images. Each pair captures the same ship using standard cameras and infrared sensors. In total, nearly 20,000 ship instances have been annotated, helping systems learn where vessels appear in different conditions.
The data was collected using multiple sensing technologies. These include visible-light cameras, infrared devices, radar systems, and synthetic aperture radar mounted on airborne and coastal platforms. The images were gathered in waters near Yantai, in China’s Shandong province.
A key feature of this data set is its diversity. The images reflect different conditions such as clear skies, cloudy weather, rain, fog, and low visibility. Data was also recorded at different times of the day, including daylight, evening, and nighttime.
This variety makes the data set useful for training artificial intelligence systems. It helps machines recognize ships even when visibility is poor. Earlier data sets often lacked this range or focused only on one type of imaging, which limited their effectiveness in real conditions.
Harsh ocean conditions make ship detection difficult
Detecting ships at sea is more complex than identifying objects on land. The ocean surface reflects sunlight, creating glare that can distort images. Constant wave movement also changes how objects appear, making detection less reliable.
Weather conditions add to the difficulty. Fog, rain, and cloud cover reduce visibility and affect image clarity. At night, visible-light cameras become less effective, increasing reliance on infrared systems.
Distance is another challenge. Ships are often far from sensors, so they appear small and less detailed. Background elements like waves or nearby vessels can further confuse detection systems.
Detection errors highlight real-world challenges
These challenges can lead to errors. A system may detect an object but fail to identify it correctly. This reduces the effectiveness of monitoring and tracking operations.
The difficulty of accurate detection was highlighted in February. An Iranian claim of a strike on the USS Abraham Lincoln near the Strait of Hormuz was later dismissed by Washington. The episode showed that reaching a ship’s location is not the same as accurately detecting and tracking it.
This reflects a broader issue in maritime technology. Identifying moving targets under difficult conditions remains a complex task. The DMSD data set is intended to improve how systems process visual and infrared data together.
Houthis issue warning over Red Sea use as US–Israel tensions with Iran grow
Data release highlights dual-use technology concerns
The release of this data set has drawn attention because of its dual-use nature. While it is presented as a research resource, it also has clear applications in surveillance and defense systems.
Artificial intelligence depends on large and detailed data for training. In maritime environments, accurate detection is important for monitoring activity and tracking vessels. This applies to drones and other automated systems.
Existing maritime data sets have limitations. Some include only infrared images, while others lack varied conditions. The DMSD data set addresses these gaps by combining high-resolution visible images with aligned infrared data.
This alignment allows systems to compare both formats and learn more effectively. As a result, detection accuracy can improve in low light or poor weather conditions.
The developers have indicated that more data may be added later. This could include additional sensor types such as radar and inverse synthetic aperture radar.
The release highlights the growing importance of maritime monitoring. As global activity at sea increases, reliable detection systems are becoming more important. By making this data set available, researchers now have access to a detailed resource for improving machine vision in ocean environments.




