ArticlePDF Available

Research on wave measurement and simulation experiments of binocular stereo vision based on intelligent feature matching

Frontiers
Frontiers in Marine Science
Authors:
  • Qilu University of Technology (Shandong Academy of Sciences)

Abstract and Figures

Waves are crucial in ocean observation and research. Stereo vision-based wave measurement, offering non-contact, low-cost, and intelligent processing, is an emerging method. However, improving accuracy remains a challenge due to wave complexity. This paper presents a novel approach to measure wave height, period, and direction by combining deep learning-based stereo matching with feature matching techniques. To improve the discontinuity and low accuracy in disparity maps from traditional wave image matching algorithms, this paper proposes the use of a high-precision stereo matching method based on Pyramid Stereo Matching Network (PSM-Net).A 3D reconstruction method integrating Scale-Invariant Feature Transform (SIFT) with stereo matching was also introduced to overcome the limitations of template matching and interleaved spectrum methods, which only provide 2D data and fail to capture the full 3D motion of waves. This approach enables accurate wave direction measurement. Additionally, a six-degree-of-freedom platform was proposed to simulate waves, addressing the high costs and attenuation issues of traditional wave tank simulations. Experimental results show the prototype system achieves a wave height accuracy within 5%, period accuracy within 4%, and direction accuracy of ±2°, proving the method’s effectiveness and offering a new approach to stereo vision-based wave measurement.
This content is subject to copyright.
Research on wave measurement
and simulation experiments of
binocular stereo vision based on
intelligent feature matching
Junjie Wu
1,2
, Shizhe Chen
1,2,3
*, Shixuan Liu
1,2,3
,
Miaomiao Song
1,2,3
, Bo Wang
1,2,3
, Qingyang Zhang
1,2
,
Yushang Wu
1,2,3
, Zhuo Lei
1,2,3
, Jiming Zhang
1,2,3
,
Xingkui Yan
1,2,3
and Bin Miao
1,2,3
1
Institute of Oceanographic Instrumentation, Qilu University of Technology (Shandong Academy of
Sciences), Qingdao, China,
2
School of Ocean Technology Sciences, Qilu University of Technology
(Shandong Academy of Sciences), Qingdao, China,
3
Laoshan Laboratory, Qingdao, China
Waves are crucial in ocean observation and research. Stereo vision-based wave
measurement, offering non-contact, low-cost, and intelligent processing, is an
emerging method. However, improving accuracy remains a challenge due to
wave complexity. This paper presents a novel approach to measure wave height,
period, and direction by combining deep learning-based stereo matching with
feature matching techniques. To improve the discontinuity and low accuracy in
disparity maps from traditional wave image matching algorithms, this paper
proposes the use of a high-precision stereo matching method based on
Pyramid Stereo Matching Network (PSM-Net).A 3D reconstruction method
integrating Scale-Invariant Feature Transform (SIFT) with stereo matching was
also introduced to overcome the limitations of template matching and
interleaved spectrum methods, which only provide 2D data and fail to capture
the full 3D motion of waves. This approach enables accurate wave direction
measurement. Additionally, a six-degree-of-freedom platform was proposed to
simulate waves, addressing the high costs and attenuation issues of traditional
wave tank simulations. Experimental results show the prototype system achieves
a wave height accuracy within 5%, period accuracy within 4%, and direction
accuracy of ±2°, proving the methods effectiveness and offering a new approach
to stereo vision-based wave measurement.
KEYWORDS
stereo vision, deep learning, stereo matching, feature matching, wave parameter
measurement, wave height, wave period, wave direction
Frontiers in Marine Science frontiersin.org01
OPEN ACCESS
EDITED BY
Xi Zhang,
Ministry of Natural Resources, China
REVIEWED BY
Shaowei Zhang,
Chinese Academy of Sciences (CAS), China
Gui Gao,
Southwest Jiaotong University, China
Ma Xin,
Ocean University of China, China
*CORRESPONDENCE
Shizhe Chen
chensz@qlu.edu.cn
RECEIVED 09 October 2024
ACCEPTED 09 December 2024
PUBLISHED 23 December 2024
CITATION
Wu J, Chen S, Liu S, Song M, Wang B,
Zhang Q, Wu Y, Lei Z, Zhang J, Yan X and
Miao B (2024) Research on wave
measurement and simulation experiments of
binocular stereo vision based on intelligent
feature matching.
Front. Mar. Sci. 11:1508233.
doi: 10.3389/fmars.2024.1508233
COPYRIGHT
©2024Wu,Chen,Liu,Song,Wang,Zhang,
Wu,Lei,Zhang,YanandMiao.Thisisanopen-
access article distributed under the terms o f
the Creative Commons Attribution License
(CC BY). The use, distribution or reproduction
in other forums is permitted, provided the
original author(s) and the copyright owner(s)
are credited and that the original publication
in this journal is cited, in accordance with
accepted academic practice. No use,
distribution or reproduction is permitted
which does not comply with these terms.
TYPE Original Research
PUBLISHED 23 December 2024
DOI 10.3389/fmars.2024.1508233
1 Introduction
With the advancement of marine scientic research and the
increase in ocean development activities, the accurate measurement
and monitoring of ocean waves have become increasingly
important (Zhang et al., 2017;Chowdhury et al., 2021). Wave
measurement techniques can be categorized into contact and non-
contact methods. Contact measurement techniques (Li et al., 2012;
Lin and Yang, 2020) rely on sensors that make direct contact with
the sea surface to detect wave uctuations. This technique high
measurement accuracy and provides reliable wave data, making it
widely used in wave measurement. However, such instruments are
limited to collecting localized, single-point data, which prevents
comprehensive analysis of the wave eld. To minimize interference
with the wave surface, non-contact measurement technologies, such
as acoustic wave gauges (Yang et al., 2018), radar (Zhang et al.,
2022;Zhang et al., 2024), satellite remote sensing (Gao et al., 2023b;
Zhang et al., 2023), and stereo photogrammetry (Benetazzo, 2006;
Nie, 2020), have emerged. These methods avoid the risk of
equipment damage caused by contact with seawater. However,
acoustic wave gauges can have their measurement accuracy
compromised by water splashes; radar systems are expensive,
difcult to install, retrieve, and maintain, and the interpretation of
the data they collect is complex; satellite remote sensing offers low
measurement accuracy, poor data reliability, is highly susceptible to
weather conditions, and comes with high costs.
The basic working principle of stereo photogrammetry for wave
measurement involves continuously capturing images of the sea
surface using cameras, acquiring wave image data, and then
processing these images with a computer to calculate the
fundamental wave parameters. Compared to radar and satellite
remote sensing technologies, stereo photogrammetry has the
advantage of high-precision observation using only cameras, making
it relatively low-cost, with simple and convenient installation and
maintenance. This technology is already widely applied in other
systems (Fan et al., 2020;Cao et al., 2022;He et al., 2022;Sun et al.,
2022;Tani et al., 2023). The earliest efforts to develop a remote sensing
system for wavemeasurement using stereo photogrammetry date back
to the late 1980s, initiated by Shemding (Shemdin and Wu, 1988;
Shemdin and Tran, 1992) and Banner et al (Banner et al., 1989). In
recent years, the rapid advancements in computer and image
processing technologies have led to the emergence of numerous
methods for calculating wave parameters. In 2017, Yan L (Yan,
2017). applied template matching technology to measure wave
height, enabling tsunami prediction. That same year, Bergamasco
introduced WASS (Waves Acquisition Stereo System) (Bergamasco
et al., 2017), an open-source software package for marine 3D
reconstruction. WASS uses cutting-edge stereo technology to
automate the generation of dense point clouds from stereo images,
facilitating the reconstruction of the oceans three-dimensional shape.
In 2019, Shi L (Shi et al., 2019). proposed a stereo vision method for
obtaining ocean wave parameters, employing the Scale-Invariant
Feature Transform (SIFT) (Lowe, 2004)stereomatchingalgorithm
along with template matching techniques to calculate wave
parameters. The nal results indicated a maximum measurement
error of 10% for wave height, a period accuracy of 0.5 seconds, and a
wave direction accuracy of ±10°. In 2021, Wang Z (Wang, 2021).
replaced SIFT with the Semi-Global Block Matching (SGBM)
(Hirschmuller, 2007) algorithm for stereo matching to compute
wave height and used interleaved spectrum and time delay to
calculate wave direction. The nal measurement results showed that
the relative standard deviation of wave height ranged from 2.5% to
12.5%, with a maximum relative error of 4.97% for wave direction.
In recent years, the widespread application of deep learning
across various elds has driven the development of related
technologies. Within ocean data processing, deep learning
techniques have been widely applied to tasks such as image
classication, data processing, and scene reconstruction. For
example, deep learning methods have shown notable success in
ship detection and sh detection (Raveendran et al., 2021;Gao et al.,
2023c;Gao et al., 2023a;Zhang et al., 2024). Additionally, these
techniques have been broadly utilized in ocean science (Kandimalla
et al., 2022;Cao et al., 2024;Gao et al., 2024). In these applications,
deep learnings robust feature extraction and adaptive capabilities
have markedly enhanced the accuracy of image processing and
analysis. The successful deployment of these technologies offers
valuable guidance for the approach introduced in this paper.
Accurate estimation of wave parameters is essential for a wide
range of oceanographic and engineering applications. However,
existing computational methods for wave motion analysis still face
signicant challenges. Traditional stereo matching algorithms often
generate disparities with substantial errors and noise, limiting the
precision of wave direction and motion calculations. Furthermore,
while methods like template matching and interleaved spectrum are
effective for two-dimensional space, they fail to capture the
complexities of wave motion in three-dimensional space, which is
crucial for accurate three-dimensional wave motion analysis.
Moreover, traditional wave tank testing is expensive and challenging,
particularly when simulating multi-directional wave motion.
To address these limitations, this paper proposes a novel
solution: replacing traditional stereo matching techniques with a
deep learning-based Pyramid Stereo Matching Network (PSM-Net)
(Chang and Chen, 2018). PSM-Net, with its ability to effectively
handle large-scale disparities and complex textures, provides a more
accurate and robust approach for wave motion estimation.
Additionally, to further improve the accuracy of three-
dimensional wave motion analysis, we combine the PSM-Net
with the SIFT feature point matching algorithm, enabling precise
calculation of wave parameters in three-dimensional space. And we
propose using a six-degree-of-freedom platform as an alternative
for indoor experiments, offering a cost-effective and feasible
solution for validating the proposed algorithm. This innovative
approach not only improves computational accuracy but also opens
up new possibilities for ocean wave simulation and analysis.
2 Basic techniques for wave
measurement using stereo vision
The system architecture is illustrated in Figure 1. The principle
of wave measurement using stereo vision is based on stereo vision
technology, employing two cameras to simultaneously capture
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org02
images of the waves. After preprocessing the images, a stereo
matching algorithm is used to compute the disparity map, and
the depth of each pixel is calculated based on the principles of
triangulation. This allows for the acquisition of the three-
dimensional shape and motion parameters of the target. The
overall system owchart, shown in Figure 2, is divided into four
main stages: preprocessing, stereo matching, three-dimensional
reconstruction, and wave parameter estimation.
2.1 Image preprocessing
Preprocessing of stereo images is a crucial step in stereo vision
systems, as it enhances the accuracy and efciency of subsequent
processing. This step consists of two parts: calibration and image
rectication. The rst step in preprocessing is the calibration of the
stereo cameras. In this study, the Zhang Z. calibration method
(Zhang, 2000) is employed, which involves capturing images of a
calibration board from various poses and detecting the corners of
the board in each image to calculate the camera parameters. The
preprocessing steps are illustrated in Figure 3.Duringimage
rectication, to address the issue of ensuring that the imaging
planes are coplanar and aligned for three-dimensional coordinate
reconstruction, this paper adopts distortion correction and epipolar
rectication techniques. This transforms the image planes of the left
and right cameras into a coplanar and aligned state, thereby
simplifying the calculations involved in stereo matching.
The detailed preprocessing steps are as follows:
1. Design a calibration board of appropriate size, ensuring
that its relative position to the camera can cover the entire
eld of view.
2. Position the calibration board at different angles and
locations within the cam-eraseld of view to capture
multiple images. Make sure the board covers the entire
visual range and that at least 20 pairs of images are collected
for thorough calibration.
3. Detect the corner points of the chessboard in the
acquired images.
4. Using the detected corner points and the specied
dimensions of the calibration board, calculate the
cameras intrinsic parameter matrix. Next, compute the
extrinsic parameter matrix between the two cameras using
images taken together.
5. By comparing the ideal and actual positions of the detected
corner points, calculate the image reprojection error to
obtain the distortion coefcients. Then, use these
coefcients to correct the new images.
6. Calculate the correction matrix with the camera parameters
and apply geometric transformations to the two images for
epipolar rectication.
2.2 Stereo matching algorithm
The stereo matching algorithm is a key component of stereo
vision and plays an essential role in obtaining disparity maps for
three-dimensional coordinate reconstruction. The effectiveness of a
good stereo matching algorithm directly inuences the quality of
the 3D reconstruction. Given the complex textures present in ocean
waves and the varying weather and lighting conditions outdoors,
traditional dense stereo matching algorithms, such as SGBM, and
sparse matching algorithms, like SIFT, often produce disparity
maps that are not continuous and may exhibit signicant
deviations. To ad-dress this issue, this paper adopts a deep
learning-based PSM-Net network to replace traditional algorithms
for calculating image disparity.
The overall network structure is illustrated in Figure 4. PSM-
Net is an end-to-end deep learning model. Initially, the left and
right images are fed into two weight-shared Convolutional Neural
Networks (CNNs) for feature extraction. The model then utilizes a
pyramid structure to extract multi-scale features from the left and
right images, which is particularly well-suited for handling the
complex textures and dynamic changes of ocean surfaces. By
capturing features at multiple scales, the pyramid structure
ensures a more comprehensive representation of the intricate
patterns and varying depths inherent in marine environments,
thereby improving the accuracy of disparity estimation. Feature
fusion is performed through convolutional layers, leveraging the
pyramid structure to create a robust and enriched feature
representation. Next, the model employs a cost volume for cost
aggregation and regularization. Finally, a three-dimensional
convolutional network integrates both global and local
information to extract the nal disparity map using a regression
approach from the cost volume. This stepwise optimization
method, progressing from global to local, effectively enhances the
accuracy and robustness of stereo matching, resulting in high
precision, high matching rates, and continuous, smooth disparity
maps that improve measurement accuracy.
2.3 3D reconstruction algorithm
After obtaining the camera parameters and the disparity map,
the world coordinates of the pixel points can be inferred using the
principle of triangulation. The schematic representation is
illustrated in Figure 5, and the corresponding formula is given in
Equation 1.
FIGURE 1
System architecture diagram.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org03
Z
f=Y
y2
=Y
y1
=X
x1
=Xb
x2
(1)
This leads to the formulation of Equation 2.
X=x1*b
x1x2=x1*b
d
Y=y1*b
x1x2=y1*b
d
Z=f*b
x1x2=f*b
d
8
>
>
>
>
<
>
>
>
>
:
(2)
In this equation, fis the focal length and bis the baseline length,
both calculated in Section 2.1. The disparity dhas been
determined in Section 2.2. With these parameters, the world
coordinates of the pixel points X;Y;Zcan be obtained.
2.4 Wave height and wave
period algorithm
After the three-dimensional coordinates of the pixels have been
acquired, selecting a measurement area. Then, computes the
equation of the average sea level, and the distance from points to
this plane is used to determine the height and period variations of
the wave points. In this study, the calculation process for wave
height and wave period is illustrated in Figure 6.
FIGURE 3
Steps for preprocessing.
FIGURE 2
Overall system workow diagram.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org04
The specic estimation steps are designed as follows:
1. Read the three-dimensional reconstruction results of the
current frame.
2. Select ndata points within the range of the water surface
captured by the cam-era and compute the sea
level equation.
3. Set a distance threshold l, and calculate the distance from
each point to the sea level. Points with a distance less than l
are recorded as inliers.
4. Repeat the experiment mtimes to identify the inlier set with
the maximum number of inliers. Use the coordinates of
these inliers to recalculate the sea level equation, which will
serve as the current sea level equation. After obtaining the
results, read the next frame.
5. Repeat steps 1-4 for Nsets of images to derive Nsea level
equations. Calculate the mean of the coefcients to obtain
the average sea level equation.
6. Compute the distance from each point in the Nsets of
images to the average sea level, which represents the wave
height for those images. Calculate the mean of all wave
heights to get the average wave height.
7. Select Mxed points and record their height variations over
time. Apply smoothing ltering to process the curves, and
use the zero-crossing method to calculate the wave period
from the smoothed curves. Compute the mean wave period
from all points to obtain the average wave period.
2.5 Feature point matching algorithm
Stereo matching focuses on identifying corresponding points in
the left and right views to obtain depth information. However, in
this section, feature matching is used to nd correspondences
between features in consecutive frames of the same scene. By
com-paring the descriptors of these feature points, similar regions
or corresponding points in the images can be identied. These
points are typically signicant and unique, providing important
information about the structure of the images.
Waves are typically irregular and nonlinear, with their shape, size,
and direction continuously changing. Different sizes of waves contain
various scale features. Feature matching algorithms can extract multi-
scale feature points and accurately nd correspondences between
feature points, even in the presence of noise and partial occlusion,
effectively addressing the complex variations of wave characteristics.
Therefore, this study employs the SIFT feature point matching
algorithm to detect key point information in consecutive frames
and combines it with the results of 3D reconstruction to calculate
wave direction. Compared to traditional methods such as template
matching and phase-shifting techniques, this approach integrates 3D
information, allowing for a more accurate capture of wave motion
characteristics and enhancing the robustness and precision of wave
direction prediction.
The steps of the SIFT algorithm are illustrated in Figure 7,
which divides the over-all process into three main stages: feature
point detection, feature description, and feature matching. The
principle of feature point detection is based on the detection of
extrema in the scale space. First, a scale space is constructed by
applying Gaussian blur for multi-scale processing, resulting in a
scale space pyramid. After constructing the Gaussian pyramid of
the image, subtracting adjacent layers produces a Difference of
Gaussian (DoG) pyramid. Next, extrema points are identied
within the DoG pyramid, which serve as the feature points.
FIGURE 5
Principle diagram of 3D reconstruction.
FIGURE 4
PSM-Net structure.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org05
Feature description involves mathematically characterizing these
feature points by capturing their position, scale, and orientation in-
formation, resulting in the generation of keypoint descriptors.
Finally, the feature matching step is completed by comparing the
sets of keypoint descriptors from the two images and matching
corresponding keypoints.
To address the issue that using template matching can only
provide the two-dimensional motion direction of waves and cannot
comprehensively reect their movement in three-dimensional
space, this paper employs the SIFT algorithm to calculate the
feature descriptors of each keypoint and perform matching. By
accurately determining the two-dimensional movement of
keypoints between consecutive frames and integrating this with
the results from stereo matching and three-dimensional re-
construction, the two-dimensional movement of keypoints is
extended to three-dimensional movement. This allows for the
calculation of the three-dimensional motion direction of the
waves. The specic design is illustrated in Figure 8. This meth-od
ingeniously combines depth learning-based stereo matching
technology with feature matching techniques, effectively
integrating the two-dimensional movement of feature points with
the results of three-dimensional reconstruction to provide a com-
prehensive analysis of wave movement, thereby signicantly
enhancing the accuracy and reliability of wave direction
parameter measurements.
2.6 Wave direction estimation algorithm
Building on the design from the previous section, this study
employs the SIFT method to process two consecutive frames of
images, matching the keypoints between them and calculating their
two-dimensional positional changes within the images. By
integrating the results from the three-dimensional reconstruction,
the three-dimensional coordinate changes of the keypoints can be
inferred, enabling the calculation of wave direction. The detailed
ow of wave direction computation is illustrated in Figure 9.
The specic estimation steps are designed as follows:
1. Set the reference direction (typically North) as 0°.
2. Use the SIFT algorithm to match feature points between the
two images taken at different times and extract their two-
dimensional coordinates.
3. Convert the two-dimensional coordinates into the
corresponding three-dimensional world coordinates using
the calculations from Section 1.3.
4. Subtract the old three-dimensional world coordinates from
the new coordinates to obtain the displacement vector.
5. Calculate the average displacement vector by taking the
mean of the velocity vectors.
6. Determine the azimuth of the average velocity vector to
obtain the average wave direction at that moment.
FIGURE 6
Process diagram for calculating wave height and wave period.
FIGURE 7
The steps of the SIFT algorithm.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org06
7. Repeat steps 2-4 for N sets of images until all time instances
of wave direction are obtained, then calculate the mean to
derive the average wave direction over this time period.
3 System development
Theentiresystemconsistsoftwomain components: hardware and
software. The hardware component includes the various hardware
elements, while the software component encompasses the algorithm
modules and their processing workows. The following sections
provide a detailed introduction to both components.
3.1 Hardware conguration
The overall composition of the systems hardware is illustrated
in Figure 10. The camera system consists of the Alvium G1-1236c
camera paired with an LM6FC24M lens. The Alvium G1-1236c
features a resolution of 4112 (H) × 3008 (V), providing high-
resolution images that facilitate the capture of intricate details on
the ocean sur-face, making it suitable for high-precision marine
parameter collection. Compared to consumer-grade cameras, this
camera offers advantages such as stable image trans-mission
capabilities and high interference resistance, allowing it to operate
reliably in complex environments. The LM6FC24M lens boasts a
wide eld of view, enabling coverage of a larger marine observation
area. Its precision design effectively reduces im-age distortion while
preserving color delity, making it well-suited for industrial ap-
plications. The combination of these two components signicantly
enhances the over-all performance of the system.
The camera is securely mounted using a tripod, a gimbal, and a
stereo board. The tripod provides excellent support, ensuring stable
camera operation under various conditions. The gimbal allows for
precise adjustments and corrections of the cameras angle and
direction as needed. The stereo board facilitates quicker setup and
adjustments, making it convenient and efcient to reposition the
cameras. This entire mounting system effectively enhances the
reliability of image acquisition, ensuring that the camera maintains
a stable posture during the shooting process, thus allowing it to
operate reliably in different environments. Finally, the system is
equipped with a computer featuring a high-performance GPU for
data processing, ensuring efcient handling of image data.
3.2 Software conguration
The overall module structure of the software is illustrated in
Figure 11. The cam-era calibration module is responsible for image
acquisition, calibration, and preprocessing, aiming to synchronize
the left and right views of the same scene and transmit the image
FIGURE 9
Process diagram for calculating wave direction.
FIGURE 8
Integrated design of PSM-Net and SIFT.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org07
data to the computation unit. This module eliminates distortions in
the im-ages and performs geometric correction based on calibration
parameters to ensure the alignment of the left and right images.
The stereo matching module calculates the disparity of each
pixel in the left and right images, generating a disparity map to
obtain depth information about the scene. The 3D reconstruction
module utilizes the results from stereo matching to reconstruct the
3D information of each pixel, providing foundational data for
subsequent analysis of the waves three-dimensional morphology.
The wave height and wave period calculation module employs the
results from the 3D reconstruction to calculate the wave height and
wave period.
The feature matching algorithm is used to extract and match
feature points in the front and rear frame images, identifying the
positional changes of the waves at different time points and
analyzing their dynamic characteristics. The wave direction
calculation module combines feature matching and 3D
reconstruction results to compute the wave direction.
These modules collaborate with each other to ultimately achieve
the parameter calculation of the waves.
The software section is built in a coding environment that uses
Python 3.7 or higher. The primary algorithm libraries utilized
include Vmbpy for controlling the camera, OpenCV for image
processing, PyTorch for establishing the stereo matching deep
learning network, NumPy for scientic computing, Open3D for
point cloud processing, Pandas for data manipulation, and
Matplotlib for data visualization. These libraries collectively
support the implementation of various algorithms required for
the systems functionality.
4 Wave simulation experiment design
based on a six-degree-of-freedom
plat-form
To verify the accuracy of the algorithms, this paper innovatively
proposes a solution using a six-degree-of-freedom platform to
simulate waves, addressing the issues of attenuation and the
difculty of simulating multi-directional waves commonly found
in traditional wave tanks. The parameters of the six-degree-of-
freedom platform are shown in Table 1. The overall plan is as
follows: First, the six-degree-of-freedom plat-form is restored to the
neutral position, serving as the average sea level. An object is placed
on this level to simulate the cur-rent wave state, and the distance
from the object to the average sea level is calculated and compared
with the actual height to verify the accuracy of the average sea level
and wave height algorithms. Next, the platform is returned to the
mid-position, and a xed periodic back-and-forth motion around
the axis is set to simulate the cyclical uctuations of waves. The
computed period is compared with the set period to verify the
accuracy of the wave period algorithm. Finally, the platform is again
restored to the mid-position, and it is set to move in any direction to
simulate the wave propagation direction. The computed wave
direction is compared with the actual motion di-rection to
validate the accuracy of the wave direction algorithm. Compared
to traditional wave tanks, the six-degree-of-freedom platform can
accurately simulate com-plex multi-directional wave movements,
avoiding attenuation during motion and eliminating the high costs
associated with wave tank devices, thereby providing a more
economical and efcient solution for research.
The platform setup is illustrated in Figure 12, where a support is
used to x the camera in a position that covers the entire motion
FIGURE 11
Software conguration of the system.
FIGURE 10
Hardware conguration of the system.
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org08
area of the six-degree-of-freedom platform, which is connected to a
computer for data processing. Objects are placed on the six-degree-
of-freedom platform, and after capturing images, algorithms are
used to calculate the platforms motion parameters. By comparing
these with the actual motion parameters, the accuracy and reliability
of the 3D reconstruction algorithm and the wave parameter
algorithm are evaluated.
4.1 Wave height and period experiment
In order to validate the effectiveness of the method proposed in
this paper, a laboratory scene was chosen, where disparity maps
were generated for both an initial at plane and a plane with objects
placed on it, using SGBM and PSM-Net trained with the Scene Flow
dataset (Mayer et al., 2016). The height of the objects in relation to
the initial plane was calculated and compared to the actual object
heights to assess the accuracy and reliability of the 3D
reconstruction and sea-level estimation algorithms. The disparity
map generation results are shown in Figure 13, and the object
heights calculated via 3D reconstruction using the camera
parameters are presented in Table 2.
The disparity maps show that, in comparison to traditional
algorithms, the disparity maps generated by PSM-Net are more
continuous and smoother, effectively avoiding the discontinuities
often encountered in traditional methods. From the experimental
results, it can be seen that the object height measured by the 3D
reconstruction algorithm has an error within 2%, conrming the
algorithms high accuracy in wave height measurement. PSM-Net,
in particular, achieves higher precision, highlighting its advantages
over conventional algorithms.
The six-degree-of-freedom platform is set to perform periodic
reciprocating motion for 20 seconds, capturing a total of 201 images
over 67 seconds at a rate of 3 frames per second. A target area is
selected, and the average sea level equation is calculated. Seven
random points within the selected area are chosen, and the
distances from these points to the average plane are tracked,
recording the height variations. The height data is converted into
FIGURE 12
Platform setup.
TABLE 1 Main technical parameters of the six-degree-of-
freedom platform.
Name Six-Degree-of-Freedom Platform (1t)
Model RX/YBT-6-10000(H)
Load 1ton
Swing
Frequency
0-22Hz
X-Axis Displacement ± 0.4m, Speed ± 0.8m/s, Acceleration ± 10m/s
2
Angular Displacement ±30°, Angular Velocity ±40°/s, Angular
Acceleration ±300°/s
2
Y-Axis Displacement ± 0.4m, Speed ± 0.8m/s, Acceleration ± 10m/s
2
Angular Displacement ±30°, Angular Velocity ±40°/s, Angular
Acceleration ±300°/s
2
Z-Axis Displacement ± 0.2 m, Speed ± 0.8m/s, Acceleration ± 10m/s
2
Angular Displacement ±30°, Angular Velocity ±40°/s, Angular
Acceleration ±300°/s
2
Positioning
Accuracy
Translational Repeatability ± 0.8mm
Angular Repeatability ± 0.04°
Stroke Hysteresis 0.8mm
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org09
the time domain, and a curve graph is plotted. This curve is
processed using smoothing lters. The time difference between
two zero crossings is calculated using the zero-crossing method,
yielding the wave period. The mean of the calculated periods is
taken as the average motion period, which is then compared with
the actual period to compute the system error. The selected target
area, point locations, obtained disparity maps, curves, and the
curves smoothed using the Savitzky-Golay lter are shown in
Figure 14. The results are presented in Table 3.
The analysis of the data reveals that the errors in the periods
measured by the period algorithm range from 0.55% to 4.00%.
These results indicate that the period algorithm performs well in
indoor experiments, demonstrating high accuracy and reliability.
4.2 Wave direction experiment
The initial position and motion trajectory parameters of the
six-degree-of-freedom platform are set, with the reference
direction dened as the positive y-axis of the plat-form. The
calculations are carried out using the template matching
algorithm, SIFT, and the proposed method. The computed
results are compared to the actual motion parameters to assess
the validity and effectiveness of the algorithm, proving the
superiority of the method used in this paper. Figure 15 shows
the matched points obtained by SIFT and the matching results
from the template matching algorithm. The experimental results
are summarized in Table 4.
TABLE 2 Height measurement results of the six-degree-of-freedom platform.
PSM-Net
Average
Height
Actual
Height
Average
Error SGBM
Average
Height
Actual
Height
Average
Error
Cup Height 20.84cm 20.7cm 0.67% Cup Height 21.22cm 20.7cm 2.51%
Helmet
Height 16.47cm 16.7cm 1.38%
Helmet
Height
16.62cm 16.7cm 0.48%
Glass Height 21.55cm 21.3cm 1.15% Glass Height 20.93cm 21.3cm 1.74%
Tape Height 4.88cm 4.8cm 1.67% Tape Height 4.63cm 4.8cm 3.54%
FIGURE 13
Experimental results of height measurements. (A) Disparity Map of the Baseline Plane (SGBM), (B) Disparity Map of the Baseline Plane (PSM-Net),
(C) Disparity Map of the Object (SGBM), (D) Disparity Map of the Object (PSM-Net).
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org10
The experimental results shown in Figure 15 and the data in
Table 4 clearly indicate that the proposed method has a signicant
advantage in the accuracy of wave direction calculation, especially
when there is a change in the depth (X-axis) of the six-degree-of-
freedom platform. The results demonstrate that the proposed
method maintains higher accuracy during platform depth
adjustments or in complex motion scenarios. Future research
could further explore the algorithms applicability in more
complex environments, aiming to promote its application and
development in real-world engineering.
5 Conclusions
This paper presents a method for calculating wave parameters using
the principles of binocular vision. To address the issues of discontinuity
in disparity maps and insufcient accuracy generated by traditional
stereo matching methods, this study proposes the use of the PSM-Net,
based on deep learning, for stereo matching to obtain high-precision
depth maps and enhance overall accuracy. To overcome the limitation
of template matching in providing only the two-dimensional motion
direction of waves and failing to comprehensively reect their
movement in three-dimensional space, this study introduces the use
of the SIFT algorithm in conjunction with stereo matching and three-
dimensional reconstruction results to analyze the changes in the three-
dimensional coordinates of feature points, thereby inferring the three-
dimensional motion direction of the waves. To tackle the high cost and
difculty of simulating multi-directional waves with traditional wave
umes, this paper pro-poses replacing the ume with a six-degree-of-
freedom platform to simulate wave motion. Experimental data analysis
indicates that the algorithm can accurately estimate wave parameters
simulated by the six-degree-of-freedom platform, yielding results with a
TABLE 3 Six-degree-of-freedom platform period measurement results.
Measurement Actual Error Measurement Actual Error
Point 1 20.80s 20.00s 4.00% Point 5 20.80s 20.00s 4.00%
Point 2 19.89s 20.00s 0.55% Point 6 19.55s 20.00s 2.25%
Point 3 19.73s 20.00s 1.35% Point 7 20.43s 20.00s 2.15%
Point 4 20.80s 20.00s 4.00% Average 20.29s 20.00s 1.45%
(A) (B)
(C) (D)
FIGURE 14
Experimental results of period measurements. (A) Randomly Selected Points in the Area, (B) Disparity Map of the Area, (C) Distance Variation over
Time, (D) Smoothed Data
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org11
TABLE 4 Comparison of wave direction measurement results on the six-degree-of-freedom platform.
Movement
Direction
Actual
Value (°)
Proposed Method
Prediction (°)
Error
(°)
SIFT Only Pre-
diction (°)
Error
(°)
Template Matching
Prediction (°)
Error
(°)
Positive Y-axis 0 0.72 0.72 0.83 0.83 -9.42 9.42
Positive X-axis 90 90.65 0.65 82.9 7.1 83.1 6.9
Negative Y-axis -180 -179.76 0.24 -178.23 1.77 -171.04 8.96
Negative X-axis -90 -88.74 1.26 -82.1 7.9 -94.45 4.45
FIGURE 15
Experimental results of direction measurements. (A) Move Along the Positive Y-Axis (SIFT), (B) Move Along the Positive Y-Axis (Template Matching),
(C) Move Along the Negative Y-Axis (SIFT), (D) Move Along the Negative Y-Axis (Template Matching), (E) Move Along the Positive X-Axis (SIFT),
(F) Move Along the Positive X-Axis (Template Matching), (G) Move Along the Negative X-axis (SIFT), (H) Move Along the Negative X-axis
(Template Matching).
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org12
maximum wave height error of <5%, a maximum period error of <4%,
and a wave direction error of <2°, demonstrating high reliability. The
successful experiments conducted in the indoor six-degree-of-freedom
platform showcase its potential for re-al-world applications, providing a
solid foundation for further research and development of binocular
vision-based wave parameter measurement technology.
Although this study has achieved preliminary results, there are still
several areas that require further exploration and improvement. Firstly,
the current methods primarily focus on indoor wave simulation. While
the six-degree-of-freedom platform experiment effectively simulates
certain wave motion scenarios, real-world marine observations often
face challenges such as adverse weather conditions, which can degrade
camera image quality and affect image processing algorithms.
Additionally, although the proposed method demonstrates high
accuracy, real-time processing of stereo images remains a challenge
in large-scale marine environments. Future work could focus on
improving real-time wave parameter estimation through hardware
acceleration or algorithm optimization. In conclusion, this research
offers an effective solution for wave parameter measurement, yet there
is signicant potential for further development to enable broader
application in real-world conditions.
Data availability statement
The raw data supporting the conclusions of this article will be
made available by the authors, without undue reservation.
Author contributions
JW: Investigation, Methodology, Software, Validation, Writing
original draft. SC: Conceptualization, Data curation, Formal analysis,
Funding acquisition, Investigation, Project administration, Resources,
Supervision, Writing review & editing. SL: Formal analysis, Project
administration, Software, Supervision, Writing review & editing.
MS: Conceptualization, Formal analysis, Project administration,
Resources, Supervision, Writing review & editing. BW:
Conceptualization, Project administration, Supervision, Writing
review & editing. QZ: Conceptualization, Formal analysis, Software,
Writing original draft. YW: Supervision, Validation, Writing
review & editing. ZL: Data curation, Formal analysis, Writing
review & editing. JZ: Supervision, Validation, Writing review &
editing. XY: Supervision, Validation, Writing review & editing. BM:
Supervision, Validation, Writing review & editing.
Funding
The author(s) declare that nancial support was received for the
research, authorship, and/or publication of this article. This work is
supported by the National Key Research and Development Program
(2022YFC3104201), National Natural Science Foundation of
China (41976179).
Acknowledgments
We would like to especially thank all teams involved in the
research and development for their help.
Conict of interest
The authors declare that the research was conducted in the
absence of any commercial or nancial relationships that could be
construed as a potential conict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the
creation of this manuscript.
Publishers note
All claims expressed in this article are solely those of the authors
and do not necessarily represent those of their afliated organizations,
or those of the publisher, the editors and the reviewers. Any product
that may be evaluated in this article, or claim that may be made by its
manufacturer, is not guaranteed or endorsed by the publisher.
References
Banner, M. L., Jones, I. S. F., and Trinder, J. C. (1989). Wavenumber spectra of short
gravity waves. J. Fluid Mechanics 198, 321344. doi: 10.1017/S0022112089000157
Benetazzo, A. (2006). Measurements of short water waves using stereo matched
image sequences. Coast. Eng. 53, 10131032. doi: 10.1016/j.coastaleng.2006.06.012
Bergamasco, F., Torsello, A., Sclavo, M., Barbariol, F., and Benetazzo, A. (2017).
WASS: An open-source pipeline for 3D stereo reconstruction of ocean waves. Comput.
Geosciences 107, 2836. doi: 10.1016/j.cageo.2017.07.001
Cao, C., Bao, L., Gao, G., Liu, G., and Zhang, X. (2024). A novel method for ocean
wave spectra retrieval using deep learning from Sentinel-1 wave mode data. IEEE
Trans. Geosci. Remote Sens. 62, 116. doi: 10.1109/TGRS.2024.3369080
Cao, Z., Wang, Y., Zheng, W., Yin, L., Tang, Y., Miao, W., et al. (2022). The algorithm
of stereo vision and shape from shading based on endoscope imaging. Biomed. Signal
Process. Control 76, 103658. doi: 10.1016/j.bspc.2022.103658
Chang, J. R., and Chen, Y. S. (2018). Pyramid stereo matching network,in
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
IEEE. 54105418. doi: 10.1109/CVPR.2018.00567
Chowdhury, M. S., Rahman, K. S., Selvanathan, V., Nuthammachot, N., Suklueng, M.,
Mostafaeipour, A., et al. (2021). Current trends and prospects of tidal energy technology.
Environment Dev. Sustainability 23, 81798194. doi: 10.1007/s10668-020-01013-4
Fan, R., Wang, L., Bocus, M. J., and Pitas, I. (2020). Computer stereo vision for
autonomous driving. arXiv:2012.03194 [cs.CV]. doi: 10.48550/arXiv.2012.03194
Gao, G., Bai, Q., Zhang, C., Zhang, L., and Yao, L. (2023a). Dualistic cascade
convolutional neural network dedicated to fully PolSAR image ship detection. ISPRS J.
Photogrammetry Remote Sens. 202, 663681. doi: 10.1016/j.isprsjprs.2023.07.006
Gao, G., Yao, B., Li, Z., Duan, D., and Zhang, X. (2024). Forecasting of sea surface
temperature in Eastern Tropical Pacic by a hybrid multiscale spatialtemporal model
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org13
combining error correction map. IEEE Trans. Geosci. Remote Sens. 62, 122.
doi: 10.1109/TGRS.2024.3353288
Gao, G., Yao, L., Li, W., Zhang, L., and Zhang, M. (2023b). Onboard information
fusion for multisatellite collaborative observation: Summary, challenges, and
perspectives. IEEE Geosci. Remote Sens. Magazine 11, 4059. doi: 10.1109/
MGRS.2023.3274301
Gao, G., Zhou, P., Yao, L., Liu, J., Zhang, C., and Duan, D. (2023c). A bi-prototype
BDC metric network with lightweight adaptive task attention for few-shot ne-grained
ship classication in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 116.
doi: 10.1109/TGRS.2023.3321533
He, P., Hu, D., and Hu, Y. (2022). Deployment of a deep-learning based multi-view
stereo approach for measurement of ship shell plates. Ocean Eng. 260, 111968.
doi: 10.1016/j.oceaneng.2022.111968
Hirschmuller, H. (2007). Stereo processing by semiglobal matching and mutual
information. IEEE Trans. Pattern An al. Mach. Intell. 30, 328341. doi: 10.1109/
TPAMI.2007.1166
Kandimalla, V., Richard, M., Smith, F., Quirion, J., Torgo, L., and Whidden, C.
(2022). Automated detection, classication and counting of sh in sh passages with
deep learning. Front. Mar. Sci. 8, 823173. doi: 10.3389/fmars.2021.823173
Li,J.,Chen,R.,Wang,S.,Mao,H.,He,Y.,Sui,D.,etal.(2012).Trendsininternational
marine observation technology development and the practice of deep-sea station c onstruction
in China. J. Trop. Oceanography 02, 123133. Available at: https://kns.cnki.net/kcms2 /article/
abstract?v=92B8ThN9IcuClrvzntcCzCwVELaLiAYANNXmVjJx5kxrNWC_
cPKovm0UnhmK_8V8asYsYnG4vxSNyiqIyLh8 1ncG1AB33r2S-I0Qxecko4tkOB2G_
ZfrHJ-_RIk0duQeDuUvEQD12BeDXhSTkqc6kbRg7pC-RXhkuCA_okZm_
WHIB93Mu6pdTMWHbE6dDuod&uniplatform=NZKPT&language=CHS (Accessed
December 14, 2024).
Lin, M., and Yang, C. (2020). Ocean observation technologies: A review. Chin. J.
Mechanical Eng. 33, 32. doi: 10.1186/s10033-020-00449-z
Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. Int. J.
Comput. Vision 60, 91110. doi: 10.1023/B:VISI.0000029664.99615.94
Mayer, N., Ilg, E., Häusser, P., Fischer, P., Dosovitskiy, A., and Brox, T. (2016). A
large dataset to train convolutional networks for disparity, optical ow, and scene ow
estimation,in Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition (CVPR). IEEE. 40404048. doi: 10.1109/CVPR.2016.438
Nie, W. (2020). Research on wave elevation measurement method and 3D
visualization technology based on binocular vision (Master's thesis, Harbin
Engineering University). China Natl. Knowledge Infrastructure. doi: 10.27060/
d.cnki.ghbcu.2020.001047
Raveendran, S., Patil, M. D., and Birajdar, G. K. (2021). Underwater image
enhancement: a comprehensive review, recent trends, challenges and applications.
Artif. Intell. Rev. 54, 54135467. doi: 10.1007/s10462-021-10025-z
Shemdin, O. H., and Tran, H. M. (1992). Measuring short surface waves with
stereophotography. Photogrammetric Eng. Remote Sens. 58, 311316. doi: 10.1109/
OCEANS.1990.584829
Shemdin, O. H., and Wu, S. (1988). Directional measurement of short ocean waves
with stereophotography. J. Geophysical Research: Oceans 93, 1389113901.
doi: 10.1029/JC093iC11p13891
Shi, L., Zhu, H., and Yu, Y. (2019). Research on wave parameter telemetry method
based on binocular stereo vision. J. Electronic Measurement Instrumentation 33, 99
104. doi: 10.13382/j.jemi.B1801549
Sun, H., Wu, G., Wang, X., Zhang, T., Zhang, P., Chen, W., et al. (2022). Research on
a measurement method for the ocean wave eld based on stereo vision. Appl. Sci. 12,
7447. doi: 10.3390/app12157447
Tani,S.,Ruscio,F.,Bresciani,M.,Nordfeldt,B.M.,Bonin-Font,F.,andCostanzi,R.(2023).
Development and testing of a navigation solution for Autonomous Underwater Vehicles
based on stereo vision. Ocean Eng. 280, 114757. doi: 10.1016/j.oceaneng.2023.114757
Wang, Z. (2021). Research on wave parameter observation technology based on
binocular vision (Master's thesis, Harbin Engineering University). China Natl.
Knowledge Infrastructure. doi: 10.27060/d.cnki.ghbcu.2021.001155
Yan, L. (2017). Long-distance wave measurement based on 3D image measurement
technology. (Master's thesis, Nanjing University of Science and Technology). China
National Knowledge Infrastructure. Available at: https://kns.cnki.net/kcms2/article/
abstract?v=ufuULlVWCsNqBVOcqdKJh2KqZinolM0oxTxYZ9
oEuTP0YePSffKX0BKIjKGAc-shV7Mw3RP69tgKVaZxqdPl9vEC96sIjcvEjO09
Drp3zseRimVvWpjqNVu7T-PZI85axP_J05qloth29uLQCS_3QMgHz-
2IhXsbPgNh6eEKijvoSZvnaBuIZDV8XwHWH-Rvm7V3mjXzTLbQ=&uniplatform=
NZKPT&language=CHS (Accessed December 14, 2024).
Yang, Y., Hui, L., and Wan, J. (2018). Design of an acoustic wave gauge based on
ATxmega256A1. Meteorological Hydrological Mar. Instruments 35, 3236.
doi: 10.19441/j.cnki.issn1006-009x.2018.02.009
Zhang, Z. (2000). Flexible camera calibration by viewing a plane from unknown
orientations,in Proceedings of the IEEE Computer Society Conference on Computer Vision
and Pattern Recognition. IEEE. 666673. doi: 10.1109/ICCV.1999.791289
Zhang, C., Zhang, X., Gao, G., Lang, H., Liu, G., Cao, C., et al. (2024). Development
and application of ship detection and classication datasets: A review. IEEE Geosci.
Remote Sens. Magazine. 12, 1245. doi: 10.1109/MGRS.2024.3450681
Zhang, X., Gao, G., and Chen, S.-W. (2024). Polarimetric autocorrelation matrix: A
new tool for joint characterizing of target polarization and Doppler scattering
mechanism. IEEE Trans. Geosci. Remote Sensing 62,122. doi: 10.1109/
TGRS.2024.3398632
Zhang, L., Gao, G., Chen, C., Gao, S., and Yao, L. (2022). Compact polarimetric
synthetic aperture radar for target detection: A review. IEEE Geosci. Remote Sens.
Magazine 10, 115152. doi: 10.1109/MGRS.2022.3186904
Zhang, D., Wang, J., Lin, Y., Si, Y., Huang, C., Yang, J., et al. (2017). Present situation
and future prospect of renewable energy in China. Renewable Sustain. Energy Rev. 76,
865871. doi: 10.1016/j.rser.2017.03.023
Zhang, Q., Zhang, L., and Jia, Y. (2023). Development and prospect of satellite
remote sensing technology for ocean dynamic environment. J. Spacecraft Rockets 60,
16791698. doi: 10.2514/1.A35710
Wu et al. 10.3389/fmars.2024.1508233
Frontiers in Marine Science frontiersin.org14
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
This paper introduces an innovative approach in Synthetic Aperture Radar (SAR) polarimetry and proposes a novel descriptor called polarimetric autocorrelation matrix. Different from polarimetric covariance and coherency matrices, the polarimetric autocorrelation matrix can capture hidden Doppler information in the frequency domain and encode it in the phase using higher-order statistical methods. This matrix facilitates the joint extraction and analysis of polarization and Doppler information from fully Polarimetric SAR (PolSAR) data through matrix analysis. The paper explains that the polarimetric autocorrelation matrix can be decomposed into a Doppler-related matrix and a covariance-related matrix. The magnitudes and phases of these matrices provide insight into Doppler shifts between different polarizations and the variance of the backscattering coefficient. Our research demonstrates how the Doppler shift is influenced not only by the target’s radial velocity but also by radar polarization. Four quad-polarimetric RADARSAT-2 images are used for testing in this study, and a set of characterization features and parameters are derived from the polarimetric autocorrelation matrix. The novel descriptor, together with the new parameters, can detect man-made target and sea ice, mitigate ambiguity caused by moving targets, and indicate the motion status of ship targets and sea currents. Specifically for marine scenes, our data found Doppler differences in HH polarization and VV polarization induced by sea surface motion can be up to 100 Hz.
Article
Full-text available
Fine-grained ship classification in optical remote sensing images is a major challenge in the ocean observation field, elaborated as follows: First, the cost of acquiring ship images is expensive. Obtaining numerous labeled samples is difficult, resulting in the poor generalization ability of training models. Second, the features of ship target cannot be accurately obtained owing to complex background interference. Third, inter-class similarity and intra-class diversity among different ships render ship classification difficult. In this study, we propose LATA-BP-BDC: a bi-prototype Brownian Distance Covariance (BDC) metric network with lightweight adaptive task attention (LATA) for few-shot fine-grained ship classification. First, the LATA module is used to generate 3-dimensional (3D) weights, which can effectively reduce complex background interference and improve the adaptive capturing ability of target features without including additional network operators. Second, we input target features into the BDC metric module and output the BDC matrices to represent image information. Because the similarity between two images can be calculated as the corresponding BDC matrices distance, the improvement of the relevance of similar targets can be realized. Finally, we use the bi-prototype module to generate highly accurate prototypes, further calibrating information differences between images, which enhances the correlation between the same category samples and separability between different categories samples. Consequently, this process effectively reduces the influence of large intra-class appearance variation and small inter-class appearance variation. We perform validation using two fine-grained datasets, FGSCR and CUB. Compared with state-of-the-art methods, the LATA-BP-BDC achieves a superior performance and has good generalization for fine-grained few-shot classification.
Article
Full-text available
Featured Application This technology can realize near-field wave load monitoring on ships and it can also provide support for short-term wave prediction. Abstract The wave parameter is an important environmental input condition. Traditional contact wave measurement methods are unable to meet the requirements of high precision, non-contact, and ship wave field assessment. Alternatively, stereo vision technology can realize a non-contact and mobile form of measurement. However, this technology suffers from poor timeliness and adaptability. This paper proposes a comprehensive wave measurement method that is based on stereo vision, wherein the gridding of siftGPU is used to achieve the fast matching of large images. The whole algorithm can be run within 6 s and it guarantees more than 20,000 feature-matching logarithms. Furthermore, by utilizing the least squares method and sea surface wave surface theory, the sea surface base level can be calculated without control points, along with the inversion of the sea wave parameters (wave height, period, and wave direction) and error point fitting. The rationality and superiority of the algorithm were verified through multiple comparison experiments. Compared with the Richard Brancker Research (RBR) wave height meter, the measurement error of the wave height is less than 10%, the period error is less than 0.5 s, and the wave direction error is less than 10° with the proposed method.
Article
Ship detection and classification pose critical challenges in maritime surveillance and monitoring. On the one hand, they find applications in civilian domains, like fishery management, illegal immigrant monitoring, and marine rescue. On the other hand, ships are important targets in maritime combat. Ship detection and classification help to obtain timely maritime military intelligence that holds immense importance in seizing control of the maritime battlefield and ensuring the success of military operations [1], [2]. Remote sensing technology, facilitated by sensors mounted on platforms like satellites and airplanes, enables long-distance noncontact imaging of ground targets. It offers high timeliness, wide coverage, and low-cost accessibility and is not constrained by geographic conditions. Consequently, it has emerged as the primary method for monitoring vessels [4], [5].
Article
Ocean wave is of great significance in marine environment prediction, maritime navigation and global climate change. Synthetic Aperture Radar (SAR) is widely used in ocean wave spectra retrieval due to its two-dimensional high resolution, all-weather and all-time advantages. Nevertheless, the nonlinear mapping between SAR and ocean waves, caused by velocity bunching, hinders the advancement of wave spectra inversion techniques, resulting in low-quality and incomplete wave spectra. To overcome the problem, a novel deep learning model SAR2WV for ocean wave spectra retrieval based on Pix2pix is proposed by constructing the nonlinear mapping relationship of SAR cross spectra and ocean wave spectra. A total of 106,844 Sentinel-1 wave mode dataset along with the corresponding ECMWF ERA 5 wave data are processed and used for training the SAR2WV model. Experiments demonstrate that the proposed SAR2WV model can significantly improve the accuracy of the retrieved wave spectra and wave parameters, with the spectra similarity improved by 60.3%, root mean square error (RMSE) of significant wave height (SWH) decreased from 0.966 m to 0.386 m, RMSE of mean wave period (MWP) decreased from 1.208 s to 0.811 s, and correlation coefficient of peak wave direction increased from 0.65 to 0.72, which achieves better performance than Ocean Swell Wave Spectra (OSW) algorithm and other methods.
Article
Sea surface temperature (SST) is one of the most important parameters in the global ocean-atmosphere system. Predicting SST can help to analyze and identify extreme weather and protect marine environment in advance. Traditional numerical and machine learning methods tend to ignore spatial features. The single model in existing deep learning methods suffers from weakening spatial features and reducing ability of discriminating time-series information. At the same time, rare consideration about the influence of ocean physical phenomena has been given. These will lead to inaccurate prediction results. Based on the spatial-temporal characteristics and physical laws of the SST field, this paper proposes a hybrid multi-scale spatial-temporal model combining error correction map (ECM-HMSTM) to predict the SST. First, the ECM-HMSTM can comprehensively extract the spatial-temporal features of the SST field at different scales and thus the SST prediction map can be obtained. Second, by a new error correction approach based on the activity of tropical instability waves (TIWs), the ECM-HMSTM can effectively predict the variation characteristics of TIWs signals, which results in producing the error correction maps. Third, by fusing the two above maps, the SST field in the tropical eastern Pacific Ocean after five days is predicted. Experiment results show that the accuracy of the ECM-HMSTM was improved by 10.3% compared with the current state-of-the-art deep convolution model. Moreover, the SST predicted by the ECM-HMSTM performs well on characterizing the intensity of TIWs. Therefore, this paper provides a strategy for effective short-term prediction of SST fields, which is of guidance for prediction and analysis of ocean phenomena and climate.
Article
Onboard information fusion for multisatellites, which is based on spatial computing mode, can improve the satellites’ capability, such as the spatial–temporal coverage, detection accuracy, recognition confidence, position precision, and prediction precision for disaster monitoring, maritime surveillance, and other emergent or continuous persistent observing situations. First, we analyze the necessity of onboard information fusion. Next, the recent onboard processing developments are summarized and the existing problems are discussed. Furthermore, the key technologies and concepts of onboard information fusion are summarized in the fields of feature representation, association, feature-level fusion, spatial computing architecture, and other issues. Finally, the future developments of onboard information fusion are investigated and discussed.