ArticlePDF Available

Design and Image Research of Tennis Line Examination Based on Machine Vision Analysis

Authors:

Abstract and Figures

In view of the intelligent demand of tennis line examination, this paper performs a systematic analysis on the intelligent recognition of tennis line examination. Then, a tennis line recognition method based on machine vision is proposed. In this paper, the color region of the image recognition region is divided based on the region growth, and the rough estimation of the court boundary is realized. In order to achieve the effect of camera calibration, a fast camera calibration method which can be used for a variety of court types is proposed. On the basis of camera calibration, a tennis line examination and segmentation system based on machine vision analysis is constructed, and the experimental results are verified by design experiments. The results show that the machine vision analysis-based intelligent segmentation system of tennis line examination has high recognition accuracy and can meet the actual needs of tennis line examination.
This content is subject to copyright. Terms and conditions apply.
Research Article
Design and Image Research of Tennis Line Examination Based on
Machine Vision Analysis
Liu Yan
1
and Sun Xin
2
1
College of Sports Science, Harbin Normal University, Harbin 150025, China
2
Harbin Finance University, Harbin 150030, China
Correspondence should be addressed to Sun Xin; 2016121993@jou.edu.cn
Received 25 July 2021; Accepted 21 August 2021; Published 21 September 2021
Academic Editor: Bai Yuan Ding
Copyright ©2021 Liu Yan and Sun Xin. is is an open access article distributed under the Creative Commons Attribution
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is
properly cited.
In view of the intelligent demand of tennis line examination, this paper performs a systematic analysis on the intelligent
recognition of tennis line examination. en, a tennis line recognition method based on machine vision is proposed. In this paper,
the color region of the image recognition region is divided based on the region growth, and the rough estimation of the court
boundary is realized. In order to achieve the effect of camera calibration, a fast camera calibration method which can be used for a
variety of court types is proposed. On the basis of camera calibration, a tennis line examination and segmentation system based on
machine vision analysis is constructed, and the experimental results are verified by design experiments. e results show that the
machine vision analysis-based intelligent segmentation system of tennis line examination has high recognition accuracy and can
meet the actual needs of tennis line examination.
1. Introduction
Tennis, as one of the hottest ball games, has gained the at-
tention of many fans all over the world. In order to ensure the
fairness of tennis match, the third-party evaluators are re-
quired to participate in the evaluation of the game, called line
review. In the competition, the tennis ball flies very fast, so
whether the ball is out of bounds the moment it hits the
ground needs scientific evaluation. e traditional line ex-
amination is judged by the naked eye, which is controversial.
e emergence of “eagle eye” technology further improves the
fairness of tennis line examination. At present, eagle eye
system is mostly “instant playback” system; that is, in case of
dispute, it is judged by applying for playback, and the eagle eye
coverage is in advance. erefore, not all cases can challenge
the form of new assessment, so the accuracy and timeliness of
current tennis line review need to be further improved [1].
At present, the application of machine vision in sports
video analysis is more, mainly in football and basketball. In
football and basketball, match support has matured. Although
the heat of tennis video analysis is not comparable to football
and basketball, its environment is more complex and requires
higher quality of machine vision recognition. At present, the
requirements of sports video visual recognition are mainly
reflected in the above aspects, which are vision, text, and
hearing [2]. For tennis, the time detection of tennis video
mainly starts from visual information, such as lens type
analysis, detection, and tracking of players and court. Tennis
competition is affected by the court, angle, and other complex
factors, resulting in the judgment result of the ball landing out
of bounds. erefore, high-quality machine vision recognition
method is needed for video and image analysis to eliminate
complex background interference and improve the accuracy
of tennis line review [3].
Based on the above requirement analysis, the image
processing technology based on machine vision is proposed.
In this paper, combined with the actual needs of tennis line
review, the design of tennis line review machine vision
assistant system and its effect are analyzed. is article
further improves the recognition accuracy of tennis line
review through visual and algorithm improvements, which
has a certain effect on the development of tennis line review.
Hindawi
Computational Intelligence and Neuroscience
Volume 2021, Article ID 2436120, 11 pages
https://doi.org/10.1155/2021/2436120
2. Construction of the Machine Vision
Algorithm Model
2.1. Model Derivation. It is inevitable that objects with the
same color as the court will appear in the video frame of
tennis match. erefore, it is necessary to remove these
noncourt factors in the detection of court cashing. In this
paper, local entropy is introduced as texture feature, and the
uncertainty of random variables is quantitatively processed
by entropy. In image processing, entropy can be used to
measure image homogeneity, which is expressed as follows
[4]:
H 􏽘
255
i0
Pilog Pi,
PiNi
N.
(1)
Here, Ni represents the pixel gray value, N represents the
number of pixels with gray value in the region, and Pi
represents the probability of gray value of. e main color
filtering algorithm is constructed by image local entropy,
and its flow can be expressed as the result shown in Figure 1.
After filtering the main color, a color filter can be obtained,
which is represented as a binary image in the computer. In
the image, the pixel with the corresponding value of 1 in the
video frame is marked as the main color; otherwise, it is the
nondominant color. After that, the local entropy of the video
frame can be calculated and processed, the local entropy
image binarization processing is carried out through the
adaptive threshold, the optimized filter is obtained through
the fusion processing, and the main color detection is re-
alized by the filter [5].
On the basis of the above main color filtering algorithm,
the frame diagram of the court segmentation system is
constructed, as shown in Figure 2. e camera is calibrated
by the main color filtering algorithm, and the court sideline
is accurately segmented based on the AOC segmentation
algorithm [6].
2.2. Calibration. e camera calibration process mainly
includes three steps: the detection of court sideline, the
analysis of court type, and the solution of optimal
homography matrix. e model combines with the court
sideline detection to obtain the four benchmark points
calibrated by the camera. In this paper, the tennis sideline is
identified. In the actual recognition, only part of the court
area will be identified in real time at a certain time.
erefore, the algorithm complexity can be reduced by
reducing the parameter space of the optimal unit matrix
solving stage [7]. e tennis court segmentation based on
AOC mainly uses the camera calibration results to calculate
the proportion of the court part in the court model area and
projects the region into the image through the homography
matrix and finally obtains the accurate segmentation result.
Next, we analyze the algorithm of the process [8].
e working principle of pinhole camera is as follows:
light is emitted from a distance and projected into an image
plane through the camera pinhole. If the focal length of the
camera is f, the distance between the camera and the object
is Z, any point of the object can be expressed as X, and the X
corresponding point on the image plane is x; then [9],
x
fi
X
Z.(2)
By exchanging pinholes and image planes, (2) can be
expressed as another mathematical expression [10]:
x
fi
X
Z.(3)
e reason for the lack of symbol on the left side of
equation (3) is that the target image is not inverted after the
pinhole is exchanged with the image plane.
In theory, the center point of the image is the main point,
which is also the intersection of optical axis and image plane.
In fact, the main point is not on the optical axis. By in-
troducing two parameters cxand cyand modeling the
position offset of the main point relative to the optical axis,
the relationship between point Q(X, Y, Z)and its projection
point (xscreen, yscreen )can be expressed as [11]
Main color extraction
based on AGMM Calculate local entrophy image
Main color rectangle
area search
Find the main color rectangle?
yes no
Calculate local
entropy threshold
Set the default threshold
Entropy to entropy filter Entropy filter and color
filter do intersection
Figure 1: Main color filtering algorithm flow.
2Computational Intelligence and Neuroscience
xscreen fx
X
Z
􏼒 􏼓+cx,
yscreen fy
Y
Z
􏼒 􏼓+cy.
(4)
Here, the focal length in horizontal and vertical direc-
tions is different because the imaging shape on the pixel
meter is rectangular rather than squared.
e process of coordinate transformation can be realized
by projection transformation, and the imaging process of
camera itself is projection transformation. e point in the
real world is transformed into the corresponding coordinates
of the camera image through the camera, which can be
expressed in mathematical form. It can be realized by a
homography matrix. rough DLT (Direct Linear Trans-
form) algorithm, homography matrix can be solved effectively
when enough corresponding coordinate points are given. In
tennis match, the transformation relation of the sum of the
boundary points of the court can be expressed as [12]
S
u
v
1
H
x
y
1
.(5)
Here, Sis an invariant nonzero vector u v 1
􏼂 􏼃T,x
can
be expressed as x y 1
􏼂 􏼃T,H
h1h2h3
h4h5h6
h7h8h9
, and, in this
study, His a 3 3 matrix. erefore, if we can find four
points (Xi:(xi, yi,1), i 1,2,3,4)in the real court and the
corresponding four points (Ui:(ui, vi,1), i 1,2,3,4)in the
image, we can solve the homography matrix [13]:
x1y11 0 0 0 x1u1y1u1
0 0 0 x1y11x1v1y1v1
x2y21 0 0 0 x2u1y2u2
0 0 0 x2y21x2v2y2v2
x3y31 0 0 0 x3u3y3u3
0 0 0 x3y31x3v3y3v3
x4y41 0 0 0 x4u3y4u4
0 0 0 x4y41x4v3y4v3
h1
h2
h3
h4
h5
h6
h7
h8
u1
v1
u2
v2
u3
v3
u4
v4
.(6)
us, the homography matrix can be obtained.
In the process of storing the golf course model, it is
necessary to store the position of the court boundary line,
which is usually realized by the way of configuration file
storage. In this paper, the configuration file is stored in the
format of key value, which mainly includes three attributes,
namely, tennis court height (field), tennis court width (field),
and the tennis field. According to the position of the sideline,
it can be divided into left court boundary (field lines) and
right field. With this setting, it is easy to search some frames
in a centralized way. On this basis, the court can be further
divided into horizontal lines and vertical lines. e
boundaries of each line are shown in the model standard
format: line “(x
1
Y
1
), (x
2
Y
2
)” [14].
In the calibration process, four lines (horizontal line and
vertical line are both two) are selected from the field model
for calibration. In order to avoid repeated calculation of
calibration points, the vertical line should be sorted from left
to right, and the horizontal line from top to bottom. For the
straight line of the court model, you can control the field in
the configuration file. e lines set is sorted by manually
planning the line order. It can be sorted only once, while the
straight lines in the image can be sorted according to the
distance between these lines and the datum point, which is
represented as ω. Set the reference point as the midpoint of
the left boundary and the key point of the upper boundary of
the image, as shown in Figure 3 [15].
e horizontal line of the image is represented as h1, and
the vertical line of the image is represented as v1. e
horizontal and vertical lines in the model are represented as
h1
and h2
, respectively. e subscript of the line is set
according to the distance from the line to the reference
point. If the central line of the set of horizontal lines exists,
ω(hl)ω(hl+1), as shown in Figure 4 [16].
Stadium segmentation result
Aoc
calculation
Aoc back projection
Aoc part
Camera calibration
General optimal
homography
matrix solution
Field edge detection
Video frame Field detection and field
boundary detection
Figure 2: Stadium segmentation system.
Computational Intelligence and Neuroscience 3
When solving the optimal homography matrix, it is
mainly realized by processing all line combinations. Specif-
ically, two horizontal lines are randomly obtained from the
image set, which are represented as hiand hk, respectively,
and the two horizontal lines are randomly returned from the
established tennis court model and are, respectively, repre-
sented as hi
and hk
,i>k. e same method is used to obtain
two points from the tennis court image and the tennis court
model, and a combined image is obtained through the in-
tersection of two and two, and the four intersection points are
calculated, as shown in the following formula [17]:
p1hi×vm,
p2hi×vn,
p3hk×vm,
p4hk×vn.
(7)
e image is subdivided into the left and right court
modes for space reduction, but the overall parameter size is
still large; if the algorithm detects more than one straight
line, this will lead to a rapid increase in data volume,
resulting in a longer data calculation time, and finding the
results of model parameters is more time-consuming;, the
actual application of the algorithm is not good and obviously
cannot meet the timely needs of tennis tournaments; in
order to improve the efficiency of tennis line review, further
improvement is needed [18].
When there are many parameters in the model, it is
difficult to find the optimal solution. erefore, this paper
uses the method of not seeking Hmatrix and other follow-up
operations to improve the efficiency of solving the optimal
solution. Two pruning methods are introduced in the system
to reduce the time complexity and eliminate the obvious
errors in the system calculation process.
e first method of pruning is to estimate the height of
the court through two image horizontal lines and the cor-
responding horizontal lines in the corresponding model. If
the actual heights of the two images are too far, they will be
discarded directly, and the vertical lines will be operated the
same way. rough this pruning method, the score calcu-
lation of homography matrix solution can be effectively
avoided, and the amount of calculation can be reduced [19].
Another pruning method is to get the homography
matrix by calculating the calibration set of the system. e
homography matrix itself is estimated. It is obviously im-
possible to obtain the value of the optimal solution by
discarding the homography matrix. Although this process is
accompanied by the calculation of the homography matrix,
the calculation process is fast. In the process of image
processing, it is necessary to reflect the golf course model to
the corresponding image of the model. e mapping process
is one-to-one corresponding, so there is a certain time loss in
the process.
ere are eight degrees of freedom in the homography
matrix, which are camera internal parameter 3, camera
rotation 3, translation parameter 1, and focal length
parameter 1. e remaining parameters are related to dif-
ferent directional scaling; that is, the horizontal and vertical
directions show different zoom ratios in the process of
zooming. e camera imaging process can be expressed as [20]
piHpi
f0ox
0f oy
0 0 1
r00 r01 r02 tx
r10 r11 r12 ty
r20 r21 r22 tz
1 0 0 0
0β0 0
0 0 1 0
0 0 0 1
x
y
z
0
1
.(8)
Here, the focal length is denoted by f. In fact, the an-
isotropic scaling can be ignored in the real world, so βcan be
set to 1, which is one of the basic conditions for pruning.
2.3. e Tennis Court Is Precisely Divided. In this paper, the
model is designed to accurately divide the side of the court
for the needs of tennis wire review, and the process divides
the court by a single matrix, which mainly consists of two
parts, namely, AOC calculation and AOC reverse projection.
vn
vm
p4
p2
p1
p3
hi
hk
Figure 4: Acquisition of a fixed point.
1
2
3
4
5
Figure 3: e tennis court line is sorted.
4Computational Intelligence and Neuroscience
AOC is defined as the area captured by the camera. In tennis,
the camera is not filmed in the whole process. In order to
improve the identification effect of the court, the local area is
generally shot in high definition to improve the effect of line
inspection. erefore, by calculating AOC, the precise di-
vision of the pitch boundary can be achieved. AOC’s cal-
culation process consists of four main steps, namely,
coordinate mapping, field shape boundary recognition,
coordinate detection, and AOC generation. Mapping co-
ordinates to a model image results in a standard shape pitch
boundary model after projection, due to different camera
angles, or produces two types of polygon shapes, as shown in
Figure 5 [21].
e projection point under the top view angle is rep-
resented as (Pi1
, Pi2
, Pi3
, Pi4
), and the top point of the court
under the top view angle is represented as
(PT1, PT2, PT3, PT4).(Pi1
, Pi2
, Pi3
, Pi4
), the connecting lines
of the vertices, form a convex polygon or a self-intersecting
polygon. e two situations are identified, respectively. If the
result is a convex polygon, Aiis used to represent the co-
incidence area between the polygon and the court; other-
wise, Aiis used to represent the coincidence area between
the polygon extension line and the court.
In order to solve problem Ai, we need to calculate three
point sets. First, we calculate the projection to get the polygon
(Pi1
, Pi2
, Pi3
, Pi4
)and the intersection point of the court
boundary. Second, the tennis court vertex (Pi1
, Pi2
, Pi3
, Pi4
)in
the polygon area is searched, and then the four vertices in the
tennis court are searched. e rearrangement of these points
is completed by AOC generation process. Every two adjacent
vertices form an edge of the polygon. In both cases, the point
set can be represented as (Qi, Pi2
, Pi3
, Pi4
, PT3)and
(Qi, Pi2
, Pi3
, Pi4
, Q2), respectively.
AOC reverse projection is carried out after rearing,
which is the last step in the operation of the model, and
the single matrix results are projected back to the model
image, resulting in a more accurate reverse projection
result, thus obtaining the field boundary distribution
result.
3. Image Processing
Although the image is grayed, filtered, and threshold-seg-
mented, there are still some small particles in the image and
the contour is not clear. erefore, it is necessary to process
the image morphology. Image morphology mainly removes
the unwanted information from binary image and
strengthens the needed information. Morphological pro-
cessing is mainly divided into pixel morphological pro-
cessing and regional morphological processing. It can be
seen from the image after threshold segmentation that there
are pixels in the contour of the target which should be the
target but divided into background points. At the same time,
there are pixels that should be background but are divided
into targets. erefore, morphological processing should be
carried out on these misjudged pixels first. e morphology
of image is similar to image filter operator. It is to replace the
original pixel with a certain shape of structural elements and
pixels in the domain and then study the feature information
of the image. e basic treatment of morphological treat-
ment is corrosion and expansion, and another form of
morphological processing is based on corrosion and
expansion.
3.1. Corrosion. Corrosion is to move the corrosion element
from left to top and from top to bottom in image A. Only
when it is the same as the structure element will it be
retained, and different places will be removed. e etching of
binary image with structural element B can be expressed as
follows.
B(x)is the structural element and A is the original
image. e corrosion of gray scale image can be expressed by
the following formula:
AΘBx|B(x)A
{ },
(fΘb) min f(s+x, t +y) b(x, y)|(sx, t y)
􏼈
Df;(x, y)Db􏽯.
(9)
Corrosion is mainly used to eliminate the high bright-
ness of the isolated pixels in the image, which can refine the
contour of the target. As a result, the high brightness area of
the image becomes smaller and the dark brightness area is
expanded.
3.2. Expansion. In contrast to dilation and erosion, when
traversing all pixels in image A, as long as the intersection
with structural elements is not empty set, the binary image
expansion can be expressed by the following formula:
(fΘb) max f(sx, t y) + b(x, y)|(sx, t y)
􏼈
Df;(x, y)Db􏽯.
(10)
Dilation can fill the holes and gaps of the image and also
can expand the outline of the image. For the gray image, it
can expand the brightness area and shrink the dark
brightness area.
3.3. Hole Filling. Hole filling is an algorithm based on set
expansion, complement, and intersection. e main process
of hole filling is to seed a background pixel in the image
boundary. According to the principle of 8-connected,
foreground 1 is used to fill the background outside the
Pr1Pr1
Pr2Pr2
Pr4Pr4
Pr3Pr3
P
i1
P
i1
P
i4
P
i4
P
i3
P
i3
P
i2P
i2
Q1
Q1
Q2
Q2
Ai
Ai
Figure 5: Two types of AOC.
Computational Intelligence and Neuroscience 5
particle, and then the image filled with background is re-
versed to obtain the hole image represented by foreground
color L. e hole filling can be completed by adding the hole
image represented by foreground color Land the original
binary image. It can be expressed as follows:
XkXk1B
􏼁Ac
k1,2,3,· · · .(11)
3.4. Morphological Treatment of Granules. Particles refer to
the region composed of nonzero or high gray pixels con-
nected with each other in the image. Particle morphology
processing includes particle separation, image labeling,
particle region division, and particle filtering. In this paper,
the image is filtered by two values, that is, the particle size
and the particle area. Users can choose the filtering standard
according to different needs. In this paper, the nontarget
particles in the image are removed according to the particle
area.
e edge of an image refers to the pixels with mutation in
the image, which can be connected to form the outline of the
image. Edge detection can be completed by one or two
derivatives. e main detection algorithms are gradient
detection, Robert detection, Sobel detection, Prewitt de-
tection, and canny detection.
3.4.1. Gradient Detection. Gradient operator is calculated by
calculating partial derivatives zf/zxand zf/zyof each pixel
in the image. For partial derivatives, it can be approximately
expressed as
zf
zxf(x+1) f(x),
zf
zyf(y+1) f(y).
(12)
en, the gradient operator can be expressed as follows:
gxzf(x, y)
zxf(x+1, y) f(x, y),
gyzf(x, y)
zyf(x, y +1) f(x, y).
(13)
3.4.2. Roberts Operator. Robert crossover operator can ex-
tract the edge in diagonal direction. Suppose that there is a
region of size 3 ×3, in which the element is aij , and the
operator can be expressed as follows:
gxzf
zxa33 a22
􏼁,
gyzf
zya32 a23
􏼁.
(14)
Robert operator is used to extract the image contour.
Compared with the gradient operator, the edge is well
extracted, and the contour extraction is continuous. Al-
though it looks a little discontinuous from the diagram, it is a
continuous boundary after zooming in.
3.4.3. Prewitt Operator. Prewitt operator is improved from
Robert operator. Prewitt operator takes into account the
property of center point to end data and has more infor-
mation about edge direction. For a region of 3 ×3 size, the
element is aij. e Prewitt operator can be used to represent
the following:
gxzf
zxa31 +2a32 +a33
􏼁a11 +2a12 +a13
􏼁,
gyzf
zya13 +2a23 +a33
􏼁a11 +2a21 +a31
􏼁.
(15)
3.5. Canny Operator. Canny operator is the most excellent
operator among the basic edge detectors. Firstly, Canny
operator smoothes the image with a Gaussian filter. Let
G(x, y)represent the Gaussian function and represent
input images; then smoothing the input image can be
expressed as
g(x, y) G(x, y) · f(x, y),
G(x, y) ex2+y2/2σ2
( ).
(16)
e second step is to calculate the gradient value and
angle of the image:
M(x, y) ����������������������
zg(x, y)
zx
􏼠 􏼡2
+zg(x, y)
zy
􏼠 􏼡2
􏽶
􏽴,
α(x, y) arctan zg(x, y)/zx
zg(x, y)/zy
􏼢 􏼣.
(17)
e third step is to carry out nonmaximum suppression
on the gradient amplitude image and compare the horizontal
angle of 45and vertical angle of +45with the image to
find the closest angle direction. If the amplitude is at least
less than one of the two neighbors of the closest angle di-
rection, then gN(x, y) 0 (suppression), and then
gN(x, y) M(x, y), so as to obtain the image after non-
maximum suppression, that is, gN(x, y).
e last step is to use double threshold processing and
connection analysis to detect and connect edges. e pro-
cessing of low threshold TLand high threshold THcan be
regarded as the addition of two images as follows:
gNH(x, y) gN(x, y)TH,
gNH(x, y) gN(x, y)TH.(18)
At the beginning, the pixel values of the two images are
set to zero. After threshold processing, the nonzero pixels in
the high threshold image are removed by subtracting the low
threshold image from the high threshold image, and the
6Computational Intelligence and Neuroscience
strong pixels in the high threshold image will be marked as
edge pixels.
In Canny operator, we can change the value of O, low
threshold, high threshold, and window size to achieve the
purpose of contour extraction. Canny operator can extract
contour well, but it takes more time.
3.6. e Contour Is Extracted by Morphological Processing.
Morphological contour extraction is based on the evolution
of corrosion expansion, and there are two main extraction
methods: inner gradient boundary and outer gradient
boundary. If Ais used to represent the original image, Bis
the structural element, and β(A)is the boundary of image A,
and then the inner and outer gradient boundaries can be
expressed as follows:
β(A) A (AΘB),
β(A) (AB) A. (19)
It can be seen from the formula that the inner gradient
boundary first etched image Awith structural element B, and
then the boundary can be obtained by subtracting the
corroded image with image A. e extraction of external
gradient is to expand image Awith structuring element B
first and then subtract image Afrom the expanded image to
obtain the outer gradient boundary. Different choices of
structural elements can achieve different effects. e larger
the structure element is, the thicker the boundary will be.
Compared with these six methods, the gradient operator,
Sobel operator, and Prewitt operator can extract the contour,
but the contour is not continuous, and further processing is
needed. Roberts operator, Canny operator, and morpho-
logical extraction can extract the contour of the target
completely. Although Canny operator is the best operator, it
consumes more time and increases the processing time of
machine vision system. Roberts operator can also extract the
contour, but the contour extraction has a little dilation effect,
which will cause errors for the next measurement; and
morphology can well propose the contour of the target, so
this paper selects the contour extraction of morphology.
4. Model Construction
Based on the detection and image processing of the front-
end tennis court, the requirements of the line review of the
tennis court are analyzed and the system prototype is
designed. e system constructed in this paper mainly de-
signs a structure with three tiers, which are application layer,
data access layer, and data connection layer. e hierarchical
structure of the overall design of the system is shown in
Figure 6:
(1) e application layer is the application process of
sports video moving object detection and tracking
system. It is mainly the data used when users access
sports video management module, video acquisition
management module, target detection management
module, and target tracking management module.
(2) e data access layer is the process of accessing data
in the system. e sports video moving object de-
tection and tracking system uses the target tracking
and target detection algorithm to access the data in
the database.
(3) e data connection layer is used to store the data
information in the system, such as video informa-
tion, user information, video detection information,
and video tracking information. e functional
structure of the system is shown in Figure 7.
e target detection management function is to detect
and analyze the target in the sports video collected, which
can be used as the analysis data in the training. e main
executors of this function are video analysis users. is
function mainly includes four functions: target detection,
target model establishment, target model updating, and
detection result display. e target detection diagram is
shown in Figure 8. Firstly, the video analysis manager enters
the video target detection management module with his
username and password and then submits the video target
detection request. After the detection is completed, the
detection results are displayed.
5. Experimental Analysis
In order to verify the validity of the model, this paper collects
the video design experiment of tennis match from the In-
ternet, selects multiple sets of image frames from the video as
the test set, and compares the algorithm with the histogram
Sports video management
Application layer
Video capture
Target Detection Target Tracking
Data access layer
Sports video source data Target detection
and tracking
Hibernate frame work
Data connection layer
Video information User Info Video detection
information
Video tracking
information
Figure 6: Hierarchical structure of the overall system design.
Computational Intelligence and Neuroscience 7
Video target detection module
Submit video target
detection request Submit user ID
Target Detection Tennis line review Log in system
Database Submit video
target
detection
information
Enter
username
and
password
Figure 8: Collaboration diagram of target detection function.
Target model
update
Build the
target model
Establish tracking
target instance
Target matching
processing
Target Detection Test result display Create target
template
Lost tracking
processing
Target Detection Target Tracking
Video retrieval Video maintenance Video input and
initializaion
Sequence image
acquisition
Video upload Video download Read file Get camera video
Tennis video management Video capture
Figure 7: Overall function structure of the system.
8Computational Intelligence and Neuroscience
method and the hybrid Gauss model. ese image frames
are recognized by the test set images to compare the final
recognition results. Firstly, the extraction effect of pri-
mary colors was analyzed. e accuracy test results of
different primary colors were shown in Table 1 and
Figure 9. It can be found from the test results that the
model built in this paper has a remarkable effect; the
analysis is that the system in this paper will constantly
update the parameters in its operating sheet and filter out
some color interference.
Next, divide the field of the model area to determine
whether the tennis ball falls outside the divided limit
area. Taking each selected video frame as the test set, the
court and the main color region recognition method are
studied through the AOC method. When the tennis ball
is determined, since all the hitting results in the video
frame are known, the recognition results can be com-
pared with the actual results in order to get the recog-
nition accuracy.
After expanding the experiment, the identification of
whether the tennis ball’s landing moment was out of
boundary was judged on the basis of the AOC method and
the main color area method, respectively, and the results are
shown in Table 2 and Figure 10.
From the above graph segmentation results, the recogni-
tion method based on the main color region belongs to a rough
recognition algorithm. For tennis competition, the environ-
ment of the court is more complex, so it is difficult to get more
accurate recognition results only through this method. e
recognition algorithm based on AOC is more accurate, and the
identification results can be effectively improved by calibrating
the camera to meet the actual needs of tennis line review.
Table 1: Comparison table of the accuracy of the main color extraction.
Video sequence group Histogram method (%) GMM method (%) Method of this article (%)
1 81.51 85.29 91.42
2 94.67 94.51 97.97
3 87.20 89.04 95.69
4 93.15 95.60 98.73
5 86.38 88.42 89.95
Figure 9: Comparison of the accuracy of the main color extraction.
Table 2: A comparison table of the accuracy of the identification method.
e video sequence A field boundary detection algorithm based on the main color area (%) AOC-based pitch segmentation algorithm
(%)
Tennis video 1 84.21 97.44
Tennis video 2 92.42 98.34
Tennis video 3 89.53 98.76
Tennis video 4 88.43 97.65
Tennis video 5 89.51 98.35
Computational Intelligence and Neuroscience 9
6. Conclusion
is paper analyzes the accuracy of traditional tennis line
review and proposes an intelligent recognition model of
tennis line review based on AOC with the support of ma-
chine vision algorithm and verifies the performance of the
model. e process of coordinate transformation can be
realized by projection transformation, and the imaging
process of camera itself is projection transformation. e
point in the real world is transformed into the corresponding
coordinates of the camera image by the camera. e process
can be expressed in mathematical form and realized by a
homography matrix. e model designed in this paper can
accurately segment the sideline of tennis court according to
the needs of tennis line review. e process divides the
course by homography matrix, which includes two parts:
AOC calculation and AOC back projection. e perfor-
mance analysis of the model constructed in this paper shows
that the recognition algorithm based on AOC is more ac-
curate. By calibrating the camera, the recognition results can
be effectively improved to meet the actual needs of tennis
line review.
is paper only conducts system performance verifica-
tion through theoretical research and a small number of
image experiments, so it is necessary to further expand the
research and practice, and, in the follow-up research, it is
necessary to verify the system in combination with actual
competitions.
Data Availability
e data used to support the findings of this study are
available from the corresponding author upon request.
Conflicts of Interest
e authors declare that they have no conflicts of interest.
Acknowledgments
is work was supported by Harbin Normal University and
Harbin Finance University.
References
[1] P. S. Glazier and S. Mehdizadeh, “Challenging conventional
paradigms in applied sports biomechanics research,” Sports
Medicine, vol. 49, no. 2, pp. 171–176, 2019.
[2] L. G. Appelbaum and G. Erickson, “Sports vision training: a
review of the state-of-the-art in digital training techniques,”
International Review of Sport and Exercise Psychology, vol. 11,
no. 1, pp. 160–189, 2018.
[3] J. Yang, “e simulation of table tennis during the course of
sports,” Caribbean Journal of Science, vol. 52, no. 4,
pp. 1561–1564, 2019.
[4] M. Bulat, N. Korkmaz Can, Y. Z. Arslan, and W. Herzog,
“Musculoskeletal simulation tools for understanding mech-
anisms of lower-limb sports injuries,” Current Sports Medicine
Reports, vol. 18, no. 6, pp. 210–216, 2019.
[5] A. Lopatiev, O. Ivashchenko, O. Khudolii et al., “Systemic
approach and mathematical modeling in physical education
and sports,” Journal of Physical Education and Sport (JPES),
vol. 17, no. 1, pp. 146–155, 2017.
[6] L. YanRu, “An artificial intelligence and machine vision based
evaluation of physical education teaching,” Journal of Intel-
ligent and Fuzzy Systems, pp. 1–11, 2020.
[7] J. P. R. Lara, C. L. R. Vieira, M. S. Misuta, F. A. Moura, and
R. M. L. d. Barros, “Validation of a video-based system for
automatic tracking of tennis players,” International Journal of
Performance Analysis in Sport, vol. 18, no. 1, pp. 137–150,
2018.
[8] G. Tsagkatakis, M. Jaber, and P. Tsakalides, “Convolutional
neural networks for the analysis of broadcasted tennis games,”
Electronic Imaging, vol. 2018, no. 2, pp. 206–211, 2018.
[9] T. Fernando, S. Denman, S. Sridharan et al., “Memory aug-
mented deep generative models for forecasting the next shot
location in tennis,” IEEE Transactions on Knowledge and Data
Engineering, vol. 32, no. 9, pp. 1785–1797, 2019.
[10] N. Elliott, S. Choppin, S. Goodwill, T. Senior, J. Hart, and
T. Allen, “Single view silhouette fitting techniques for esti-
mating tennis racket position,” Sports Engineering, vol. 21,
no. 2, pp. 137–147, 2018.
[11] S. A. Kovalchik, J. Sackmann, and M. Reid, “Player, official or
machine?: uses of the challenge system in professional tennis,”
International Journal of Performance Analysis in Sport, vol.17,
no. 6, pp. 961–969, 2017.
[12] B. Giles, S. Kovalchik, and M. Reid, “A machine learning
approach for automatic detection and classification of changes
of direction from player tracking data in professional tennis,”
Journal of Sports Sciences, vol. 38, no. 1, pp. 106–113, 2020.
[13] M. R. Keyvanpour, S. Vahidian, and M. Ramezani, “HMR-vid:
a comparative analytical survey on human motion recogni-
tion in video data,” Multimedia Tools and Applications,
vol. 79, no. 43, pp. 31819–31863, 2020.
[14] Y.-F. Ji, J.-W. Zhang, Z.-h. Shi, M.-H. Liu, and J. Ren, “Re-
search on real - time tracking of table tennis ball based on
machine learning with low-speed camera,” Systems Science
and Control Engineering, vol. 6, no. 1, pp. 71–79, 2018.
Tennis video
3
Tennis video
4
Tennis video
5
Tennis video
2
Tennis video
1
e video sequence
75
80
85
90
95
100
Comparison of the accuracy of the identification
method. (%)
An Algorithm for Field Border Detection Based on
Dominant Color
Aoc-based stadium segmentation algorithm
Figure 10: Comparison of the accuracy of the identification
method.
10 Computational Intelligence and Neuroscience
[15] E. E. Cust, A. J. Sweeting, K. Ball, and S. Robertson, “Machine
and deep learning for sport-specific movement recognition: a
systematic review of model development and performance,”
Journal of Sports Sciences, vol. 37, no. 5, pp. 568–600, 2019.
[16] R. Gayanov, K. Mironov, R. Mukhametshin, A. Vokhmintsev,
and D. Kurennov, “Transportation of small objects by robotic
throwing and catching: applying genetic programming for
trajectory estimation,” IFAC-PapersOnLine, vol. 51, no. 30,
pp. 533–537, 2018.
[17] J. Zhang, H. Yu, H. Deng et al., “A robust and rapid camera
calibration method by one captured image,” IEEE Transac-
tions on Instrumentation and Measurement, vol. 68, no. 10,
pp. 4112–4121, 2018.
[18] O. Cant, S. Kovalchik, R. Cross, and M. Reid, “Validation of
ball spin estimates in tennis from multi-camera tracking
data,” Journal of Sports Sciences, vol. 38, no. 3, pp. 296–303,
2020.
[19] M. R. Mohammadi, “Deep multiple instance learning for
airplane detection in high-resolution imagery,” Machine
Vision and Applications, vol. 32, no. 1, pp. 1–14, 2021.
[20] Y. Zhao, R. Xiong, and Y. Zhang, “Model based motion state
estimation and trajectory prediction of spinning ball for ping-
pong robots using expectation-maximization algorithm,”
Journal of Intelligent and Robotic Systems, vol. 87, no. 3,
pp. 407–423, 2017.
[21] M. Fern´
andez L´
opez, “Research on the specific movement of
the head in tennis strokes,” ITF Coaching and Sport Science
Review, vol. 28, no. 80, pp. 16–19, 2020.
Computational Intelligence and Neuroscience 11
... Zhao Z applied the PSO-BP neural network algorithm to airport passenger prediction classification based on risk [8] . Yan L used computer vision to realize tennis recognition in 2021, which proved the feasibility of the application of intelligent algorithms in the field of sports [9] . ...
Article
Full-text available
In the era of big data, sports performance prediction becomes critical by analyzing accumulated competition data, especially the influence of momentum factors on competition results, although its definition and quantification are controversial in academia. This study explores the application of momentum in predicting the results of sports competitions and proposes a comprehensive research framework combining sports science and sports psychology. By collecting and analyzing the competition data, the feasibility of momentum as an important index in predicting the results of the competition is confirmed. The results show that momentum not only reflects the immediate performance of athletes in the game but also predicts their future results. This finding is of great significance for coaches and athletes to formulate strategies and adjust tactics in the game. Through continuous accumulation and analysis of data, we can more accurately predict the results of the game, and improve the level and fairness of sports competition.
... In comparing the two angles, the overlapping of a service line and a net strap was the most critical difference. In the optimal angle video, every line in the court is visible, leading to clear visual information without interference, which is necessary for the accuracy of line review (Yan et al., 2021;Yang, 2019). On the other hand, in the suboptimal angle, the net cord overlaps the service line in the upper half court, creating a visual hindrance. ...
Article
Full-text available
The present study examines the validity of the SwingVision application by comparing SwingVision’s data to criterion data. Also, we investigated the difference in recording angles of SwingVision (optimal and suboptimal). Six college students played four matches, and every match was recorded from two different angles. After the data collection, recorded videos were analyzed by SwingVision and human analysts (criterion). A total of 1065 strokes were analyzed in the agreement of SwingVision and human analysts. Cross-tabulation with a column proportion test, Cochran’s Q test, and Kappa statistics were utilized to demonstrate the association of categorical variables (stroke, hit depth, hit zone, bounce depth, bounce zone, spin, result) between the three data (i.e. optimal, suboptimal, and criterion). Repeated measures analysis of variance (ANOVA) and Pearson correlation were used to compare speed data. SwingVision data in most variables showed high proportional similarity and percent agreement with criterion data. Additionally, the optimal angle data had much more similar results to the criterion data than the suboptimal data. Therefore, this present study documented that SwingVision is trustworthy, and users should be aware of possible errors derived from angle differences.
... In comparing the two angles, the overlapping of a service line and a net strap was the most critical difference. In the optimal angle video, every line in the court is visible, leading to clear visual information without interference, which is necessary for the accuracy of line review (Yan et al., 2021;Yang, 2019). On the other hand, in the suboptimal angle, the net cord overlaps the service line in the upper half court, creating a visual hindrance. ...
Article
The present study examines the validity of the SwingVision application by comparing SwingVision's data to criterion data. Also, we investigated the difference in recording angles of SwingVision (optimal and suboptimal). Six college students played four matches, and every match was recorded from two different angles. After the data collection , recorded videos were analysed by SwingVision and human analysts (criterion). A total of 1065 strokes were analysed in the agreement of SwingVision and human analysts. Cross-tabulation with a column proportion test, Cochran's Q test, and Kappa statistics were utilised to demonstrate the association of categorical variables (stroke, hit depth, hit zone, bounce depth, bounce zone, spin, result) between the three data (i.e. optimal, suboptimal, and criterion). Repeated measures analysis of variance (ANOVA) and Pearson correlation were used to compare speed data.
... In attempts to minimise human error and provide greater transparency, some sports have already introduced video technology. Video Assistant Referee (VAR) was adopted by the International Football Association Board (IFAB) and is now used extensively via a variety of technological systems (IFAB, 2018), while tennis has the "Hawk-eye" line examination system (Hawk-Eye Innovations Ltd., Sony Business Europe, Basingstoke, England) to enhance and sometimes replace the line judges who use the naked eye (Spitz et al., 2021;Yan and Xin, 2021). In some cases, the main aim of introducing AI was not to limit human error, but to assist with score prediction in sports or injury risk assessment/prevention (Li and Xu, 2021;Claudino, et al., 2019). ...
Article
Full-text available
This study investigated the conceptions of perfection that figure in the minds of divers, coaches, and judges. Additionally, consideration was given to their views of whether an Artificial Intelligence (AI) based scoring system could be relied upon to yield authentic results, allied with the desirability of its use. Six participants (2 each of diver, coach, judge) were interviewed. Following verbatim transcriptions, thematic analyses were conducted to identify commonly occurring themes relevant to diving and its scoring system. The results of this preliminary study have shown a leaning towards clarification of the rules on point deductions by FINA (Fédération Internationale De Natation/ International Swimming Federation). Furthermore, the concept of perfection in diving changes from country to country and culture to culture, providing further difficulties in the objectivity of judging and there was a call for openness and clarity. With the human element of judging carrying weakness and allowing for errors, subsequently, it was felt by the respondents that some parts of a dive should be measured with technology where feasible. There was a consensus that judges could not be fully replaced without substantive changes to the sport of diving, in effect losing the artistic element that is subjectively assessed.
... Table tennis has attracted many players to achieve a career and also acted as a source of entertainment for viewers. A fair review process can be provided by the third-party evaluators are appointed to evaluate the line review [39,40]. In this context, the F-TENG was attached to the edge line of the table tennis court to detect whether the ball touch the edge line or not to offer a transparent review. ...
Article
The triboelectric nanogenerator (TENG) has emerged as a promising technology for the conversion of mechanical energy into electrical energy. There is widespread development in the field of the internet of things and wireless sensor networks to realize future portable and wearable electronic devices. Sports science is a big sector to explore self-powered sensors to reduce the usage of batteries and complicated conventional sensors. The TENG can be one of the best alternatives to realize energy harvesting, impact monitoring, and performance evaluation of athletes in the area of intelligent sports. This work presents a new design for TENG by collecting worn-out textiles from the waste bin to useful self-powered applications in intelligent sports for the first time. The circular economy of the worn-out textile can promote the eco-friendly concept of recycling, reuse, and reducing for various energy devices. The waste materials were evaluated by various techniques such as structural, morphology, and FT-IR. The single electrode mode TENG was fabricated using the various positive triboelectric textile layers and negative triboelectric textile layers. The new device structure of 4 fingers knitted TENG (F-TENG) was able to deliver a peak to peak voltage of 4.2 V and peak-to-peak current output of 2.7 nA by simple stretching and releasing. Further, the energy harvesting from various sports activities was collated such as dribbling the ball, running, and stretching exercises. The digital signal processing techniques such as wavelet transform and wavelet packet transform were introduced to shed light upon the usage of TENG-based sensing units in sports facilities such as self-powered punching power recognition and edge ball judgment system.
Article
Full-text available
Technique is one of the aspects that has the most relevant influence on tennis player performance. Searching for more efficient and effective technique, by means of the application of biomechanical laws, is a constant among coaches and researchers. This article deals with a very concrete subject in tennis technique: the position of the head during the impact phase of tennis strokes. Biomechanical aspects of the strokes will also be considered, as well as other relevant aspects such as fixing the gaze during the stroke and the stretching-shortening cycle.
Article
Full-text available
According to the rapid spread of multimedia data and online observations by users, the importance of researching on machine vision also, analyzing and automatic understanding of video data content is progressively increasing. Human motion recognition in video data is a crucial research subject in machine vision science that has plenty of applications, for instance, video surveillance, video indexing, robotics, human-computer interface and multimedia retrieval. Despite a high number of researches conducted on this topic, there is a necessity to achieve a more in-depth understanding, complete classification, and evaluation of existing human motion recognition stages. The novelty of this paper, our comparative analytical framework includes three major parts. Firstly, three different stages are introduced in recognizing human motion consisting of background subtraction, feature extraction, and machine learning classification. Secondly, five essential criteria are defined for evaluating the proposed human motion recognition methods. Finally, our comparative analysis of human motion recognition stages comprises two models. The analysis of background subtraction methods is based on applying the criteria for a qualitative comparison. Next, the feature extraction and machine learning classification methods are examined by specifying their main idea, benefits and challenges. Our comparative analytical framework can be beneficial for every researcher in this field by simplifying accurate selection and development of human motion recognition methods in future works.
Article
Full-text available
Musculoskeletal simulation and dynamic modeling programs have been used to gain insight into lower-limb musculoskeletal injury mechanisms. In addition to the temporospatial, kinematic, and kinetic data obtained from motion analysis systems, musculoskeletal simulation programs also can provide information on joint contact and muscle forces, musculotendinous lengths and velocities, and muscle activation levels. Musculoskeletal simulation platforms may help in the assessment of risk factors for sports-related injuries. Using musculoskeletal simulations for injury prevention programs may help lower the incidence of sports injuries, and may allow for fast recovery from injury. In this review, injury mechanisms and risk factors of some of the most common lower-limb musculoskeletal injuries, including anterior cruciate ligament, patellofemoral, and hamstring injuries were summarized from a biomechanical perspective. Also, the efficacy of musculoskeletal modeling and dynamic simulation tools in helping our understanding of these injury mechanisms was discussed.
Article
Full-text available
This paper evaluates the effectiveness of, and highlights issues with, conventional paradigms in applied sports biomechanics research and comments on their capacity to optimise techniques of individual athletes. In empirical studies, group-based analyses often mask variability between athletes and only permit probabilistic ‘in general’ or ‘on average’ statements that may not be applicable to specific athletes. In individual-based analyses, performance parameters typically exhibit a small range and a flat response over iterative performance trials making establishing associations between performance parameters and the performance criterion problematic. In theoretical studies, computer simulation modelling putatively enables athlete-specific optimum techniques to be identified but, given each athlete’s unique intrinsic dynamics, it is far from certain that these optimum techniques will be attainable, particularly under the often-intense psychological pressures of competition, irrespective of the volume of practice undertaken. Sports biomechanists and coaching practitioners are advised to be more circumspect with regard to interpreting the results of applied sports biomechanics research and have greater awareness of their assumptions and limitations, as inappropriate interpretation of results may have adverse consequences for performance and injury.
Article
The manual evaluation method to evaluate the effect of physical education teaching is tedious, and it will have a large error when the amount of data is large. In order to improve the efficiency of physical education evaluation, this article uses artificial intelligence for data analysis and uses machine vision to identify the teaching process to assist teachers in physical education. In order to reduce the calibration error of the parameters and obtain more accurate camera imaging geometric parameters, this paper adopts the method of averaging multiple sample points to determine the calibration parameters of the camera. In addition, this study builds system function modules according to actual needs and verifies system performance through experimental teaching methods. The research results show that the model proposed in this paper has a certain practical effect.
Article
The purpose of this study was to examine the accuracy of methods to estimate ball spin rate and spin axis direction using multi-camera tracking technology. The method implemented by Hawk-Eye and a theoretical ball trajectory model were assessed for their accuracy against high speed-vision. A theoretical ball trajectory model was found to estimate ball spin rate and the spin axis direction more accurately than the method used by Hawk-Eye. The superior performance of the ball trajectory model was indicated by a lower mean bias and standard deviation (2.92 ± 222.76) than Hawk-Eye (-100.01 ± 542.44), as well as narrower limits of agreement and a 327.63 RPM lower RMSE. Spin rates less than 4500 RPM were estimated with the highest accuracy, but neither method was able to consistently and accurately estimate spin rates >4500 RPM. The application of a trajectory model to multi-camera ball tracking data provides a practical and non-invasive method to accurately estimate the spin imparted when hitting, allowing for large-scale collection of spin rates during matches.
Article
The purpose of this study was to develop an automated method for identifying and classifying change of direction (COD) movements in professional tennis using tracking data. Three sport science and strength and conditioning experts coded match-play footage of nineteen professional tennis players (9 male and 10 female) from the Australian Open Grand Slam for COD of medium and high intensity. A total of 1,494 changes were identified and aligned with 2D player position sampled at 25 Hz based on camera tracking data. Several machine learning classifiers were trained and tested on a set of 1,128 time-motion features. A random forest algorithm was found to have the best out-of-sample performance, classifying medium and high intensity changes with an F1-score of 0.729. This research offers a novel and applicable way for utilising player tracking data and machine learning techniques to automatically identify and classify COD movements in professional tennis.
Article
This paper presents a novel framework for predicting shot location and type in tennis. Inspired by recent neuroscience discoveries we incorporate neural memory modules to model the episodic and semantic memory components of a tennis player. We propose a Semi-Supervised Generative Adversarial Network architecture that couples these memory models with the automatic feature learning power of deep neural networks, and demonstrate methodologies for learning player level behavioural patterns with the proposed framework. We evaluate the effectiveness of the proposed model on tennis tracking data from the 2012 Australian Tennis Open and exhibit applications of the proposed method in discovering how players adapt their style depending on the match context.
Article
A robust and rapid camera calibration method is essential in vision measurement. This paper proposes a camera calibration method using a 2.5-D coding target that requires just a single captured image and is capable of identifying the intrinsic and extrinsic parameters of a camera system simultaneously. The calibration target consists of four fixed planes to which coded patterns are attached. The fixed planes of the target prevent the ill condition that arises in Zhang's method, and they provide four different homography matrices at the same time. The extrinsic parameters of the cameras are calculated using the geometric constraints of the feature circles on each plane. The coded pattern allows the four planes to be distinguished automatically. The proposed method is first verified by comparison experiments with three classical calibration methods. The results confirm that the proposed method has the greatest accuracy and smallest variance of errors. The influences on the calibration results of geometric parameters such as the dihedral angle and the area ratio of the feature circles are then investigated by systematic experiments. This proposed camera calibration method using a single captured image represents a breakthrough in a camera calibration technology and has a broad potential for application to machine vision and vision measurement.