Conference PaperPDF Available

PPFPS - a paraboloid prediction based fractional pixel search strategy for H.26L

Authors:
  • tianjinuniversity

Abstract

H.26L is a new recommendation for moving picture coding proposed by ITU-T, in which 1/4-pel and 1/8-pel fractional pixel motion compensation is added to achieve more accurate motion description and higher compression efficiency. However, at the same time the complexity of the fractional pixel motion estimation also increases, requiring a total of 24 fractional-pel positions to be checked in 1/8-pel motion vector search, while some fast integer pixel motion search algorithms have decreased the searching pixels to about ten. Therefore, reducing the computational load for fractional pixel motion search is both necessary and significant. A novel paraboloid prediction based fractional pixel search algorithm is first proposed in this paper. In any case (1/2 pixel, 1/4 pixel or 1/8 pixel resolution), roughly a computation reduction of 62.5% in fractional pixel motion estimation can be achieved compared with the full fractional pixel motion search algorithm. Experimental results prove that the strategy keeps good performance in preserving image quality and has little influence on the bit rate.
PPFPS-
A
Paraboloid Prediction based Fractional Pixel Search Strategy for
H.26L*
Zhibo Chen Cheng
Du
Jinghua Wang Yun He
State Key Lab on Microwave
&
Digital Communications
Dept.
of
Electronic Engineering, Tsinghua University, Beijing,
100084
{cherub, hey}
@video.mdc.tsinghua.edu.cn
ABSTRACT
H.26L is a new recommendation for moving picture coding
proposed by ITU-T, in which 1/4-pel and 1/8-pel fractional pixel
motion compensation is added to achieve more accurate motion
description and higher compression efficiency. However, at the
same time the complexity of the fractional pixel motion
estimation also increases, requiring a total of 24 fractional-pel
positions to be checked in 1/8-pel motion vector search, while
some fast integer pixel motion search algorithms have decreased
the searching pixels to about ten. Therefore, reducing the
computational load for fractional pixel motion search is both
necessary and significant. A novel Paraboloid Prediction based
Fractional Pixel Search algorithm is first proposed in this paper.
In any case (U2 pixel, 1/4 pixel or 1/8 pixel resolution), roughly
a computation reduction of 62.5% in fractional pixel motion
estimation can be achieved compared with the full fractional
pixel motion search algorithm. Experimental results prove that
the strategy keeps good performance in preserving image quality
and makes little influence
on
the bit rate.
Keywords:
H.26L, fractional pixel motion estimation,
paraboloid prediction, PPFPS, PPHPS
1.
INTRODUCTION
Hybrid coding framework has been successfully adopted in
various moving picture coding standards, such as MPEG-I,
MPEG-2, H.263, and MPEG-4. From 1998, a new ITU-T
moving picture coding recommendation named H.26L is placed
on
the agenda of ITU-T
Q.
15
group (now named VCEG---Video
Coding Experts Group)[l]. The latest version of H.26L, Test
Model Long Term
8.0,
is proposed in Port0 Seguro meeting this
year[2], which is characterised by its high coding efficiency and
more functionalities.
Compared with H.263, some significant changes have been
adopted in H.26L[2]: the residue coding is 4x4 block based,
DCT is replaced by integer transform, universal VLC table is
used, 1/4 and
1/8
pixel accuracy motion estimation is proposed,
etc. A PSNR analysis shows that the H.26L results average more
than 2
dE4
higher in luma PSNR across all test cases than ISO-
MPEG’s chosen MPEG-4 “Advanced Simple” profile anchor
performance[3]. However, H.26L is essentially still based on
hybrid coding framework, and motion estimatiodcompensation
is still important in achieving the high coding efficiency.
Motion estimation is generally conducted into
two
steps: the first
step is integer pixel motion vector estimation and the second is
fractional pixel motion vector estimation. For fractional pixel
motion estimation, 1/2-pel accuracy is frequently used (H.263,
MPEG-I, MPEG-2, MPEG-4) and higher resolution motion
vectors are adopted recently in MPEG-4 (1/4-pel accuracy)[4]
and H.26L (U4 and 1/8-pel accuracy)[2].
Algorithms on fast motion estimation are always hot research
spots, especially fast integer pixel motion estimation, which has
achieved most attention because traditional fractional pixel
motion estimation (such as 1/2-pel) takes only a very small
proportion of the computational load of motion estimation.
However, with the development of fast integer motion
estimation algorithms and the decreasing number of integer
motion search points, computation load of fractional pixel
motion estimation has become more comparable to that of the
integer case. For instance, some center-biased fast integer
motion vector estimation algorithms [6][7][8][9] have reduced
the checking pixel positions averagely down to
10;
while full
fractional-pel motion estimation needs at least
8
1/2-pel
positions to be checked, and more positions to be searched if
higher resolution motion estimation is adopted, e.g. 16 positions
in the case of 1/4-pel and 24 positions in 1/8-pel, even possesses
a much higher proportion than integer pixel motion estimation in
the total motion estimation strategy.
There is still no related work investigating fast fractional-pel
motion estimation algorithms,
so
this paper presents a work both
necessary and significant on the topic.
In this paper, we propose a Paraboloid Prediction based
Fractional Pixel Search (PPFPS) strategy which is characterized
by its excellent ability to reduce computation load as well as its
good quality of preserving the rate-distortion performance.
For the sake of consistency, fractional-pel is used in this paper as
a general designation of all concepts related to fractional pixel
which are indicated by 1/2-pel, 1/4-pel and 1/8-pel, for instance,
1
/2-pel positions, 1/4-pel positions, and 1/8-pel positions are
used to indicate those pixel positions interpolated by a 1/2-pel,
1/4-pel or 1/8-pel filter and are generally referred to as
fractional-pel positions.
The paper is organized as follows: in Section 2, we will describe
the full fractional pixel motion estimation proposed in H.26L.
Section 3 presents our fast fractional motion estimation
algorithm and experimental results are given in Section 4.
Section 5 concludes this paper.
:This project
is
supported
by
National Science Foundation China
0-7803-7448-7/02/$17.00 02002
IEEE
111
-
9
2.
FFPS
-
Full Fractional Pixel Search algorithm in
H.26L
In H.26L[2], for each block or macro-block the motion vector is
determined by a full search on integer pixel positions followed
by fractional-pel refinement which is done by using full
fractional pixel search algorithm. It is reported that a gain up to
1.5 dB of PSNR is achieved with 1/8-pel motion estimation as
compared to 1/4-pel[ lo].
The fractional-pel positions are drawn in Fig.1, where capital
letters represent integer pixel positions, Roman numbers 1/2-pel
positions, lower case letters 1/4-pel positions and Arabian
numbers 1/8-pel positions. In a motion compensated prediction
scheme the motion vector and the block-size pattem (e.g. 16x16,
four 8x8, etc.) have to be estimated for each macroblock, which
is done by the following five steps[9]:
1.
Do integer pixel search to find the best integer pixel
motion vector;
2. Check the eight 1/2-pel positions
I
-
VI11
around the best
integer pixel position
C
in order to find the best 1/2-pel
motion vector;
Check the eight 1/4-pel positions a
-
h around the best 1/2-
pel position
V
in order to find the best 1/4-pel motion
vector;
4. Check the eight 1/8-pel positions
1
-
8
around the best 1/4-
pel position
h
so
as to find the best 1/8-pel motion vector;
5. Select the motion vector and block-size pattern, which
produces the lowest rate-distortion cost.
To acquire the positions of these fractional pixels, a 6 tap filter
(I,-5,20,20,-5,1)/32 is used to produce the 1/2-pel positions;
linear interpolation is employed to produce the 1/4-pel positions,
and an 8-tap filters is used in providing a 1/8-pel accuracy
3.
prediction[
I
j.
DI
I
HI IV
VI
DI
VI D2
I1
111
abc
CdVeHz
123
f
g4h5
618
VI1 VI11
v2
D4
Capital letters(C,Hi,Hz
...)
:
integer pixel positions
Roma numbers(I,II,III
...)
:
1/2-pel positions
Lower case letters(a,b,c
...)
:
1/4-pel positions
Arabian numbers( 1,2,3
...)
:
1/8-pel positions
Fig.
1
Full Fractional Pixel Motion Estimation
Clearly, totally 24 fractional positions are checked to get a
1/8-
pel resolution motion vector.
3.
PPFPS
-
Paraboloid Prediction based Fractional
Pixel Search Strategy
Since fractional-pel motion estimation strategy is carriled out in a
way that 1/2-pel motion estimation is followed
by
higher
resolution fractional-pel refinement such as 1/4-pel or
1
/%pel,
estimation accuracy of lower resolution motion estimation will
definitively influence that of higher resolution estimation.
Therefore, a fast and accurate 1/2-pel motion estimation
algorithm should be adopted in the first place.
Mathematical prediction model has been proposed as a fast 1/2-
pel motion estimation algorithm[l1][12], but only gives out
prediction on the horizontal or vertical direction, without the
required accuracy. We proposed a paraboloid prediction based
half pixel search (PPHFS) algorithm in[5]which can estimate not
only both the horizontal and vertical directions, but also the
diagonal direction. Thus a more accurate estimation and a better
performance can be achieved with searched 1/2-pel positions
decreasing from
8
to
3.
Based on
our
earlier work[5], the PPFPS strategy is proposed in
this paper, which takes PPHPS as the 1/2-pel motion estimation
algorithm and a novel higher resolution fractional-pel prediction
refinement algorithm is proposed.
3.1
PPHPS for 1/2:-pel motion estimation
In H.26L,
SAD
(Sum of Absolute Difference) is still used as the
basic cost function described as:
F(P)
=
t:IL(P)
-
wl
(1)
where
p
is an integer pixel or a fractional pixel in the previous
reconstructed picture,
C
is the pixel in the current picture and
L(*)
represents the luma
of
the pixel. The summation in
(I)
is
executed on different block-size pattems
in
H.26L. It can be
proved that, due to the property of SAD,
F(p)
is a smooth
convex function in the sub-area around the optimal matching
position[5]. Therefore, a quadratic equation is used to model the
convex function as
$(p)
=
A(x
-
J:~)~
+
B(y
-
yo)*
+
D
(2)
where
A
,
B
,
xo
,
yo
and
D
are five unknown parameters.
Supposing the coordinates of integer pixels
C
,
H,
,
H2,
V,
,
V2
are
(0,O)
,
(-1,O)
,
(1,O)
,
(0,
-1)
and (0,l) and that
the five pixels’ cost functxon values are
F(C)
,
F(Hl)
,
F(H2)
,
F(V,)
and
F(V2)
respectively (see Fig.l), if all
these 5 values are available in the integer pixel motion
estimation (called
Constraint
I),
the
5
unknown
parameters in
(2) can be solved as follows
I11
-
10
ifB=O.
Then we can decide the best 1/2-pel position by checking the
minimum cost function among the three 1/2-pel positions instead
of eight. The three 1/2-pel positions are given in Table
I,
which
are predicted according to the values of
xo
and
yo,
refer to Fig. 1
for designations.
Table
I
Search
Points
XO
=o,yo
10
xo
10,yo
=o
xo
>o,y,
=o
xo
>O,YO
>o
I,
IV, VI
111, v, VI11
v, VII, VI11
11,
111,
v
IV, VI, VI1
I,
11,
IV
3.2
Prediction refinement in 1/4-pel and 1/8-pel cases
chosen to be checked, such as the positions 2,3and 5 in Fig.2 (b).
Then the best 1/4-pel motion vector is obtained from the three
1/4-pels, according to the minimum cost function rule. The 1/8-
pel motion vector can be decided in the same way as that of the
1/4-pel.
-
B
-
123
4
-
5
123
4
-
5
Fig.2 Two patterns in prediction refinement
The refinement algorithm can also be used to find the best 1/2-
pel motion vector. But it should be noted that the algorithm can
not estimate the diagonal direction if only
Constraint
Z
is
satisfied,
so
it is not
so
accurate as using PPHPS to find a 1/2-pel
motion vector.
In conclusion,
3
fractional-pel positions need to be checked in
1/4-pel and l/S-pel motion estimation cases respectively and
totally
9
positions should be searched in order
to
get the 1/8-peI
motion vector. Compared with the 24 positions needed in the
Full Fractional Pixel Search algorithm of H.26L, roughly a
62.5% computation reduction is achieved in fractional-pel
motion estimation, where additional calculation for the
paraboloid prediction can be neglected.
Algorithm
3.3 Constraints
on
Fast Integer Pixel Search
After 1/2-pel motion estimation using PPHPS, further prediction
is proposed to achieve higher resolution of motion compensation.
be satisfied in this case,
so
that the PPHPS algorithm cannot be
that the Cost function
F(p)
is a smooth convex function in the
As
discussed
in
[SI,
Constraint
I
should be satisfied when
applying
the
ppHps
algorithm.
However,
not
all
integer
pixel
pixel
search
algorithms,
the
search process
is
divided
into
non-center points. The algorithm first searches all positions in
While it should
be
pointed
Out
that
Constraint
cannot
motion search algorithms
can
satisfy
Constraint
1.
In fast integer
used directly
in
ll4-pe1 or 1/8-pe1
motion
estimation.
Assuming consecutive layers[ 131, each containing a center point and Some
prediction area, we propose a simple and efficient prediction
refinement algorithm to estimate the 1/4-pel and 1/8-pel motion
vectors.
As
mentioned in the last section, three 1/2-pel positions around
the best integer pixel position
C
are checked in the PPHPS
algorithm and the best 1/2-pel position is found. In our
prediction refinement scheme, besides the best 1/2-pel position,
a sub-optimal 1/2-pel position whose cost function value
is
the
second smallest
of
the four positions (three 1/2-pel positions
checked plus the original best integer pixel position) is also
selected.
In view of the relative location of the best 1/2-pel position and
the sub-optimal 1/2-pel position, two patterns are considered.
Let A be the best 1/2-pel position and
B
the sub-optimal 1/2-pel
position. In the first pattern, A and B are in the same horizontal
or vertical line, and three 1/4-pel positions between
A
and
B
are
chosen as the checking pixels for 1/4-pel motion vector. An
instance is shown in Fig.2 (a), where positions
3,5
and 8 are
chosen. In the second pattern, A and
B
are lined up in diagonal
direction, and three 1/4-pel positions between A and
B
are
the current layer, and then chooses one of the optimal positions
as the center of the next layer. When the termination criterion
is
satisfied in a layer, the search will terminate. Here the form of
the non-center points is called Search Pattern. It is obviously
that diamond search pattern can satisfy
Constraint
Z,
and some
existing fast integer search algorithms such as Diamond Search
(DS)[6][7], Nearest-Neighbors Search ("S)[8] and
Unrestricted Center-Biased Diamond Search (UCBDS)[9] are all
based on this pattern[ 131, and they also satisfy
Constraint
I.
4. Simulation Results
The proposed algorithm is evaluated in the framework of H.26L
TML8.0 encoder. Some widely used test sequences, such as
News, Car Phone, Stefan and Coast Guard, are used in the
experiments. The frame rate of the input/output sequences is
30f/s. The picture format is CIF (4:2:0) and 100 frames are
coded in each sequence, the first frame being
I
frame and all
others
P
frames. There are no rate control algorithms introduced
in TML8.0,
so
that all tests are carried out under fixed
111
-
11
quantization parameters (QP). Because different QP generates
similar results, QP=IO is selected without
loss
of universality.
For simplicity, we set Inter block search model to 16x16 only
and reference frame number to one, CABAC (context-based
adaptive binary arithmetic coding) employs the entropy coding
method. These options will not affect the final results of our
algorithm.
We adopt full search as the integer pixel motion estimation
algorithm,
so
as to focus our efforts on the studies of fast
fractional pixel motion estimation. Note that the full search also
satisfies
Constraint
I.
Test results for PSNR and bit rate are listed in Table I1 and
Table 111. In simulation, 1/8-pel resolution motion vector are
adopted and the reference method is FFPS, representing Full
Fractional Pixel Search algorithm used in H.26L, which needs
totally 24 fractional-pel positions in this case.
The computation load reduction of fractional-pel motion
estimation is remarkable. Additional computation for paraboloid
prediction as well as sub-optimal position search can be
neglected compared with the calculation of SAD, and in any
case (1/2-pel, 1/4-pel and 1/8-pel) roughly a computational
reduction of 62.5% can be achieved.
Table
I1
Average PSNR of
P
frames
Sequence PSNRldB
Table
111
Average
Bit
Rate of
P
frames
Sequence
As can be seen from
Table
11,
PSNR difference is small for each
sequence, the worst case for PSNR degradation is 0.013dI3 and
the best shows an increase of 0.003dB in PSNR. The average
degradation is 0.006dB.
In Table
111,
Bitr-P represents the average bit rate of P frames. It
can be calculated that the maximal increase of Bitr-P is 3.0%,
the minimal
-0.8%,
and the average value increases by 0.82%. It
can be concluded that the proposed algorithm preserves both
PSNR
and bit rate of the encoded images.
5.
Conclusions
Based on the assumption that the cost function of motion
compensation is a smooth convex function in the sub-area
around the optimal matching position, a paraboloid prediction
based half pixel search algorithm together with a prediction
refinement algorithm for higher resolution fractional-pel motion
estimation (1/4-pel, 1/8-pel, etc.) is adopted in the proposed
Paraboloid Prediction based Fractional Pixel Search strategy.
In any case (1/2-pel, 1/4-pel or 1/8-pel) a computation reduction
of
about 62.5% can be achieved,
so
it can greatly speed up the
fractional pixel motion search. With a remarkable computational
reduction for fractional pixel search, the proposed algorithm has
relatively high prediction accuracy. The proposed algorithm has
negligible affect on image quality and bit rate with an average
0.006 dB PSNR degradation and 0.82% bit rate increase reported
from the simulation of the test sequences as compared to full
fractional pixel motion estima.tion. Moreover, this strategy can
also be applied in MPEG-4 in which 1/4-pel motion vector is
supported.
Reference:
Gary Sullivan, “Q. 15/16 meeting report”, Geneva, Jan.26-
Feb.6, 1998
“H.26L Test Model Long Term Number
8
(TML-8) drafto”,
ITU VCEG, June, 2001
“Summary Information for ITU-T VCEG Draft H.26L
Algorithm in Response to Video and DCinenia CfPs”
ISO/IEC JTCl/SC29/WCiIl, M7511,2001, July
MPEG-4 Video Verification Model version 16.0, ISO/IEC
JTCl/SC29/WGll N33 12, Noordwijkerhout
,
March,
2000
Cheng Du, Yun He, Junli Zheng, “A paraboloid prediction
based fast half pixel motion estimation” Joumal of
Tsinghua University- Science and Technology, V01.41
(I),
pp. 1-4, Jan. 200 1
S.
Zhu and K.-K. Ma, “A New Diamond Search AJgorithm
for Fast Block Matching Motion Estimation,” in IProc. Int.
Conf. Inform., Commuri., Signal Process., pp.:292-296,
Singapore, Sept.9-12, 1907
Shan Zhu and Kai-Kuang Ma, “A New Diamond Search
Algorithm for Fast Block-Matching Motion Estimation”,
IEEE Trans. Image Processing, vol. 92, no.
2,
pp. 287 -290,
Feb. 2000
M.
Gallant, F. Kossentini, “A computation constrained
motion vector search algorithm for block-based motion
estimation,” IEEE conference on signal, systems, and
computers, Nov. 1998
J.
Y.
Than,
S.
Ranganath,
M.
Ranganath, and A.
A.
Kassim,
“A Novel Unrestricted Center-Biased Diamond. Search
Algorithm for Block Motion Estimation,” IEEI? Trans,
Circuits and System for ’Video Technology, vol.
8,
no. 4,
Thomas Wedi, “1/8-pel ;Displacement Vector Resolution
for TML-6”, ITU-T VCEGM45, Austin, USA, 2001
Jeong J, Ahn, W. “Subpixel-accuracy motion estimation
using a model for motion compensated errors”. pp. 13.4-
Senda Y, Harasaki H, Yano M. “Theoretical background
and improvement of a simplified half-pel motion
estimation”, Proc.ICIP 96
,
pp.263 -266, vo1.3
,
1996
pp. 369-377, Aug. 1998
1-13.4-4, PCS’93.
[13]
Cheng Du, Yun He, “A Comparative Study of Motion
Estimation for Low Bit Rate Video Coding”,
pp.1239-1249, V01.2, VClP 2000.
I11
-
12
... Different fast sub-pixel ME algorithms [77][78][79][80] have been proposed, and a number of them have been adopted by the JM reference software [30]. The common idea is to 66 simplify the search pattern by applying refined prediction algorithms, and improved adaptive threshold schemes to terminate unnecessary search positions. ...
... Also the utilisation of mathematical models for matching error either to reduce the number of fractional-pixel search points [79,86] or to directly predict the best matching fractional-pixel location [87,88] were proposed. ...
... Further experiments have been carried out to compare the proposed scheme to recently published work [89,90]. 79 The result of these experiments is shown in ...
Thesis
Full-text available
Nowadays the amount of digital video applications is rapidly increasing. The amount of raw video data is very large which makes storing, processing, and transmitting video sequences very complex tasks. Furthermore, whilst the demand for enhanced user experience is growing, the sizes of devices capable of performing video processing operations are getting smaller. This further increases the practical limitations encountered when handling these large amounts of data, and makes research on video compression systems and standards very important. The H.264 is the latest international video coding standard. It compresses high quality video content at low bitrates for a wide range of applications. It uses state-of-the-art coding tools and provides enhanced coding efficiency to provide higher compression capabilities with high perceptual quality. These capabilities have also contributed to significant increase in complexity when implementing the H.264 in real-time applications. Within video coding, motion estimation is a primary contributor to the gain in compression but is also the most computationally intensive part. The objective of this project is designing and combining a series of novel techniques to overcome those limitations. In this thesis, an investigation and four algorithms are proposed which can be classified along three main streams. In the first stream, an investigation was carried out and two algorithms were designed for optimising the motion estimation process for the H.264/AVC whilst maintaining the same quality and the compression rate as the standard. They are based on exploiting frequency domain motion estimation and on the interpolation effect on the motion estimation process. Firstly, the H.264 recommended interpolation and rate distortion methods were examined when frequency domain motion estimation is employed, this investigation has outlined novel improvements for frequency domain motion estimation adaptation. Secondly, a novel fast frequency domain motion estimation algorithm has been designed, the advantage of this approach over standard algorithms is the significant reduction in the encoding complexity it provides for a variety of video sequences. Finally, a novel fast subpixel motion estimation algorithm has been developed, the algorithm adaptively terminates subpixel motion estimation based on the video properties. In the second and third streams the complexity reduction algorithms are further developed to achieve complexity-scalable control of the standard scalable and multiview extensions where more data and flexibilities are incorporated to enhance the end-user experiences. The proposed algorithms offer the following developments and contributions. The application of the interpolation effect to reduce the encoding complexity is unique. The developed algorithms are flexible in their applications and can be combined with different fast algorithms. The conducted experiments show significant speed improvements, thus making a novel contribution to the implementation of real-time H.264 standard encoders in computationally constrained environments such as low-power mobile devices and general purpose computers.
... Many fractional pixel motion estimation algorithms have been proposed so far. 7,[15][16][17][18][19][20][21][22][23][24][25][26] The conventional hierarchical fractional pixel search (HFPS) algorithm 7 that has been adopted by the reference software, checks 16 search points at ¼-pel accuracy. Wang et. ...
... al. 15 have proposed a fast fractional pel motion estimation algorithm by using a fixed half pel search pattern, whereas the quarter search patterns are adapted. A paraboloid prediction based fractional pel search strategy 16 combines paraboloid prediction based half pel search with directional refinement algorithm to estimate the fractional pel motion vector. However, the implementation of this algorithm requires that the search model of the integer pixel ME to be diamond shaped so that the cost value of the four diamond vertices around the best integer pel position should be available before fractional pel motion estimation. ...
Article
Fractional pixel motion estimation (ME) is required to achieve more accurate motion vectors and higher compression efficiency. This results in an increase in the computational complexity of the ME process because of additional computational overheads such as interpolation and fractional pixel search. Fast algorithms for fractional ME in H. 264/AVC are presented. To reduce the complexity of fractional pixel ME, unimodal error surface assumption is used to check only some points in the fractional pixel search window. The proposed algorithm employs motion prediction, directional quadrant and point-based search pattern and early termination to speed up the process. Hence, the proposed algorithm efficiently explores the neighborhood of integer pixel based on high correlation that exists between the neighboring fractional pixels and unimodal property of error surface. The proposed search pattern and early termination reduce computational time by almost 8% to 18% as compared to the hierarchical fractional pixel algorithm employed in the reference software with a negligible degradation in video quality and negligible increase in bit rate. (C) 2013 SPIE and IS&T
... Typically, the matching error used in the algorithm is the sum of absolute difference (SAD). The PPHPS algorithm was extended to quarter-and 1/8-pixel motion estimation in [11]. ...
... The three unknown coefficients in Equation 9 can be evaluated using three matching error values at integer-pixel locations, as shown in Figure 6 and in Equation 10. Using the evaluated coefficients, the matching error at fractional-pixel locations can be estimated as in Equation 11. (11) In general, the 1-D parabolic model (i.e., Equation 9) can be applied to any row or column of integer-or fractional-pixel locations. Three known matching error values are required to evaluate the model coefficients. ...
Article
Full-text available
This paper presents interpolation-free fractional-pixel motion estimation (FME) algorithms and efficient hardware prototype of one of the proposed FME algorithms. The proposed algorithms use a mathematical model to approximate the matching error at fractional-pixel locations instead of using the block matching algorithm to evaluate the actual matching error. Hence, no interpolation is required at fractional-pixel locations. The matching error values at integer-pixel locations are used to evaluate the mathematical model coefficients. The performance of the proposed algorithms has been compared with several FME algorithms including the full quarter-pixel search (FQPS) algorithm, which is used as part of the H.264 reference software. The computational cost and the performance analysis show that the proposed algorithms have about 90% less computational complexity than the FQPS algorithm with comparable reconstruction video quality (i.e., approximately 0.2 dB lower reconstruction PSNR values). In addition, a hardware prototype of one of the proposed algorithms is presented. The proposed architecture has been prototyped using the TSMC 0.18 μm CMOS technology. It has maximum clock frequency of 312.5 MHz, at which, the proposed architecture can process more than 70 HDTV 1080p fps. The architecture has only 13,650 gates. The proposed architecture shows superior performance when compared with several FME architectures.
... The common idea is to simplify the search pattern. Some are based on the assumption that the error surface is monotonic [4] ; some are based on the observation that the cost function is a smooth convex function in the prediction area [5] , etc. However, there is no one based on the statistics of MVs. ...
Article
Motion estimation is an important and intensive task in video coding applications. Since the complexity of integer pixel search has been greatly reduced by the numerous fast ME algorithm, the computation overhead required by fractional pixel ME has become relatively significant. To reduce the complexity of the fractional pixel ME algorithm, a directionality-based fractional pixel ME algorithm is proposed. The proposed algorithm efficiently explores the neighborhood positions which with high probability to be the best matching around the minimum one and skips over other unlikely ones. Thus, the proposed algorithm can complete the search by examining only 3 points on appropriate condition instead of 17 search points in the search algorithm of reference software. The simulation results show that the proposed algorithm successfully optimizes the fractional-pixel motion search on both half and quarter-pixel accuracy and improves the processing speed with low PSNR penalty.
Conference Paper
A fast integer pixel motion search algorithm based on H.264 multi-reference frames is proposed. This algorithm can effectively reduce the motion search by establishing a search starting point model in multi-reference frames and a hexagon search based on multi-reference frames calculation volume. The experimental results show that the fast motion search algorithm based on multi-reference frames can reduce the motion search time of H.264 multi-reference frames to a great extent while maintaining good coding quality.
Article
The latest video coding standard, HEVC can improve the coding efficiency significantly compared with the H.264/AVC. However the HEVC encoder requires much larger computational complexities. The longer 8-tap interpolation filter of the HEVC which is used in a non-integer motion estimation is one of the reasons and this paper aims to reduce the computational complexities. First of all, three shorter-tap interpolation filters for a motion estimation process are tested rather than the use of a standard interpolation filter. In addition, the fast searching strategies to reduce the number of comparisons for choosing the best non-integer motion vector are proposed. Finally, the interpolation process is selectively applied according to the searching strategy. By combining all of the techniques, the experimental results show that the encoding times can be reduced by 13.6%, 18.5% and 21.1% with the coding efficiency penalties of 0.7%, 1.5% and 2.5%, respectively. For the full-HD video sequences, the coding efficiency penalties are reduced to 0.4%, 1.1% and 1.6% at the same level of the encoding time savings, which shows the effectiveness of the proposed schemes for the high resolution video sequences.
Article
Motion estimation is always regarded as the most time consuming module in video coding, and many fast motion estimation algorithms have been proposed to speed-up it. However one fact that motion regions need more complex search whereas still regions does not instead is often ignored. On the other hand, the analysis of the distribution of motion vector difference shows that the predicted motion vector is very near to real motion vector. Herein, in this improved motion estimation algorithm, motion region is first identified using improved visual rhythm analysis and then one efficient search scheme, named search center adaptive motion estimation, is carried out according the MB motion. In the simulations, the algorithm is verified on platform JM7.3. The results show that the search scheme can great speed-up the motion estimation of still MBs, and it can eliminate about 25% integer pixel search process of motion MBs as well. The encoding performance loss is negligible for low motion video and trifling for video sequences with complex motion.
Article
In this paper, a fast sub-pixel motion estimation algorithm is presented for H.264/MPEG4 AVC video coding. The proposed methods are based on mathematical models of the motion compensated error distribution function to predict the possible searching direction. Here we do not select the parabola in the horizontal direction but the paraboloid in two dimension by five integer-pixels, the parabola can more exactly describe the moving and texture characteristic in the compressed pictures without strict limitation on the very low bit rate coding. In order to decide the coefficients of the paraboloid model which is suitable for most of sequences, the minimum mean square error (MMSE) from the best integer-pixel motion vector and its horizontally adjacent two half-pixel motion vectors are computed, then the MMSE correlation among three elements are compared and analyzed respectively. By an accurate modeling, we can effectively predict the further searching mode instead of full search. For the quarter-pixel accuracy motion vector, an approximation method is used to get the results from the half-pixel middle results directly. Experimental results show that the proposed method reduces the computational complexity down to 54% averagely, similar performance compared with other fast algorithms in speed, and degradation in the reconstructed video quality is negligible. At the same time, our proposed method does not depend on the integer-pixel searching mode
Article
Full-text available
The widespread use of block-based interframe motion estimation for video sequence compression in both MPEG and H.263 standards is due to its effectiveness and simplicity of implementation. Nevertheless, the high computational complexity of the full-search algorithm has motivated a host of suboptimal but faster search strategies. A popular example is the three-step search (TSS) algorithm. However, its uniformly spaced search pattern is not well matched to most real-world video sequences in which the motion vector distribution is nonuniformly biased toward the zero vector. Such an observation inspired the new three-step search (NTSS) which has a center-biased search pattern and supports a halfway-stop technique. It is faster on average, and gives better motion estimation as compared to the well-known TSS. Later, the four-step search (4SS) algorithm was introduced to reduce the average case from 21 to 19 search points, while maintaining a performance similar to NTSS in terms of motion compensation errors. We propose a novel unrestricted center-biased diamond search (UCBDS) algorithm which is more efficient, effective, and robust than the previous techniques. It has a best case scenario of only 13 search points and an average of 15.5 block matches. This makes UCBDS consistently faster than the other suboptimal block-matching techniques. This paper also compares the above methods in which both the processing speed and the accuracy of motion compensation are tested over a wide range of test video sequences
Article
Typical motion estimation for block-based video coding schemes consists of two parts: the one at integer pixel accuracy and the other at half pixel accuracy. In this paper, integer pixel motion estimation algorithms are first discussed in terms of three technical categories: search step, search pattern and decision of the initial motion vector. With the development of the efficiency of integer pixel motion estimation, the computation load of widely used full half pixel search becomes relatively higher. In order to further improve the speed of half pixel search, this paper proposes a paraboloid prediction based fast half pixel search algorithm. Experimental results show that variable search step, search pattern with less points and predicted initial motion vector helps to improve the performance of fast integer pixel search and the proposed fast half pixel search increases the speed of half pixel search with almost not affecting the image quality.© (2000) COPYRIGHT SPIE--The International Society for Optical Engineering. Downloading of the abstract is permitted for personal use only.
Conference Paper
Typical motion estimation for block-based video coding schemes consists of two parts: the one at integer pixel accuracy and the other at half pixel accuracy. In this paper, integer pixel motion estimation algorithms are first discussed in terms of three technical categories: search step, search pattern and decision of the initial motion vector. With the development of the efficiency of integer pixel motion estimation, the computation load of widely used full half pixel search becomes relatively higher. In order to further improve the speed of half pixel search, this paper proposes a paraboloid prediction based fast half pixel search algorithm. Experimental results show that variable search step, search pattern with less points and predicted initial motion vector helps to improve the performance of fast integer pixel search and the proposed fast half pixel search increases the speed of half pixel search with almost not affecting the image quality.
Conference Paper
This paper describes theoretical background of a mean absolute error (MAE) approximation method which was adopted as a half-pel motion estimation method in our real-time MPEG-2 codec VisuaLink 7000. We also propose an improved MAE approximation method which employs the horizontal and the vertical differentials of a source picture in addition to the MAEs of adjacent full-pel motion vectors. In general, any approximation method for MAE involves some loss of coding efficiency. However, the proposed method reduces loss of coding efficiency to 1/2~3/4 of other existing methods with negligible increases of necessary computations and data transfers
Article
Based on the study of motion vector distribution from several commonly used test image sequences, a new diamond search (DS) algorithm for fast block-matching motion estimation (BMME) is proposed in this paper. Simulation results demonstrate that the proposed DS algorithm greatly outperforms the well-known three-step search (TSS) algorithm. Compared with the new three-step search (NTSS) algorithm, the DS algorithm achieves close performance but requires less computation by up to 22% on average. Experimental results also show that the DS algorithm is better than the four-step search (4SS) and block-based gradient descent search (BBGDS), in terms of mean-square error performance and required number of search points
1/8-pel Displacement Vector Resolution for TML-6
  • Thomas Wediin
A paraboloid prediction based fast half pixel motion estimation
  • Cheng Du
  • Yun He
  • Junli Zheng
A computation constrained motion vector search algorithm for block-based motion estimation
  • M Gallant
  • F Kossentini
M. Gallant, F. Kossentini, "A computation constrained motion vector search algorithm for block-based motion estimation," IEEE conference on signal, systems, and computers, Nov. 1998