Zhengyi Luo's research while affiliated with Shanghai University of Electric Power and other places

Publications (23)

Article
It has been recognized that videos have to be encoded in a rate-distortion optimized manner for high coding performance. Therefore, operational coding methods have been developed for conventional distortion metrics such as Sum of Squared Error (SSE). Nowadays, with the rapid development of machine learning, the state-of-the-art learning based metri...
Conference Paper
Rate control plays a key role in video coding, which has a significant effect on encoder performance. With parallel video coding frameworks more and more popular, rate control suitable for parallel coding is highly desired. However, most rate control algorithms only focus on the rate distortion performance but ignoring the data correlation in paral...
Conference Paper
The newly proposed video coding standard, High Efficiency Video Coding (HEVC), has been widely accepted and adopted by industry and academia due to its better coding efficiency compared with H.264/AVC. While HEVC achieves an increase of about 40% in coding efficiency, its computational complexity has been increased significantly. Given this, a high...
Article
The H.265/MPEG-HEVC is the latest video coding standard, which achieves an increase of about 50% in coding efficiency compared to its predecessor H.264/MPEG-AVC. Ever since H.265/MPEG-HEVC was designed to replace almost all existing H.264/ MPEG-AVC codecs, high-resolution video coding beyond High Definition (4K, 8K, etc.) has drawn more attention....
Conference Paper
This paper presents a rate control scheme for low delay video communication of the High Efficiency Video Coding (HEVC) standard. To prevent the buffer overflow and underflow under small buffer size constraint in low delay communication, the state-of-the-art R-λ algorithm is improved for more accurate bit allocation. A new bit allocation method base...
Article
Full-text available
Raptor codes are state-of-the-art forward error correction (FEC) solutions for multimedia transmission, which have been applied to unequal error protection (UEP) of multi-layered media such as scalable video coding. In this paper, we address the problem of UEP for singlelayered video over packet erasure channels. By exploiting the different priorit...
Article
Full-text available
The field of video coding has been exploring the compact representation of video data, where perceptual redundancies in addition to signal redundancies are removed for higher compression. Many research efforts have been dedicated to modeling the human visual system's characteristics. The resulting models have been integrated into video coding frame...
Conference Paper
Due to the best effort feature of many existing transmission channels, video streams often suffer from inevitable transmission errors. In this paper, we propose a scheme of robust video transmission based on the state-of-the-art Raptor codes, whose applications are in full swing now. And considering Region of Interest (ROI) often draws much attenti...
Article
Full-text available
H.264/AVC adopts many directional spatial prediction models in block-based manner that neighboring pixels on the left and top sides yield prediction for the pixels in a data block to be encoded. However, such models may adapt poorly to the rich textures inside blocks of video signal. In this letter, a new lossless intra coding method based on pixel...
Conference Paper
Video coding has been widely adopted to achieve pleasant video quality at constrained bitrate. In this paper, adaptive frequency coefficient suppression directed by Human Visual System (HVS) is presented for H.264 video coding. Firstly, starting from Just Noticeable Distortion (JND) models for the classic DCT domain, we deduce a JND threshold for t...
Conference Paper
During the period of transmission, video data usually suffer from transmission errors inevitably. Intra update is a common approach to stop error propagation. However, damaged images cannot recover until next update in case of errors, which often leads to annoying effect. In this paper, we propose an enhanced leaky prediction approach that enables...
Conference Paper
Unequal error protection (UEP), which provides important data with more protection, has been proven to be able to produce better quality in image communication. Previous UEP schemes are mostly proposed for single-image or single-program scenarios. Yet few are developed for multiple programs. Inspired by the MPEG-2 transport stream (TS), in this pap...

Citations

... Similarly, in [95], PWMSE-V [33] that considered spatial and temporal masking modulation was applied to the distortion term in RDO, and Lagrangian multiplier adaptation was derived accordingly. In [58], Luo et al. built a linear relationship between block-level VMAF degradation and MSE difference in multi-pass pre-coding, which derived a block-level Lagrangian multiplier adaptation. However, when measured with VMAF, only 3.61% and 2.67% BDBR gains were achieved for LD and Random Access (RA) settings, respectively. ...
... This is required in order to preserve a high, and possibly stable, video quality. The quality may be defined by objective parameters that describe quality of service (QoS) [14][15][16][17][18], or by the users' subjective assessment scores that represent the so-called quality of experience (QoE) [19][20][21]. In this paper, the objective approach will be presented as a method of assessing video quality and comparing codec performance. ...
... Recently, Wei et al. [43] used static and dynamic based perceptual feature to control bit allocation. Wang et al. [42] also proposed a masking effect-based RC method, which considered temporal and spatial information. However, the bitrate accurately of these models are relatively rough. ...
... Some works use the features of the infrared videos [17], satellite videos [18] and surveillance videos [19,20] to generate additional reference frames for H.265/HEVC for further bitrate savings. Many works also applied deep learning techniques to video compression, including using deep learning networks to improve the accuracy of sub-pixel motion estimation and motion compensation [21][22][23][24], enhance the bi-prediction performance [25], and improve the quality of reference frames [4][5][6], etc. Considering the continuous growth of video services and the development of new industries such as cloud gaming, the demand for bandwidth is increasing exponentially and the exploration of video compression is still urgent. ...
... Meanwhile, the method proposed in [56] can achieve a BD-rate of 1.1, a BD-PSNR of 0.3 and a latency reduction of 0.2. For the method of [63], the BD-rate and BD-PSNR are 4.6 and 5.4, respectively, which are much higher than those of baseline HM16.19 coding algorithms, including [59] and [115]. When a recently proposed JND-based perceptual RC method for HEVC [66] is compared with [116] and [117], it can be seen that this method significantly reduces the bit rate while ensuring the subjective video quality. ...
... It is due to the current rate control is designed for the rate distortion performance, but neglecting video data correlation in parallel coding cases. A parallel rate control scheme for HEVC which supports both frame-parallelism and slice-level parallelism has been studied by Xie et al. [41]. In our literature survey, this work is the only article addressing rate control scheme for parallelism case in HEVC. ...
... In the latter case, the basis of the proposal is to use the Cauchy optimization method, in order to further minimize the number of bits that are observed at the output of the CABAC entropy encoder. Those extensions of CABAC have led to a noticeable reduction of the video data stream by 0.6% to 1.2%, making these proposals competitive to other literature proposals [e.g., 27]. ...
... The model proposed in [24] has been further extended by Xie et al. in [25] for bit allocation strategy in the context of RC. Specific hierarchical coding schemes have also been investigated by Gao et al. for Low Delay (LD) [26] and Random Access (RA) [27] coding configurations. In the specific case of HM and RA configuration, coding efficiency increases by 2.2% and can be further improved to 5.2% when the method is coupled with the high-complexity Multi Quantization Parameter (MQP) optimization proposed by Sullivan and Wiegand [7]. ...
... In [11] and [12], features of MB partition types, discrete cosine transformation coefficient and QP are extracted from H.264 bitstreams to train the support vector machine (SVM) classifier, in order to speed up CU splitting in H.264 to HEVC transcoding. MV clustering is utilized in [13] and [14] for CU partition in H.264 to HEVC transcoding. ...
... 2) Video codec selection: We identify x264 and x265 [37] as the most widely spread codecs for executing video encoding tasks, deployed by more than 90% of the video streaming industry [16]. The x265 video codec typically requires more computing resources than x264 but achieves a higher video quality for the same encoding parameters [14]. ...