Christian Timmerer’s research while affiliated with Alpen-Adria-Universität Klagenfurt and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (390)


Convex Hull Prediction Methods for Bitrate Ladder Construction: Design, Evaluation, and Comparison
  • Article

March 2025

·

9 Reads

ACM Transactions on Multimedia Computing, Communications and Applications

·

·

Hadi Amirpour

·

[...]

·

Luce Morin

HTTP adaptive streaming (HAS) has emerged as a prevalent approach for over-the-top (OTT) video streaming services due to its ability to deliver a seamless user experience. A fundamental component of HAS is the bitrate ladder, which comprises a set of encoding parameters ( e.g. , bitrate-resolution pairs) used to encode the source video into multiple representations. This adaptive bitrate ladder enables the client’s video player to dynamically adjust the quality of the video stream in real-time based on fluctuations in network conditions, ensuring uninterrupted playback by selecting the most suitable representation for the available bandwidth. The most straightforward approach involves using a fixed bitrate ladder for all videos, consisting of pre-determined bitrate-resolution pairs known as one-size-fits-all . Conversely, the most reliable technique relies on intensively encoding all resolutions over a wide range of bitrates to build the convex hull , thereby optimizing the bitrate ladder by selecting the representations from the convex hull for each specific video. Several techniques have been proposed to predict content-based ladders without performing a costly, exhaustive search encoding. This paper provides a comprehensive review of various convex hull prediction methods, including both conventional and learning-based approaches. Furthermore, we conduct a benchmark study of several handcrafted- and deep learning (DL)-based approaches for predicting content-optimized convex hulls across multiple codec settings. The considered methods are evaluated on our proposed large-scale dataset, which includes 300 UHD video shots encoded with software and hardware encoders using three state-of-the-art video standards, including AVC/H.264, HEVC/H.265, and VVC/H.266, at various bitrate points. Our analysis provides valuable insights and establishes baseline performance for future research in this field. Dataset URL: https://nasext-vaader.insa-rennes.fr/ietr-vaader/datasets/br_ladder


Fig. 2: Low-resolution to high-resolution interpolation of one CTU.
Multi-resolution Encoding for HTTP Adaptive Streaming using VVenC
  • Preprint
  • File available

March 2025

HTTP Adaptive Streaming (HAS) is a widely adopted method for delivering video content over the Internet, requiring each video to be encoded at multiple bitrates and resolution pairs, known as representations, to adapt to various network conditions and device capabilities. This multi-bitrate encoding introduces significant challenges due to the computational and time-intensive nature of encoding multiple representations. Conventional approaches often encode these videos independently without leveraging similarities between different representations of the same input video. This paper proposes an accelerated multi-resolution encoding strategy that utilizes representations of lower resolutions as references to speed up the encoding of higher resolutions when using Versatile Video Coding (VVC); specifically in VVenC, an optimized open-source software implementation. For multi-resolution encoding, a mid-bitrate representation serves as the reference, allowing interpolated encoded partition data to efficiently guide the partitioning process in higher resolutions. The proposed approach uses shared encoding information to reduce redundant calculations, optimizing partitioning decisions. Experimental results demonstrate that the proposed technique achieves a reduction of up to 17% compared to medium preset in encoding time across videos of varying complexities with minimal BDBR/BDT of 0.12 compared to the fast preset.

Download

Fig. 3: An example of search space reduction for one CU. TABLE I: The performance of ETRF and the fast preset compared to the medium preset.
Improving the Efficiency of VVC using Partitioning of Reference Frames

March 2025

·

6 Reads

In response to the growing demand for high-quality videos, Versatile Video Coding (VVC) was released in 2020, building on the hybrid coding architecture of its predecessor, HEVC, achieving about 50% bitrate reduction for the same visual quality. It introduces more flexible block partitioning, enhancing compression efficiency at the cost of increased encoding complexity. To make efficient use of VVC in practical applications, optimization is essential. VVenC, an optimized open-source VVC encoder, introduces multiple presets to address the trade-off between compression efficiency and encoder complexity. Although an optimized set of encoding tools has been selected for each preset, the rate-distortion (RD) search space in the encoder presets still poses a challenge for efficient encoder implementations. In this paper, we propose Early Termination using Reference Frames (ETRF), which improves the trade-off between encoding efficiency and time complexity and positions itself as a new preset between medium and fast presets. The CTU partitioning map of the reference frames in lower temporal layers is employed to accelerate the encoding of frames in higher temporal layers. The results show a reduction in the encoding time of around 21% compared to the medium preset. Specifically, for videos with high spatial and temporal complexities, which typically require longer encoding times, the proposed method achieves a better trade-off between bitrate savings and encoding time compared to the fast preset.


Perceptual Visual Quality Assessment: Principles, Methods, and Future Directions

March 2025

·

23 Reads

As multimedia services such as video streaming, video conferencing, virtual reality (VR), and online gaming continue to expand, ensuring high perceptual visual quality becomes a priority to maintain user satisfaction and competitiveness. However, multimedia content undergoes various distortions during acquisition, compression, transmission, and storage, resulting in the degradation of experienced quality. Thus, perceptual visual quality assessment (PVQA), which focuses on evaluating the quality of multimedia content based on human perception, is essential for optimizing user experiences in advanced communication systems. Several challenges are involved in the PVQA process, including diverse characteristics of multimedia content such as image, video, VR, point cloud, mesh, multimodality, etc., and complex distortion scenarios as well as viewing conditions. In this paper, we first present an overview of PVQA principles and methods. This includes both subjective methods, where users directly rate their experiences, and objective methods, where algorithms predict human perception based on measurable factors such as bitrate, frame rate, and compression levels. Based on the basics of PVQA, quality predictors for different multimedia data are then introduced. In addition to traditional images and videos, immersive multimedia and generative artificial intelligence (GenAI) content are also discussed. Finally, the paper concludes with a discussion on the future directions of PVQA research.


Counterfeiting Attacks on a RDH-EI scheme based on block-permutation and Co-XOR

February 2025

·

2 Reads

ACM Transactions on Multimedia Computing, Communications and Applications

Reversible data hiding in encrypted images (RDH-EI) has gained widespread attention due to its potential applications in secure cloud storage. However, the security challenges of RDH-EI in cloud storage scenarios remain largely unexplored. In this paper, we present a counterfeiting attack on RDH-EI schemes that utilize block-permutation and Co-XOR (BPCX) encryption. We demonstrate that ciphertext images generated by BPCX-based RDH-EI are easily tampered with to produce a counterfeit decrypted image with different contents imperceptible to the human eye. This vulnerability is mainly because the block permutation key information of BPCX is susceptible to known-plaintext attacks (KPAs). Taking ciphertext images in telemedicine scenarios as an example, we describe two potential counterfeiting attacks, namely fixed-area and optimal-area attacks. We show that the quality of forged decrypted images depends on the accuracy of the estimated block-permutation key under KPA conditions. To improve the invisibility of counterfeit decrypted images, we analyze the limitations of existing KPA methods against BPCX encryption for 2×22\times 2 block sizes and propose a novel diagonal inversion rule specifically designed for image blocks. This rule further enhances the accuracy of the estimated block-permutation key. The experiments show that, compared to existing KPA methods, the accuracy of the estimated block-permutation key in the UCID dataset increases by an average of 11.5%. In the counterfeiting attack experiments on Camera's encrypted image, we successfully tampered with over 80% of the pixels in the target area under the fixed-region attack. Additionally, we achieved a tampering success rate exceeding 90% in the optimal-region attack.


Energy-Efficient Video Streaming: Open-Source Tools, Datasets, and Solutions

January 2025

·

17 Reads

ACM SIGMultimedia Records

Energy efficiency has become a crucial aspect of today's IT infrastructures, and video (streaming) accounts for over half of today's Internet traffic. This column highlights open-source tools, datasets, and solutions addressing energy efficiency in video streaming presented at ACM Multimedia Systems 2024 and its co-located workshop ACM Green Multimedia Systems.


MPEG Column: 146th MPEG Meeting in Rennes, France

January 2025

·

3 Reads

ACM SIGMultimedia Records

The 146th MPEG meeting was held in Rennes, France from 22-26 April 2024, and the official press release can be found here. It comprises the following highlights: •AI-based Point Cloud Coding: Call for proposals focusing on AI-driven point cloud encoding for applications such as immersive experiences and autonomous driving. •Object Wave Compression: Call for interest in object wave compression for enhancing computer holography transmission. •Open Font Format: Committee Draft of the fifth edition, overcoming previous limitations like the 64K glyph encoding constraint. •Scene Description: Ratified second edition, integrating immersive media objects and extending support for various data types. •MPEG Immersive Video (MIV): New features in the second edition, enhancing the compression of immersive video content. •Video Coding Standards: New editions of AVC, HEVC, and Video CICP, incorporating additional SEI messages and extended multiview profiles. •Machine-Optimized Video Compression: Advancement in optimizing video encoders for machine analysis. •MPEG-I Immersive Audio: Reached Committee Draft stage, supporting high-quality, real-time interactive audio rendering for VR/AR/MR. •Video-based Dynamic Mesh Coding (V-DMC): Committee Draft status for efficiently storing and transmitting dynamic 3D content. •LiDAR Coding: Enhanced efficiency and responsiveness in LiDAR data processing with the new standard reaching Committee Draft status.


MPEG Column: 145th MPEG Meeting (Virtual/Online)

January 2025

·

2 Reads

ACM SIGMultimedia Records

The 145th MPEG meeting was held online from 22-26 January 2024, and the official press release can be found here. It comprises the following highlights: •Latest Edition of the High Efficiency Image Format Standard Unveils Cutting-Edge Features for Enhanced Image Decoding and Annotation •MPEG Systems finalizes Standards supporting Interoperability Testing •MPEG finalizes the Third Edition of MPEG-D Dynamic Range Control •MPEG finalizes the Second Edition of MPEG-4 Audio Conformance •MPEG Genomic Coding extended to support Transport and File Format for Genomic Annotations •MPEG White Paper: Neural Network Coding (NNC) - Efficient Storage and Inference of Neural Networks for Multimedia Applications This column will focus on the High Efficiency Image Format (HEIF) and interoperability testing. As usual, a brief update on MPEG-DASH et al. will be provided.


Fig. 2: Distribution of extracted complexity features.
Fig. 9: Histogram of selected resolutions for three different methods.
Fig. 10: Histogram of selected resolutions for different λ values.
Real-Time Quality- and Energy-Aware Bitrate Ladder Construction for Live Video Streaming

January 2025

·

29 Reads

IEEE Journal on Emerging and Selected Topics in Circuits and Systems

Live video streaming’s growing demand for high-quality content has resulted in significant energy consumption, creating challenges for sustainable media delivery. Traditional adaptive video streaming approaches rely on the over-provisioning of resources leading to a fixed bitrate ladder, which is often inefficient for the heterogeneous set of use cases and video content. Although dynamic approaches like per-title encoding optimize the bitrate ladder for each video, they mainly target video-on-demand to avoid latency and fail to address energy consumption. In this paper, we present LiveESTR, a method for building a quality- and energy-aware bitrate ladder for live video streaming. LiveESTR eliminates the need for exhaustive video encoding processes on the server side, ensuring that the bitrate ladder construction process is fast and energy efficient. A lightweight model for multi-label classification, along with a lookup table, is utilized to estimate the optimized resolution-bitrate pair in the bitrate ladder. Furthermore, both spatial and temporal resolutions are supported to achieve high energy savings while preserving compression efficiency. Therefore, a tunable parameter λ and a threshold τ are introduced to balance the trade-off between compression/quality and energy efficiency. Experimental results show that LiveESTR reduces the encoder and decoder energy consumption by 74.6% and 29.7%, with only a 2.1% increase in Bjøntegaard Delta Rate (BD-Rate) compared to traditional per-title encoding. Furthermore, it is shown that by increasing λ to prioritize video quality, LiveESTR achieves 2.2% better compression efficiency in terms of BD-Rate while still reducing decoder energy consumption by 7.5%.


MPEG Column: 142nd MPEG Meeting in Antalya, Türkiye

December 2024

ACM SIGMultimedia Records

The 142nd MPEG meeting was held as a face-to-face meeting in Antalya, Türkiye, and the official press release can be found here and comprises the following items: •MPEG issues Call for Proposals for Feature Coding for Machines •MPEG finalizes the 9th Edition of MPEG-2 Systems •MPEG reaches the First Milestone for Storage and Delivery of Haptics Data •MPEG completes 2nd Edition of Neural Network Coding (NNC) •MPEG completes Verification Test Report and Conformance and Reference Software for MPEG Immersive Video •MPEG finalizes work on metadata-based MPEG-D DRC Loudness Leveling The press release text has been modified to match the target audience of ACM SIGMM and highlight research aspects targeting researchers in video technologies. This column focuses on the 9th edition of MPEG-2 Systems, storage and delivery of haptics data, neural network coding (NNC), MPEG immersive video (MIV), and updates on MPEG-DASH.


Citations (43)


... In this work, we used the Multi-Dimensional Video Compression Dataset (MVCD) [41]. This dataset contains 4 712 000 records of encoding and decoding processes of 1000 different videos from the Inter4K video dataset [42]. ...

Reference:

Real-Time Quality- and Energy-Aware Bitrate Ladder Construction for Live Video Streaming
MVCD: Multi-Dimensional Video Compression Dataset
  • Citing Conference Paper
  • December 2024

... In the second approach, efforts have been made to improve energy efficiency through adaptive selection of video coding parameters, such as resolution and framerate, which significantly influence the energy consumption of video codecs [29][30][31][32]. Azimi et al. [31] proposes to construct a bitrate ladder that ensures that the decoding time stays below a certain threshold to address rebuffering issues and reduce energy consumption. ...

Decoding Complexity-Aware Bitrate-Ladder Estimation for Adaptive VVC Streaming
  • Citing Conference Paper
  • August 2024

... Several studies corroborate this claim, emphasizing the substantial environmental impact of video streaming and the critical importance of mitigating it. 18,19,46,47 However, these analyses often fail to address the specific impacts of short-form video streaming from a systemic perspective, potentially resulting in an incomplete understanding of its environmental effects. 48 While streaming can be a more sustainable alternative in certain contexts, it poses several potential challenges. ...

Green Video Streaming: Challenges and Opportunities
  • Citing Article
  • October 2024

ACM SIGMultimedia Records

... Adaptive video streaming is a transformative force that has been enriched by the tool of Artificial Intelligence (AI). Machine learning algorithms and data analytics can make the streaming method efficient by providing value-added advice on performance improvement, which leads to a stable, and efficient, and effective streaming platform for users [16][17]. Previous methodologies of ABR are substantially more basic, the decision making primarily driven by assessments of bandwidth with minimal consideration for other factors in the immediate environment. ...

REVISION: A Roadmap on Adaptive Video Streaming Optimization
  • Citing Preprint
  • September 2024

... In the second approach, efforts have been made to improve energy efficiency through adaptive selection of video coding parameters, such as resolution and framerate, which significantly influence the energy consumption of video codecs [29][30][31][32]. Azimi et al. [31] proposes to construct a bitrate ladder that ensures that the decoding time stays below a certain threshold to address rebuffering issues and reduce energy consumption. ...

Energy-Aware Spatial and Temporal Resolution Selection for Per-Title Encoding

IEEE Access

... The Variable Framerate (VFR) Pareto-front (PF) approach presents substantial advantages over traditional dynamic spatial resolution PF methods, particularly in terms of Quality of Experience (QoE) and decoding complexity. Dynamic spatial resolution methods [13], [20], [21] adjust the spatial resolution to balance quality and resource usage, but they can result in noticeable shifts in visual detail and clarity. These changes can disrupt the viewer's experience, especially when switching between high and low spatial resolutions. ...

Towards ML-Driven Video Encoding Parameter Selection for Quality and Energy Optimization
  • Citing Conference Paper
  • June 2024

... Over the past decade, advancements in artificial intelligence have led to the development of numerous innovative approaches for addressing challenges across various domains [1][2][3][4][5][6][7][8]. Facial recognition is no exception, and many methods based on deep learning and convolutional neural networks (CNNs) have been introduced [9][10][11][12][13][14]. ...

EVCA: Enhanced Video Complexity Analyzer
  • Citing Conference Paper
  • April 2024

... Based on the built-in power monitoring tools mentioned above, several software tools have been developed, such as PowerAPI [42], CarbonCode [43], and very recently GREEM [44]. CarbonCode has been frequently used by researchers in energy-driven optimizations in video streaming, e.g., [45], [46]. ...

GREEM: An Open-Source Energy Measurement Tool for Video Processing
  • Citing Conference Paper
  • April 2024

... In the second approach, efforts have been made to improve energy efficiency through adaptive selection of video coding parameters, such as resolution and framerate, which significantly influence the energy consumption of video codecs [29][30][31][32]. Azimi et al. [31] proposes to construct a bitrate ladder that ensures that the decoding time stays below a certain threshold to address rebuffering issues and reduce energy consumption. ...

Energy-Aware Resolution Selection for Per-Title Encoding
  • Citing Conference Paper
  • April 2024

... Software as a Service (SaaS) enables cloud users to conveniently access a wide range of software applications through the internet. This service allows users to utilize software applications without the need for local installation or maintenance [4][5]. The Platform as a Service (PaaS) concept enables customers to access a diverse range of platforms and resources online. ...

IoT Privacy Protection: JPEG-TPE With Lower File Size Expansion and Lossless Decryption
  • Citing Article
  • July 2024

IEEE Internet of Things Journal