
Marek DomańskiPoznań University of Technology · Instituteof Multimedia Telecommunications
Marek Domański
Prof. Dr. (prof. dr hab. inż.)
About
325
Publications
39,932
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,970
Citations
Publications
Publications (325)
The application of machine learning to video coding is generally studied in two main approaches: end-to-end video coding using deep neural networks and classic hybrid codecs with individual tools implemented using such networks. This work exploits the latter approach, where a trained Artificial Neural Network (ANN) is used for fast implementation o...
This paper summarizes recent research on network-on-multi-chip (NoMC) at Poznań University of Tech-nology. The proposed network architecture supports hierar-chical addressing and multicast transition mode. Such an ap-proach provides new debugging functionality hardly attain-able in classical hardware testing methodology. A multicast transmission al...
Video acquired from multiple cameras located along a line is often rectified to video virtually obtained from cameras with ideally parallel optical axes collocated on a single plane and principal points on a line. Such an approach simplifies video processing including depth estimation and compression. Nowadays, for many application video, like virt...
Hybrid video compression plays an invaluable role in digital video transmission and storage services and systems. It performs several-hundred-fold reduction in the amount of video data, which makes these systems much more efficient. An important element of hybrid video compression is entropy coding of the data. The state-of-the-art in this field is...
In this paper, the authors describe two methods designed for reducing the spatiotemporal redundancy of the video within the MPEG Immersive video (MIV) encoder: patch occupation modification and cluster splitting. These methods allow optimizing two important parameters of the immersive video: bitrate and pixelrate. The patch occupation modification...
The paper presents a study of a lossy compression impact on depth estimation and virtual view quality. Two scenarios were considered: the approach based on ISO/IEC 23090-12 coder-agnostic MPEG Immersive video standard, and the more general approach based on simulcast video coding. The commonly used compression techniques were tested: VVC (MPEG-I Pa...
This paper presents the color-dependent method of removing interview redundancy from multiview video. The pruning of input views decides which fragments of views are redundant, i.e., do not provide new information about the three-dimensional scene, as these fragments were already visible from different views. The proposed modification of the prunin...
This paper presents a study on the use of encoder-derived features in decoder-side depth estimation. The scheme of multiview video encoding does not require the transmission of depth maps (which carry the geometry of a three-dimensional scene) as only a set of input views and their parameters are compressed and packed into the bitstream, with a set...
The paper deals with Video Coding for Machines that is a new paradigm in video coding related to consumption of decoded video by humans and machines. For such tasks, joint transmission of compressed video and features is considered. In this paper, we focus our considerations of features on SIFT keypoints. They can be extracted from the decoded vide...
This paper introduces an end-to-end learned image compression system, termed ANFIC, based on Augmented Normalizing Flows (ANF). ANF is a new type of flow model, which stacks multiple variational autoencoders (VAE) for greater model expressiveness. The VAE-based image compression has gone mainstream, showing promising compression performance. Our wo...
The paper presents a new approach to multiview video coding using Screen Content Coding. It is assumed that for a time instant the frames corresponding to all views are packed into a single frame, i.e. the frame-compatible approach to multiview coding is applied. For such coding scenario, the paper demonstrates that Screen Content Coding can be eff...
In this paper, the color correction method developed for immersive video systems is presented. The proposed method significantly increases the consistency of color characteristics of multiview sequences, understood both as the temporal and the inter-view consistency, what highly improves the subjective quality of the synthesized virtual views prese...
The paper deals with efficient compression of immersive video representations for the synthesis of video related to virtual viewports, i.e., to selected virtual viewer positions and selected virtual directions of watching. The goal is to obtain possibly high quality of virtual video obtained from compressed representations of immersive video acquir...
In this paper, we propose a depth map refinement method that increases the quality of immersive video. The proposal highly enhances the inter-view consistency of depth maps (estimated or acquired by any method), crucial for achieving the required fidelity of the virtual view synthesis process. In the described method, only information from depth ma...
This paper introduces an end-to-end learned image compression system, termed ANFIC, based on Augmented Normalizing Flows (ANF). ANF is a new type of flow model, which stacks multiple variational autoencoders (VAE) for greater model expressiveness. The VAE-based image compression has gone mainstream, showing promising compression performance. Our wo...
The paper presents a new method of depth estimation, dedicated for free-viewpoint television (FTV) and virtual navigation (VN). In this method, multiple arbitrarily positioned input views are simultaneously used to produce depth maps characterized by high inter-view and temporal consistencies. The estimation is performed for segments and their size...
The paper presents a new method of depth estimation dedicated for free-viewpoint television (FTV). The estimation is performed for segments and thus their size can be used to control a trade-off between the quality of depth maps and the processing time of their estimation. The proposed algorithm can take as its input multiple arbitrarily positioned...
In this paper, we identify the general requirements for network-on-chips (NoCs) and the general characteristics of field-programmable gate arrays (FPGAs) from leading producers. Based on the analysis provided, an FPGA-oriented NoC called RingNet is proposed. As a distinctive feature, RingNet uses communication through a centrally placed memory that...
The extensions of Advanced Video Coding (AVC) and High-efficiency Video Coding (HEVC) for multiview video plus depth allow for coding of 3D video scenes. During the standardization of these extensions it was demonstrated that compression performance can often be improved by application of nonlinear depth transformation prior to the compression and...
This chapter addresses predictive coding methods for depth maps that are required for virtual view synthesis. In multi-view immersive systems, virtual views play a very important role in the overall quality experienced by the users. Despite the fact that depth maps are not viewed by the users, their accuracy has a significant impact on the quality...
This chapter addresses image and video technologies related to 3D immersive multimedia delivery systems with special emphasis on the most promising digital formats. Besides recent research results and technical challenges associated with multiview image and image, video and lightfield acquisition and processing, the chapter also presents relevant r...
This chapter starts by addressing the impact of the inaccurate camera system alignment on the spatial reconstruction accuracy and stereo perception. An experimental study is described, using a stereoscopic camera setup and its deterministic relations derived by trigonometry, spatial model, and basic stereoscopic formulas. The significance of errors...
The paper presents Improved Adaptive Arithmetic Coding algorithm for application in future video compression technology. The proposed solution is based on the Context-based Adaptive Binary Arithmetic Coding (CABAC) technique and uses the authors mechanism of symbols probability estimation that exploits Context-Tree Weighting (CTW) technique. This p...
HEVC (MPEG-H Part 2 and H.265) is a new coding technology which is expected to be deployed on the market along with new video services in the near future. HEVC is a successor of currently widely used AVC (MPEG-4 Part 10 and H.264). In this paper, the quality coding gains obtained for the Cascaded Pixel Domain Transcoder of AVC-coded material to HEV...