Figure 5 - uploaded by Wijnand A Ijsselsteijn
Content may be subject to copyright.
A layered coding syntax provides backwards compatibility to conventional 2D digital TV and allows to adapt the view synthesis to a wide range of different 2D and 3D displays. 

A layered coding syntax provides backwards compatibility to conventional 2D digital TV and allows to adapt the view synthesis to a wide range of different 2D and 3D displays. 

Source publication
Conference Paper
Full-text available
We describe the goals of the ATTEST project, which started in March 2002 as part of the Information Society Technologies (IST) programme, sponsored by the European Commission. In the 2-year project, several industrial and academic partners cooperate towards a flexible, 2D-compatible and commercially feasible 3D-TV system-for broadcast environments....

Contexts in source publication

Context 1
... contrast to former proposals, this new approach is based on a flexible, modular and open architecture that provides important system features, such as backwards compatibility to today's 2D digital TV, scalability in terms of receiver complexity and adaptability to a wide range of different 2D and 3D displays. For this purpose, the data representation and coding syntax of the ATTEST system are based on a layered structure shown in Figure 5. This structure consists of one base layer and at least one additional enhancement layer. ...
Context 2
... scale scenes with a high amount of occlusions) it might be necessary to send further information, for example segmentation masks and occluded texture. The layered structure in Figure 5 is extendable in this sense. As stereovision is only one depth cue and other cues such as motion-parallax are of comparable importance, it is significant to note that the described layered structure is flexible enough to support alternative forms of depth representation. ...

Similar publications

Article
Full-text available
In this paper we will present the concept of a modular three dimensional broadcast chain, that allows for an evolutionary introduction of depth perception into the context of 2D digital TV. The work is performed within the framework of the European Information Society Technologies (IST) project "Advanced Three-dimensional Television System Technolo...

Citations

... Three-dimensional (3D) videos provide better immersive experiences to viewers than traditional 2D videos, thus they have been becoming more and more popular in entertainments [1,2,3]. One typical format for storage and online distribution of 3D videos is the stereoscopic format, which contains two different 2D views captured by two parallel cameras for each frame. ...
... This research is supported by the National Natural Science Foundation of China (61602527,61772555,61772553,U1734208), Natural Science Foundation of Hunan Province, China (2020JJ4746), and Guangxi Key Laboratory of Trusted Software (KX202032) view with its depth map for each frame. Compared with the stereoscopic format, the DIBR format saves storage and transmission bandwidth by efficiently compressing the depth map and thus is more widely adopted [2,3]. ...
Conference Paper
Copyright protection of depth image-based rendering (DIBR) 3D videos is crucial due to the popularity of these videos. Despite the success of recent watermarking schemes, it is still challenging to ensure the robustness against strong geometric attacks when both lossless quality and distinguishability of protected videos are required. In this paper, we pro-pose a novel zero-watermarking scheme to improve the performance under strong geometric attacks when satisfying the other two requirements. In our scheme, CT-SVD-based features are extracted to ensure both distinguishability and robustness against signal processing and DIBR conversion at-tacks, while a SIFT-based rectication mechanism is designed to resist geometric attacks. Further, an attention-based fusion strategy is proposed to complement the robustness of rectied and unrectied CT-SVD features. Experimental results demonstrate that our scheme outperforms the existing zero-watermarking schemes in terms of distinguishability and robustness against strong geometric attacks such as rotation, cyclic translation and shearing.
... Protecting the copyrights of three-dimensional (3D) videos has become a crucial issue [4,26,33,45]. On the one hand, the risk of copyright infringement for 3D videos has increased because 3D videos, which can provide better immersive experiences to viewers than traditional 2D videos, have become more and more popular over the Internet [22,32,40,46]. On the other hand, the illegal copying and distribution of 3D videos will cause more serious losses to their owners than that of 2D videos because the production costs of 3D videos are much higher. ...
... In this study, DIBR 3D video is chosen as the protection target for the following two reasons. First, 3D videos stored in DIBR format can reduce storage and transmission-bandwidth requirements compared with stereoscopic format [31,40]. Second, DIBR is a key technique for converting existing 2D videos to 3D videos [5,27,48], and 3D videos synthesized using DIBR have much lower costs than those using stereo videos because they do not require different cameras to capture video frames from various viewpoints. ...
... Four sequences of synthesized frames are generated by setting the baseline distances as 5% and 7% of the video frame width. These baseline distances have been chosen carefully because baseline distances larger than 7% may induce discomfort for 3D viewing [1,40]. All original 2D frames, depth maps and synthesized frames suffer 14 types of attacks, with a total of 26 different attacks. ...
Article
Digital rights management (DRM) of depth-image-based rendering (DIBR) 3D video is an emerging area of research. Existing schemes for DIBR 3D video cause video distortions, are vulnerable to severe signal and geometric attacks, cannot protect 2D frames and depth maps independently, or have difficulty handling large-scale videos. To address these issues, a novel zero-watermark scheme based on invariant features and similarity-based retrieval to protect DIBR 3D video (RZW-SR) is proposed in this study. In RZW-SR, invariant features are extracted to generate master and ownership shares to provide distortion-free, robust and discriminative copyright identification under various attacks. Different from conventional zero-watermark schemes, our proposed scheme stores features and ownership shares correlatively and designs a similarity-based retrieval phase to provide effective solutions for large-scale videos. In addition, flexible mechanisms based on attention-based fusion are designed to protect 2D frames and depth maps, either independently or simultaneously. The experimental results demonstrate that RZW-SR has superior DRM performance compared to existing schemes. First, RZW-SR can obtain the ownership shares relevant to a particular 3D video precisely and reliably for effective copyright identification of large-scale videos. Second, RZW-SR ensures lossless, precise, reliable and flexible copyright identification for 2D frames and depth maps of 3D videos.
... In DIBR, both the left and right views are produced by a monoscopic center image and associated per-pixel depth information of the same scene, known as the depth map. In recent years, DIBR [35] technique for 3D TV has gained significant attention from the research community due to the following provisions: (i) User configurable depth degree by modifying the depth image. (ii), lower transmission bit-rate per scene attributable to smooth areas in the depth map [16], and the ability to be compressed to a much lower bit rate than a normal color representation does [12], and (iii) backward compatibility of DIBR which allows the center image itself to be used as a content on 2D-TV. ...
Article
Full-text available
Depth Image Based Rendering (DIBR) for 3D-TV, or free-view TV, is one of the most promising techniques in multimedia world, whereby a monoscopic image and the depth image of the same view are utilized to generate stereoscopic left and right images. Therefore, the protection of valuable content generated for 3D TV is an important concern in the world of digital media. In this paper, we exploit an interpolation errors expansion scheme by employing Genetic Programming based smart reversible watermarking technique that is viable for 3D-TV. The proposed technique exploits directional weights using hidden dependencies pertaining to the imperceptibility and capacity of the watermark in a previously established interpolation scheme. It then embeds the watermark in the 3D content using interpolation error expansion based reversible watermarking scheme. Previously presented empirical techniques are not much effective as they use hit and trial strategies for selecting optimal weights for watermark embedding. The proposed technique achieves significant watermark capacity as well as imperceptibility, and is reversible when compared to existing state of the art techniques.
... Multiple broadcasting service providers use DIBR-3D technology and many European and Asian countries have commenced its transmission. DIBR-3D TV is the revolution in traditional television systems and made it even more smart where viewers enjoy lifelike scenes on their traditional home television [4]. DIBR-3D TV contents were first introduced in Advance Three-Dimensional Television System Technology (ATTEST) [4] by Fehn et. ...
... DIBR-3D TV is the revolution in traditional television systems and made it even more smart where viewers enjoy lifelike scenes on their traditional home television [4]. DIBR-3D TV contents were first introduced in Advance Three-Dimensional Television System Technology (ATTEST) [4] by Fehn et. al. [5]. ...
... Depth map is the gray-scale image, consist of information related to the distance of objects in the scene from the viewpoint. The depth map and the corresponding input 2D video/image are combined to generate the left and right view using DIBR procedure for the DIBR-3D TV system [4]. 3D technology has improved over the past few years and the generation of 3D contents has been enhanced significantly. ...
Article
Full-text available
DIBR-3D technology has evolved over the past few years with the demands of consumers increasing in recent times for future free-view 3D videos on their home televisions. The main issue in 3D technology is the lack of 3D content available to watch using the traditional TV systems. Although, some sophisticated devices like stereoscopic cameras have been used to fill the gap between the 3D content demand and 3D content supply. But the content generated through these sophisticated devices can not be displayed on the traditional TV systems, so there needs to be some mechanism which is inline with the traditional TV. Furthermore, the huge collection of existing 2D content should be converted to 3D using depth image-based rendering techniques. This conversion technique can highly contribute in overcoming the shortage problem of the 3D content. This paper presents a novel approach for converting 2D degraded image for DIBR 3D-TV view. This degraded or noisy/blur image is enhanced through image dehazing and Directional Filter Bank (DFB). This enhancement is necessary because of the occlusion effect or hole filling problem that occurs due to imperfect depth map. The enhanced image is then segmented into the foreground image and the background image. After the segmentation, the depth map is generated using image profiles. Moreover, Stereoscopic images are finally produced using the DIBR procedure which is based on the 2D input image and the corresponding depth map. We have verified the results of the proposed approach by comparing the results with the existing state-of-the-art techniques.
... HE three-dimensional (3D) videos can provide better immersive experiences to viewers than traditional 2D videos and thus have been becoming popular over the Internet [1][2]. At the same time, the risk of copyright infringement for 3D videos has also been increasing. ...
... In this study, the DIBR-based format is chosen as the protection target due to the following two reasons. First, stereo videos stored in DIBR-based format can save storage and transmission-bandwidth because the depth maps, which mainly consist of gray-level pixels and smooth areas, are amenable to effective compression [2]. Second, existing 2D videos can be converted to 3D videos based on DIBR technique, and the 3D videos synthesized in this manner costs are much lower because they do not need different cameras to capture video frames from various viewpoints. ...
Article
The copyright protection of 3D videos has become a crucial issue. In this study, a novel zero-watermark scheme with similarity-based retrieval is proposed. In our proposed scheme the features of both 2D-video and depth-map components are first extracted. Next, master shares and ownership shares are generated from these features and their relevant copyright information based on (2,2) visual secret sharing scheme. Different with traditional zero-watermark schemes, both the features and ownership shares are stored in relevant databases. When a 3D video is queried, a novel similarity-based retrieval phase is designed to obtain the ownership shares relevant to the particular 3D video. After that, the queried master shares are generated from this 3D video and stacked with its relevant ownership shares to identify its copyright ownership. To satisfy different DRM requirements of 3D videos, flexible mechanisms are designed for both similarity-based retrieval and copyright identification functions in our study. The experimental results demonstrate that RZW-SR3D not only obtains the ownership shares relevant to a particular 3D video precisely and reliably when processing numerous videos, which outperforms the traditional zero-watermark schemes, but also identifies the copyrights of 2D-video and depth-map components of 3D videos reliably, independently and simultaneously without any content distortion or watermark-embedding limitation, which outperforms existing 2D-video based and depth-map based watermark schemes for protecting 3D video.
... A major issue associated with this approach is that pixels are selected evenly without giving importance to fine details. [5] proposes view dependent 3D video compression technique for mobile devices. [6] and [4] use image and zlib compression, respectively. [7] proposes activity aware adaptive compression by combining activity recognition and realtime morphing based compression. ...
... Silhoute based meshes are directly transmitted by [Petit et al. 2009]. Image and zlib compression are used by [Redert et al. 2002] and [Yang et al. 2006a] respectively. View dependent compression was proposed by [Shi et al. 2009] for mobile devices. ...
... Compression and transmission: The generated mesh is compressed using zlib. The texture image is encoded as a jpeg image similar to [Redert et al. 2002] and [Yang et al. 2006a]. The data is then transmitted to the receiver for rendering. ...
Conference Paper
3D Tele-Immersion (3DTI) systems capture and transmit large volumes of data per frame to enable virtual world interaction between geographically distributed people. Large delays/latencies introduced during the transmission of these large volumes of data can lead to poor quality of experience of the 3DTI systems. Such poor experiences can possibly be overcome by animating the previously received mesh using the current skeletal data (that is very small in size and hence experiences much lower communication delays). However, using just the previously transmitted mesh for animation is not ideal and could render inconsistent results. In this paper, we present a DIstortion Score based Pose SElection (DISPOSE) approach to render the person by using an appropriate mesh for a given pose. Unlike pose space animation methods that require manual or offline time consuming pose set creation, our distortion score based scheme can choose the mesh to be transmitted and update the pose set accordingly. DISPOSE works with partial meshes and does not require dense registration enabling real time pose space creation. With DISPOSE incorporated into 3DTI, the latency for rendering the mesh on the receiving side is limited by only the transmission delay of the skeletal data (which is only around 250 bytes). Our evaluations show the effectiveness of DISPOSE for generating good quality online animation faster than real time.
... Current methods to deal with this problem involve some form of compression of the data. Some methods use standard compression libraries directly on the color and depth data [2]. Some of the methods use a combination of standard compression libraries to compress the data before it is sent over the Internet [3]. ...
Conference Paper
Full-text available
3D collaborative Tele-Immersive environments allow reconstruction of real world 3D scenes in the virtual world across multiple physical locations. This kind of reconstruction results in a lot of 3D data being transmitted over the internet in real time. The current systems allow for transmission at low frame rates due to the large volume of data and network bandwidth restrictions. In this paper we propose a prediction based approach that generates future frames by animating the live model based on few skeleton points. By doing so the magnitude of data transmitted is reduced to few hundred bytes. The prediction errors are corrected when an entire frame is received. This approach allows minimal amounts (few bytes) of data to be transmitted per frame, thus allowing for high frame rates and still maintain an acceptable visual quality of reconstruction at the receiver side.
... For example, Levoy [8] proposed to use polygons to assist JPEG/MPEG compression. 3D image warping has been similarly applied for different purposes in various remote rendering systems [10,2,6] and other applications [16,13]. However, we believe we are the first to study using the rendering contexts to assist real-time video coding in the area of cloud gaming, and we present the best solution so far to integrate the graphics rendering contexts and IBR techniques into video coding. ...
... But as we will introduce later in this section, we can also select the frames that do not exist in the video sequence as the reference frame. Our work is also different from the projects that apply 3D image warping techniques to generate stereo views from 2D sources [16]. It is because the 2D-3D conversion systems usually focus on high quality warping while our system concentrates on coding performance improvement. ...
Conference Paper
Full-text available
The emerging cloud gaming service has been growing rapidly, but not yet able to reach mobile customers due to many limitations, such as bandwidth and latency. We introduce a 3D image warping assisted real-time video coding method that can potentially meet all the requirements of mobile cloud gaming. The proposed video encoder selects a set of key frames in the video sequence, uses the 3D image warping algorithm to interpolate other non-key frames, and encodes the key frames and the residues frames with an H.264/AVC encoder. Our approach is novel in taking advantage of the run-time graphics rendering contexts (rendering viewpoint, pixel depth, camera motion, etc.) from the 3D game engine to enhance the performance of video encoding for the cloud gaming service. The experiments indicate that our proposed video encoder has the potential to beat the state-of-art x264 encoder in the scenario of real-time cloud gaming. For example, by implementing the proposed method in a 3D tank battle game, we experimentally show that more than 2 dB quality improvement is possible.
... These systems capture scenes based on the LDV (Layered Depth Video) and MVD (Multiview Plus Depth Video) formats. Several research projects in Europe ATTEST [10] and 3D4YOU [11] [12] are investigating production techniques that produce accurate content depth maps. These depth maps can be treated as an additional video component to sum up to the two dimensional video information ...
Article
Full-text available
This paper presents an analysis of the characteristics of the DVB-T2 standard to deliver 3DTV contents (3DTV). The work has been based on the current state-of-the-art developments in technologies that enable 3DTV delivery, from production to displays, with special emphasis on formats and coding efficiencies. Based on bitrate and format requirements the capacity of DVB-T2 is investigated, proposing two roll-out scenarios and discussing the associated DVB-T2 configuration parameters. The target receivers are both fixed and portable, and different backward compatibility problems and solutions are highlighted. The result of this work is an estimation of the number and type of services (both 2D and 3D) that this standard will be able to deliver in the mid term future as 2D HD and 3D services become commercially available.