Houari Sabirin
Research interests
-
InterestsObject Tracking, Video coding, multimedia application format, Photography, Management Information Systems, Multimedia, Hypermedia, Object Recognition
Research experience
-
Oct 2011–
presentResearch: HEVC implementation for SoC
Korea Advanced Institute of Science and TechnologySouth Korea · DaejeonDevelopment of High efficiency video coding (HEVC) encoder and decoder for system on chip application -
Jun 2011–
Oct 2011Research: Hybrid HEVC – MPEG-2 coding for 3D broadcasting
Korea Advanced Institute of Science and TechnologySouth Korea · DaejeonDevelopment of interface between HEVC reference software and MPEG-2 reference software for hybrid HEVC-MPEG-2 coding for 3D broadcasting -
Aug 2009–
Oct 2011Research: Object detection and tracking in compressed domain
Korea Advanced Institute of Science and TechnologySouth Korea · DaejeonDevelopment of object detection and tracking method in compressed bitstreams for reliable detection and tracking with low computational complexity -
Feb 2007–
Oct 2009Research: Multimedia application format
Korea Advanced Institute of Science and Technology · Department of Information and Communications EngineeringSouth Korea · DaejeonDevelopment of multimedia application format standardization in ISO/IEC JTC 1/SC 29/WG 11 Moving Picture Experts Group (MPEG)
Education
-
Aug 1996–
Aug 2001Institut Teknologi Bandung
Physics · B.SIndonesia · Bandung
Other
-
LanguagesIndonesia
English
Publications
-
1.82Impact points
Moving Object Detection and Tracking using A Spatio-temporal Graph in H.264/AVC bitstreams for Video Surveillance
IEEE Transactions on Multimedia. 06/2012;
This paper presents a spatio-temporal graph based method of detecting and tracking moving objects by treating the encoded blocks with non-zero motion vectors and/or non-zero residues as potential parts of objects in H.264/AVC bitstreams. A spatio-temporal graph is constructed by first clustering the... [more] This paper presents a spatio-temporal graph based method of detecting and tracking moving objects by treating the encoded blocks with non-zero motion vectors and/or non-zero residues as potential parts of objects in H.264/AVC bitstreams. A spatio-temporal graph is constructed by first clustering the encoded blocks of potential object parts into block groups, each of which is defined as an attributed subgraph where the attributes of the vertices represent the positions, motion vectors and residues of the blocks. In order to remove false-positive blocks and to track the real objects, temporal connections between subgraphs in two consecutive frames are constructed and the similarities between subgraphs are computed, which constitutes a spatio-temporal graph. We show the experimental results that the proposed spatio-temporal graph based representation of potential object blocks enables effective detection for the small-sized objects and the objects with small motion vectors and residues, and allows for reliable tracking of the detected objects even under occlusion. The identification of the detected moving objects is determined as rectangular ROIs for which the ROI sizes and positions are adaptively adjusted to give the best approximation of the real shapes and positions of the objects.
-
Real-time detection and tracking of multiple objects with partial decoding in H.264/AVC bitstream domain
02/2012;
In this paper, we show that we can apply probabilistic spatiotemporal macroblock filtering (PSMF) and partial decoding processes to effectively detect and track multiple objects in real time in H.264|AVC bitstreams with stationary background. Our contribution is that our method cannot only show fast... [more] In this paper, we show that we can apply probabilistic spatiotemporal macroblock filtering (PSMF) and partial decoding processes to effectively detect and track multiple objects in real time in H.264|AVC bitstreams with stationary background. Our contribution is that our method cannot only show fast processing time but also handle multiple moving objects that are articulated, changing in size or internally have monotonous color, even though they contain a chaotic set of non-homogeneous motion vectors inside. In addition, our partial decoding process for H.264|AVC bitstreams enables to improve the accuracy of object trajectories and overcome long occlusion by using extracted color information.
-
4.91Impact points
The MPEG Musical Slide Show Application Format: Enriching the MP3 Experience [Standards in a Nutshell]
Signal Processing Magazine, IEEE. 08/2011;
The ISO/IEC 23000-4 MPEG-A Part 4: The Musical Slide Show Application Format (MSS AF) standard is a storage format that specifies the synchronization of MP3 audio, Joint Picture Experts Group (JPEG) images and 3rd Generation Partnership Project (3GPP) timed text data in conjunction with the descript... [more] The ISO/IEC 23000-4 MPEG-A Part 4: The Musical Slide Show Application Format (MSS AF) standard is a storage format that specifies the synchronization of MP3 audio, Joint Picture Experts Group (JPEG) images and 3rd Generation Partnership Project (3GPP) timed text data in conjunction with the descriptions for image rendering animation using MPEG-4 Lightweight Application for Scene Representation (LASeR) scene representation. The creation information for MP3 audio and JPEG images, and the color and texture descriptions for JPEG images can also be generated for richer content description using MPEG-7 metadata. By specifying such a storage format, the standard enables different multimedia contents (in this case audio, images, texts and metadata) to be integrated into a single file in a structured way. In addition to this, the standard specifies the protection and governance schemes that enable flexible rights management of the contents seamlessly. Therefore, new business models can be created with the governed contents in secured and controlled manners according to their target applications. This article provides an overview of the technologies used in the standard as well as application examples to show its benefits and advantages.
-
1.66Impact points
Archive and Preservation of Media Content Using MPEG-A
Multimedia, IEEE. 01/2011;
This article describes a standardized packaging format for digital media files. Archiving is accomplished through a hierarchical file structure and rich contextual information, while preservation is realized by enabling portability in the structure and file attributes. Advanced functionality, such a... [more] This article describes a standardized packaging format for digital media files. Archiving is accomplished through a hierarchical file structure and rich contextual information, while preservation is realized by enabling portability in the structure and file attributes. Advanced functionality, such as usage governance, is supported by the packaging format.
-
4.91Impact points
The MPEG Musical Slide Show Application Format: Enriching the MP3 Experience [Standards in a Nutshell]
IEEE Signal Processing Magazine. 01/2011; 28:136-141.
The ISO/IEC 23000-4 MPEG-A Part 4: The Musical Slide Show Application Format (MSS AF) standard is a storage format that specifies the synchronization of MP3 audio, Joint Picture Experts Group (JPEG) images and 3rd Generation Partnership Project (3GPP) timed text data in conjunction with the descript... [more] The ISO/IEC 23000-4 MPEG-A Part 4: The Musical Slide Show Application Format (MSS AF) standard is a storage format that specifies the synchronization of MP3 audio, Joint Picture Experts Group (JPEG) images and 3rd Generation Partnership Project (3GPP) timed text data in conjunction with the descriptions for image rendering animation using MPEG-4 Lightweight Application for Scene Representation (LASeR) scene representation. The creation information for MP3 audio and JPEG images, and the color and texture descriptions for JPEG images can also be generated for richer content description using MPEG-7 metadata. By specifying such a storage format, the standard enables different multimedia contents (in this case audio, images, texts and metadata) to be integrated into a single file in a structured way. In addition to this, the standard specifies the protection and governance schemes that enable flexible rights management of the contents seamlessly. Therefore, new business models can be created with the governed contents in secured and controlled manners according to their target applications. This article provides an overview of the technologies used in the standard as well as application examples to show its benefits and advantages.
-
1.66Impact points
DMB Application Format for Mobile Multimedia Services
IEEE Multimedia. 01/2011;
Digital Multimedia Broadcasting Application Format (DMB AF) is Part 9 standard of ISO/IEC 23000 MPEG-A Multimedia Application Formats. The DMB AF standard specifies a structured file format for storage and playback of DMB (i.e., T-DMB, S-DMB, DAB, and DAB+) contents, enabling a variety of different ... [more] Digital Multimedia Broadcasting Application Format (DMB AF) is Part 9 standard of ISO/IEC 23000 MPEG-A Multimedia Application Formats. The DMB AF standard specifies a structured file format for storage and playback of DMB (i.e., T-DMB, S-DMB, DAB, and DAB+) contents, enabling a variety of different types of multimedia components to be easily packaged, managed, exchanged and played, exploiting widely deployed DMB receivers, in governed and protected ways. This paper provides an overview of the DMB AF standard including the usage scenarios that show its advantages and benefits.
-
Graph-based object detection and tracking in H.264/AVC bitstreams for surveillance video
Multimedia and Expo (ICME), 2011 IEEE International Conference on; 01/2011
In this paper we present a novel method to detect and track moving objects in H.264/AVC bitstreams by processing motion vector and residue information. The encoded blocks with nonzero motion vectors and residues are first detected as moving object candidates. A spatio-temporal graph in video sequenc... [more] In this paper we present a novel method to detect and track moving objects in H.264/AVC bitstreams by processing motion vector and residue information. The encoded blocks with nonzero motion vectors and residues are first detected as moving object candidates. A spatio-temporal graph in video sequences is then constructed to represent groups of blocks in each frame and their associations to the other groups of blocks in subsequent frames. Identification and refinement of ROIs for moving objects being tracked are done by graph matching and adaptive ROI-size adjustment. The experimental results show that the proposed method can correctly identify real moving objects from frame to frame and can effectively detect small-sized objects and objects with small motion vectors and residues, as well as by recognizing moving objects even under occlusion.
-
Archive and Preservation of Media Content Using MPEG-A
IEEE Multimedia. 01/2010; 17:94-99.
This article describes a standardized packaging format for digital media files. Archiving is accomplished through a hierarchical file structure and rich contextual information, while preservation is realized by enabling portability in the structure and file attributes. Advanced functionality, such a... [more] This article describes a standardized packaging format for digital media files. Archiving is accomplished through a hierarchical file structure and rich contextual information, while preservation is realized by enabling portability in the structure and file attributes. Advanced functionality, such as usage governance, is supported by the packaging format.
-
Musical slide show MAF with protection and governance using MPEG-21 IPMP Components and REL
Multimedia on Mobile Devices 2007, San Jose, CA, USA; 01/2007
The Musical Slide Show Multimedia Application Format (MAF) which is currently being standardized by the Moving Picture Expert Group (MPEG) conveys the concept of combining several established standard technologies in a single file format. It defines the format of packing up MP3 audio data, along wit... [more] The Musical Slide Show Multimedia Application Format (MAF) which is currently being standardized by the Moving Picture Expert Group (MPEG) conveys the concept of combining several established standard technologies in a single file format. It defines the format of packing up MP3 audio data, along with JPEG images, MPEG-7 Simple Profile metadata, timed text, and MPEG-4 LASeR script. The presentation of Musical Slide Show MAF contents is made in a synchronized manner with JPEG images, timed text to MP3 audio track. Also, the rendering effect on JPEG images can be supported by the MPEG-4 LASeR script. This Musical Slide Show MAF will enrich the consumption of MP3 contents assisted with synchronized and rendered JPEG images, text as well as MPEG-7 metadata about the MP3 audio contents. However, there is no protection and governance mechanism for Musical Slide Show MAF which is the essential elements to deploy the sorts of contents. In this paper, to manage the Musical Slide Show MAF contents in a controlled manner, we present a protection and governance mechanism by using MPEG-21 Intellectual Property Management and Protection (IPMP) Components and MPEG-21 Rights Expression Language (REL) technologies We implement an authoring tool and a player tool for Musical Slide Show MAF contents and show the experimental results as well.
Following (2)
-
Aries Susanto HT
Korea Advanced Institute of Science and Technology -
Arief Sasongko Adhi
Badan Tenaga Nuklir Nasional