Dulce B. Ponceleon
IBM · Computer Science, Almaden

Doctor of Philosophy

About

67
Publications
7,644
Reads
1,245
Citations
Introduction
Natural Language Processing

Publications (67)
Patent
Embodiments of the invention relate to digital content protection for recordable media using encryption and decryption based on device keys in the media. The invention increases the number of extended applications supported by the media key blocks and facilitates the assignment of the applications to the media key blocks. One aspect of the invention c...
Patent
Embodiments of this disclosure relate to binary tree structures, and more specifically to assigning resources to a binary tree structure, such as for content protection. In embodiments, a total number of resources in a first category of resources is identified. The resources may be devices that play encryption protected content such as a CD, a DVD,...
Patent
According to one embodiment of the present invention, a system, method, and computer program product is provided for rebinding title keys in clusters of devices with distinct security levels in broadcast encryption systems. The method includes receiving a new management key and unbinding an encrypted title key with a previously used management key,...
Conference Paper
As digital media replaces analog, consumers are beginning to discover that their devices no longer interoperate in the way they are accustomed to. For example, they may find it impossible to transfer a DVR-recorded show onto a portable media player from a different manufacturer. Content creators are weary of supporting myriad ecosystems, each with...
Conference Paper
The future of the multimedia community depends on how effectively and efficiently the community recruits, nurtures and retains young talent. Students tend to decide on their majors based on job opportunities, and the main question in every student's mind while finishing a degree is "which jobs are out there for me?" In this panel, we have gathered peopl...
Conference Paper
"Networking of Multimedia Women" event is a continuation of an on-going conversation in the multimedia research community and efforts by the ACM SIGMM to engage and promote female researchers in multimedia community, enable networking of junior and senior female researchers, and give insights towards successful professional careers based on example...
Conference Paper
In this paper, we describe SCOTT: a demonstration system that uses the Set Cover Tracing algorithm for determining the source of pirate content. This algorithm is very efficient in dealing with collusion attacks - the performance is close to linear in the number of colluders. However, the algorithm is based on the Set Cover Problem, which is known...
Conference Paper
Broadcast encryption is a well-established alternative to public key encryption for use in content protection systems. It offers significant performance benefits, as well as useful features such as one-to-many delivery, dynamic membership in the authorized receivers group, and anonymous access to content, permitting content protection syste...
Conference Paper
Devising effective Content Protection mechanisms and building satisfactory Digital Rights Management systems have been top priorities for the Publishing and Entertainment Industries in recent years. In this tutorial, we focus on protection tools and standards for entertainment content. We analyze the challenges of content protection systems, rangin...
Article
Full-text available
Compact representations of video data greatly enhance efficient video browsing. Such representations provide the user with information about the content of the particular sequence being examined while preserving the essential message. We propose a method to automatically generate video summaries using transcripts obtained by automatic speech recog...
Conference Paper
Multimedia content protection is a controversial topic. Content owners want to protect their rights, while consumers want flexible usage, privacy and seamless content flow. In this tutorial we cover everything from cryptography fundamentals to history, emerging standards, state-of-the-art approaches and live demos.
Conference Paper
Full-text available
As the marketplace for digital media grows, we witness the rise of new media distribution models where timely delivery, convenience, privacy and personalization are essential features of competitive offerings. Consumers are looking for innovative ways to access content in a service-oriented manner that suits their mobile lifestyle. This paper d...
Conference Paper
Full-text available
Video data is increasingly being used in medical diagnosis. Due to the quality of the video and the complexities of the underlying motion captured, it is difficult for an inexperienced physician/radiologist to describe motion abnormalities in a crisp way, leading to possible errors in diagnosis. In this paper, we present a method of capturing video si...
Conference Paper
Full-text available
We present WISDNA (Web information services DNA) as an information visualization paradigm for XML data from Web information services. We propose a visualization and navigational tool based on Context Lenses and MovieDNA to address the need to support exploratory navigation that seamlessly integrates querying with browsing. We present linear and hie...
Article
Full-text available
With ever-increasing amounts of data in repositories, finding the information a user needs becomes increasingly challenging. When users approach an information access system they often do not have a clear understanding of how to access the precise information they need. Additionally, it is not rare that users have only a vague notion o...
Article
It is difficult to understand a multimedia signal without being able to say something about its semantic content or its meaning. This chapter describes two algorithms that help bridge the semantic understanding gap that we have with multimedia. In both cases we represent the semantic content of a multimedia signal as a point in a high-dimensional spa...
Article
Full-text available
It is difficult to understand a multimedia signal without being able to say something about its semantic content or its meaning. This chapter describes two algorithms that help bridge the semantic understanding gap that we have with multimedia. In both cases we represent the semantic content of a multimedia signal as a point in a high-dimensional s...
Chapter
People can browse text documents very quickly and efficiently. A user can find, within seconds, a relevant document from a dozen retrieved items listed on a screen. On the other hand, browsing of multiple audio and video documents could be very time-consuming. Even the task of browsing a single one-hour video to find a relevant segment might take c...
Article
Full-text available
In this paper, we describe a system for automatic and interactive content-based retrieval of video that integrates features, models, and semantics. The novelty of the approach lies in the (1) semi-automatic construction of models of scenes, events, and objects from feature descriptors, and (2) integration of content-based and model-based querying i...
Conference Paper
Full-text available
Compact representations of video data can enable efficient video browsing. Such representations provide the user with information about the content of the particular sequence being examined while preserving the essential message. We propose a method to automatically generate video summaries for long videos. Our video summarization approach involves...
Article
Commercial presentation software focuses on authoring, rather than on supporting the presentation process itself. Typical slide navigation tools are disruptive to the presentation flow and could disclose slide eide We present a navigation tool for slide presentations based on design principles such as "never hide the current slide". We keep the tool simp...
Conference Paper
The amount of digital video and associated metadata being generated and stored is increasing rapidly. Given the complex spatial and temporal structure of video information it is a formidable challenge to provide compact and human-readable representations of such content. The solution has to be versatile in order to satisfy different user needs, suc...
Conference Paper
Full-text available
This paper describes a new representation for the audio and visual information in a video signal. We reduce the dimensionality of the signals with singular-value decomposition (SVD) or mel-frequency cepstral coefficients (MFCC). We apply these transforms to word, (word transcript, semantic space or latent semantic indexing), image (color histog...
Conference Paper
This paper describes a new algorithm which discovers the hierarchical organization of a document or media presentation. We use latent semantic indexing to describe the semantic content of the signal, and scale-space segmentation to describe its features at many different scales. We present results from a text document and a video transcript.
Article
Full-text available
Speech is a tantalizing mode of human communication. On the one hand, humans understand speech with ease and use speech to express complex ideas, information, and knowledge. On the other hand, automatic speech recognition with computers is very hard, and extracting knowledge from speech is even harder. Nevertheless, the potential reward for solving...
Conference Paper
This paper examines the issues related to developing web applications that use digital media, with particular emphasis on digital video. The nature of digital video brings additional complexity to engineering solutions on the web due to the large data sizes in comparison with text, the temporal nature of video, proprietary data formats, and issues...
Conference Paper
An algorithm for the segmentation of an audio/video source into topically cohesive segments based on automatic speech recognition (ASR) transcriptions is presented. A novel two-pass algorithm is described that combines a boundary-based method with a content-based method. In the first pass, the temporal proximity and the rate of arrival of ngram featur...
Conference Paper
Full-text available
With more and more streaming media servers becoming commonplace, streaming video has now become a popular medium of instruction, advertisement, and entertainment. With such prevalence comes a new challenge to the servers: Can they track browsing behavior of users to determine what interests users? Learning this information is potentially valuable no...
Conference Paper
Full-text available
This paper describes a new unified representation for the information in a video. We reduce the dimensionality of the signal with either a singular-value decomposition (on the semantic and image data) or mel-frequency cepstral coefficients (on the audio data) and then concatenate the vectors to form a multi-dimensional representation of the video...
Conference Paper
Experts from leading streaming media formats together with representatives of open standards for media are brought together to discuss about mainstream. However, a common theme emerging from the panelists encompasses several technical issues. Such problems are acknowledged, but not considered to be the primary reason that corporations are not reali...
Conference Paper
Speech is a tantalizing mode of human communication. On the one hand, humans understand speech with ease and use speech to express complex ideas, information, and knowledge. On the other hand, automatic speech recognition with computers is still very hard, and extracting knowledge from speech is even harder. In this paper we motivate the study of s...
Conference Paper
Full-text available
This paper addresses the problem of automatic detection of salient video segments for real-world applications such as corporate training based on associated speech transcriptions. We present a novel segmentation algorithm based on automatic speech recognition (ASR) applied to the audio track of the video. Our feature set consists of word n-grams ex...
Article
This paper describes a signal processing algorithm which discovers the hierarchical organization of a document or media presentation. We use latent semantic indexing to describe the semantic content of the signal, and scale-space segmentation to describe its features at many different scales. We represent the semantic content of the document a...
Conference Paper
In the IBM CueVideo project we study various aspects of fully automated video indexing, browsing and retrieval. The technical aspects include audio processing, speech recognition, image processing and information retrieval. Equally important, however, is exploring user expectations and conducting user studies. We focus on the field of video for Tra...
Conference Paper
Advances in speech recognition technology have shown encouraging results for spoken document retrieval where the average precision often approaches 70% of that achieved for perfect text transcriptions. Typical applications of spoken document retrieval pertain to retrieval of stories from archived video/audio assets. In the CueVideo project, our app...
Conference Paper
Content descriptions are commonly used to index audiovisual content for search and retrieval applications. We present multimedia descriptions that can be used to facilitate rapid navigation, browsing, and efficient access to different views of audiovisual programs according to personal preferences and usage conditions. In particular, we illustrate...
Conference Paper
The detection and recognition of events is a challenging problem in video databases. It involves cross-linking and combining information available in multiple modalities such as audio, video and associated text metadata. CueVideo is a system designed for the discovery and recognition of specific events called topics of discussion through advanced v...
Conference Paper
In recent years, a few transitions in multimedia applications may be observed. We can identify at least two trends. Firstly, multimedia is being introduced in mainstream applications, leaving behind its traditional focus on highly professional markets, and on the gaming enhancement arena. Secondly, standardization bodies continue to work on media s...
Conference Paper
Effective use of digital video can be greatly improved by a combination of two technologies: computer vision for automated video analysis and information visualization for data visualization. The unstructured spatio-temporal nature of video poses tough challenges in the extraction of semantics using fully automated techniques. In the CueVideo proje...
Conference Paper
Effective use of digital video can be greatly improved by a combination of two technologies: computer vision for automated video analysis and information visualization for data visualization. The unstructured, spatio-temporal nature of video poses tough challenges in the extraction of semantics using fully automated techniques. In the CueVideo proj...
Conference Paper
The role of audio in the context of multimedia applications involving video is becoming increasingly important. Many efforts in this area focus on audio data that contains some built-in semantic information structure such as in broadcast news, or focus on classification of audio that contains a single type of sound such as clear speech or clear mu...
Conference Paper
Full-text available
This paper describes a series of algorithms that allow the unconstrained walkthrough of static scenes shaded with the results of precomputed global illumination. The global illumination includes specular as well as diffuse terms, and intermediate results are cached as surface light fields. The compression of such light fields is examined, and a laz...
Conference Paper
QBIC™ (Query By Image Content) is a set of technologies and associated software that allows a user to search, browse, and retrieve image, graphic, and video data from large on-line collections. This paper discusses current research directions of the QBIC project such as indexing for high-dimensional multimedia data, retrieval of gray level images,...
Conference Paper
Multimedia data is an increasingly important information medium today. Providing intelligent access for effective use of this information continues to offer challenges in digital library research. As computer vision, image processing and speech recognition research continue to progress, we examine the effectiveness of these...
Article
Full-text available
Many interior-point methods for linear programming are based on the properties of the logarithmic barrier function. After a preliminary discussion of the convergence of the (primal) projected Newton barrier method, three types of barrier method are analyzed. These methods may be categorized as primal, dual and primal-dual, and may be derived from t...
Article
Video and audio conferencing over networks is becoming increasingly popular due to the availability of video and audio I/O as standard equipment on many computer systems. So far, many algorithms have concentrated on playback only capability. This generally results in unacceptable real-time performance with respect to latency and encoder complexity....
Article
The widespread use of teleconferencing as a major role of remote communication has until now been stymied by the costs associated with deploying specialized hardware required to achieve good performance. Another constraint has been the incompatibility among the various systems that are available. These facts coupled with the rapid increase in compu...
Conference Paper
A small collection of successive frames of a video sequence of a talking person defines a subspace whose dimensionality is much less than the size of one frame. Any face image drawn from the video sequence can be associated with a subspace defined by itself and the frames close to it in time. Representing faces in their associated subspaces can red...
Article
Low bit rate image coding at 10 kbit/s and less is a difficult problem and does not appear possible with the current generation of block transform based methods. Current research efforts center around the use of transforms with less objectionable artifacts such as wavelets or model based methods. We examine a method that is transform based but capt...
Conference Paper
The pile is a new element of the desktop user interface metaphor, designed to support the casual organization of documents. An interface design based on the pile concept suggested uses of content awareness for describing, organizing, and filing textual documents. We describe a prototype implementation of these capabilities, and give a detailed exam...
Article
Visual Object Oriented Unification System (VOUS) computer program provides facility for development of other, high-level software. Does not replace, but rather extends, preexisting software tools for development of other software. Provides comprehensive, graphical, interactive medium for all phases in development of computer code from early explora...
Article
Full-text available
In barrier methods for constrained optimization, the main work lies in solving large linear systems Kp = r, where K is symmetric and indefinite. We have implemented reduced KKT systems in a primal-dual algorithm for linear programming, based on the sparse indefinite solver MA27 from the Harwell Subroutine Library. Some features of the algorithm are...
Article
Full-text available
We present several new algorithms for solving the general large-scale quadratic programming (QP) problem. A feature of QP problems is the presence of linear inequality constraints, which introduce a combinatorial aspect to the problem. Currently the most common approach to solving QP problems is to apply active-set methods, in which only some of th...
Article
Full-text available
We discuss the solution of sparse linear equations Ky = z, where K is symmetric and indefinite. Since exact solutions are not always required, direct and iterative methods are both of interest. An important direct method is the Bunch-Parlett factorization K = U^T D U, where U is triangular and D is block-diagonal. A sparse implementation exists...
Article
Full-text available
This report introduces a software design platform which departs from the style of most design tools by acting as an extension to, rather than a replacement for, existing design tools. The ultimate goal of this system is to unify and integrate the various functions provided by text editors, graphics editors, text formatters, hyper-text and structure...
Article
Full-text available
This paper describes a new approach to the segmentation and annotation problem using Gaussian mixture model descriptors. These have several advantages over conventional, histogram-based techniques, including: a rigorous statistical basis; the possibility of encoding spatial, colour, texture and motion features in a unified system; and the ability...

Projects (3)