Carlos Roberto Del-Blanco

Carlos Roberto Del-Blanco
  • PhD
  • Professor (Associate) at Universidad Politécnica de Madrid

About

65
Publications
12,084
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,533
Citations
Current institution
Universidad Politécnica de Madrid
Current position
  • Professor (Associate)
Additional affiliations
December 2005 - September 2012
Universidad Politécnica de Madrid
Position
  • Researcher

Publications

Publications (65)
Article
Full-text available
Nighttime vehicle detection poses significant challenges, particularly in scenarios with limited lighting, where visibility is often compromised. To address this problem, this paper proposes a novel nighttime vehicle detection system that dynamically adapts to extreme lighting conditions, ranging from bright daytime scenarios to challenging nightti...
Preprint
Full-text available
Research on Reinforcement Learning (RL) approaches for discrete optimization problems has increased considerably, extending RL to an area classically dominated by Operations Research (OR). Vehicle routing problems are a good example of discrete optimization problems with high practical relevance where RL techniques have had considerable success. De...
Article
Full-text available
Automatic hand gesture recognition in video sequences has widespread applications, ranging from home automation to sign language interpretation and clinical operations. The primary challenge lies in achieving real-time recognition while managing temporal dependencies that can impact performance. Existing methods employ 3D convolutional or Transform...
Article
Missions involving Unmanned Aerial Vehicle usually consist of reaching a set of regions, performing some actions in each region, and returning to a determined depot after all the regions have been successfully visited or before the fuel/battery is totally consumed. Hence, planning a route becomes an important task for many applications, especially...
Article
Full-text available
Along with society’s development, transportation has become a key factor in human daily life, increasing the number of vehicles on the streets. Consequently, the task of finding free parking slots in metropolitan areas can be dramatically challenging, increasing the chance of getting involved in an accident and the carbon footprint, and negatively...
Article
Full-text available
The conservation of marine resources requires constant monitoring of the underwater environment by researchers. For this purpose, visual automated monitoring systems are of great interest, especially those that can describe the environment using semantic segmentation based on deep learning. Although they have been successfully used in several appli...
Article
A novel deep-learning people detection algorithm using omnidirectional cameras is presented, which only requires point-based annotations, unlike most of the prominent works that require bounding box annotations. Thus, the effort of manually annotating the needed training databases is significantly reduced, allowing a faster system deployment. The a...
Article
Vehicle monitoring using camera networks is an important task for traffic applications. Moreover, it becomes critical in nighttime, when the probability of an accident considerably increases as visibility conditions worsen. Typical approaches are mostly based on the assumption that regions delimiting vehicle lights are well defined, so that they ar...
Article
This paper describes a system for people indoor localization using omnidirectional cameras and machine learning that significantly reduces the database annotation requirements for the training stage. The most prominent works for people detection are based on machine learning techniques that requires large databases with bounding box annotations (th...
Article
Full-text available
In the past few years, the commercial use of drones has exploded, since they are a safe and cost‐effective solution for many kinds of problems. However, this fact also opens the door for malicious use. This work presents a novel system able to detect and recognise drones from other targets, allowing the police and security agencies to deal with thi...
Article
Full-text available
Visual hand gesture recognition systems are promising technologies for Human Computer Interaction, as they allow a more immersive and intuitive interaction. Most of these systems are based on the analysis of skeleton information, which is in turn inferred from color, depth, or near-infrared imagery. However, the robust extraction of skeleton inform...
Chapter
Currently, there is a plethora of video wearable devices that can easily collect data from daily user life. This fact has promoted the development of lifelogging applications for security, healthcare, and leisure. However, the retrieval of not-pre-defined events is still a challenge due to the impossibility of having a potentially unlimited number...
Chapter
Full-text available
This paper presents an overview of the ImageCLEF 2019 lab, organized as part of the Conference and Labs of the Evaluation Forum - CLEF Labs 2019. ImageCLEF is an ongoing evaluation initiative (started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of providing information acc...
Conference Paper
The File Forgery Detection tasks is in its first edition, in 2019. This year, it is composed by three subtasks: a) Forged file discovery, b) Stego image discovery and c) Secret message discovery. The data set contained 6,400 images and pdf files, divided into 3 sets. There were 61 participants and the majority of them participated in all the subtas...
Chapter
Full-text available
This paper presents an overview of the foreseen ImageCLEF 2019 lab that will be organized as part of the Conference and Labs of the Evaluation Forum - CLEF Labs 2019. ImageCLEF is an ongoing evaluation initiative (started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of prov...
Article
Full-text available
This paper presents an overview of the foreseen ImageCLEF 2019 lab that will be organized as part of the Conference and Labs of the Evaluation Forum-CLEF Labs 2019. ImageCLEF is an ongoing evaluation initiative (started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of provid...
Preprint
Blogs have become one of the most successful tools of the Web 2.0 because of their ease of use and the availability of open platforms. They have quickly spread in the education field thanks to the many attractive qualities that have been attributed to them, such as collaboration, communication, enhancing of professional writing, and the improvement...
Article
There has been a significant increase in the availability of 3D players and displays in the last years. Nonetheless, the amount of 3D content has not experimented an increment of such magnitude. To alleviate this problem, many algorithms for converting images and video from 2D to 3D have been proposed. Here, we present an automatic learning-based 2...
Article
Full-text available
Civilian drones are soon expected to be used in a wide variety of tasks, such as aerial surveillance, delivery, or monitoring of existing architectures. Nevertheless, their deployment in urban environments has so far been limited. Indeed, in unstructured and highly dynamic scenarios, drones face numerous challenges to navigate autonomously in a fea...
Article
Visual hand-gesture recognition is being increasingly desired for human-computer interaction interfaces. In many applications, hands only occupy about 10% of the image, whereas the most of it contains background, human face, and human body. Spatial localization of the hands in such scenarios could be a challenging task and ground truth bounding box...
Article
There has been recently a significant increase in the number of available 3D displays and players. Nevertheless, the amount of 3D content has not increased in the same magnitude, creating a gap between 3D offer and demand. To reduce this difference, many algorithms have appeared that perform 2D-to-3D image and video conversion. These algorithms usu...
Conference Paper
Hand gestures are one of the main alternatives for Human-Computer Interaction. For this reason, a hand gesture recognition system using near-infrared imagery acquired by a Leap Motion sensor is proposed. The recognition system directly characterizes the hand gesture by computing a global image descriptor, called Depth Spatiograms of Quantized Patte...
Article
Human-computer Interaction systems based on hand-gesture recognition are nowadays of great interest to establish a natural communication between humans and machines. However, the visual recognition of gestures and other human poses remains a challenging problem. In this paper, the original volumetric spatiograms of local binary patterns descriptor...
Conference Paper
Automatic 2D-to-3D conversion aims to reduce the existing gap between the scarce 3D content and the incremental amount of displays that can reproduce this 3D content. Here, we present an automatic 2D-to-3D conversion algorithm that extends the functionality of the most of the existing machine learning based conversion approaches to deal with moving...
Article
A novel biometric face recognition algorithm using depth cameras is proposed. The key contribution is the design of a novel and highly discriminative face image descriptor called bag of dense derivative depth patterns (Bag-D3P). This descriptor is composed of four different stages that fully exploit the characteristics of depth information: 1) dens...
Conference Paper
Augmented reality is becoming more and more popular due to the countless number of practical applications. A key element is the understanding of the scene and the involved human activities to be able to offer a rich interaction with the world via virtual actions and elements. For this purpose, a new vision-based human-action recognition module has...
Conference Paper
A novel and robust vision-based human-machine interface system to naturally interact with computers/smart devices is proposed. The key contribution is the introduction of a Compressive Sensing technique to largely reduce the dimensionality of highly discriminative feature descriptors (computed from depth imagery), which originally have an excessive...
Article
A novel and robust biometric face identification algorithm for access control applications is proposed. The key contribution is the design of a high discriminative feature descriptor for depth imagery, called Depth Spatiogram of Local Quantized Patterns, which is used as input of a bank of Support Vector Machine classifiers.
Article
Full-text available
A real-time surveillance system for IP network cameras is presented. Motion, part-body, and whole-body detectors are efficiently combined to generate robust and fast detections, which feed multiple compressive trackers. The generated trajectories are then improved using a re-identification strategy for long term operation.
Article
A more natural, intuitive, user-friendly, and less intrusive Human-Computer interface for controlling an application by executing hand gestures is presented. For this purpose, a robust vision-based hand-gesture recognition system has been developed, and a new database has been created to test it. The system is divided into three stages: detection,...
Article
An automatic machine learning strategy for computing the 3D structure of monocular images from a single image query using Local Binary Patterns is presented. The 3D structure is inferred through a training set composed by a repository of color and depth images, assuming that images with similar structure present similar depth maps. Local Binary Pat...
Article
A depth-based face recognition algorithm specially adapted to high range resolution data acquired by the new Microsoft Kinect 2 sensor is presented. A novel descriptor called Depth Local Quantized Pattern descriptor has been designed to make use of the extended range resolution of the new sensor. This descriptor is a substantial modification of the...
Article
In the recent years, the computer vision community has shown great interest on depth-based applications thanks to the performance and flexibility of the new generation of RGB-D imagery. In this paper, we present an efficient background subtraction algorithm based on the fusion of multiple region-based classifiers that processes depth and color data...
Conference Paper
Automatic 2D-to-3D conversion is an important application for filling the gap between the increasing number of 3D displays and the still scant 3D content. However, existing approaches have an excessive computational cost that complicates its practical application. In this paper, a fast automatic 2D-to-3D conversion technique is proposed, which uses...
Conference Paper
New forms of natural interactions between human operators and UAVs (Unmanned Aerial Vehicle) are demanded by the military industry to achieve a better balance of the UAV control and the burden of the human operator. In this work, a human machine interface (HMI) based on a novel gesture recognition system using depth imagery is proposed for the cont...
Article
Full-text available
Low-cost systems that can obtain a high-quality foreground segmentation almost independently of the existing illumination conditions for indoor environments are very desirable, especially for security and surveillance applications. In this paper, a novel foreground segmentation algorithm that uses only a Kinect depth sensor is proposed to satisfy t...
Conference Paper
This paper presents a novel background modeling system that uses a spatial grid of Support Vector Machines classifiers for segmenting moving objects, which is a key step in many video-based consumer applications. The system is able to adapt to a large range of dynamic background situations since no parametric model or statistical distribution are a...
Article
Full-text available
An innovative background modeling technique that is able to accurately segment foreground regions in RGB-D imagery (RGB plus depth) has been presented in this paper. The technique is based on a Bayesian framework that efficiently fuses different sources of information to segment the foreground. In particular, the final segmentation is obtained by c...
Article
Visual tracking of multiple objects is a fundamental aspect of many video-based systems. Today, there are reliable algorithms that can track a small number of objects in restricted situations. However, the tracking of a large number of objects in uncontrolled situations involving interacting objects with complex dynamics is still a challenge. In th...
Article
Full-text available
Automatic visual object counting and video surveillance have important applications for home and business environments, such as security and management of access points. However, in order to obtain a satisfactory performance these technologies need professional and expensive hardware, complex installations and setups, and the supervision of qualifi...
Article
A low complex but highly-efficient object counter algorithm is presented that can be embedded in hardware with a low computational power. This is achieved by a novel soft-data association strategy that can handle multimodal distributions.
Article
Full-text available
Visual tracking of multiple objects is a key component of many visual-based systems. While there are reliable algorithms for tracking a single object in constrained scenarios, the object tracking is still a challenge in uncontrolled situations involving multiple interacting objects that have a complex dynamics. In this article, a novel Bayesian mod...
Conference Paper
An automatic detection and tracking framework for visual surveillance is proposed, which is able to handle a variable number of moving objects. Video object detectors generate an unordered set of noisy, false, missing, split, and merged measurements that make extremely complex the tracking task. Especially challenging are split detections (one obje...
Article
Full-text available
A novel strategy for object tracking in aerial imagery is presented, which is able to deal with complex situations where the camera ego-motion cannot be reliably estimated due to the aperture problem (related to low structured scenes), the strong ego-motion, and/or the presence of independent moving objects. The proposed algorithm is based on a com...
Conference Paper
Full-text available
A multiple object visual tracking framework is presented, which is able to manage complex object interactions, missing detections and clutter. The main contribution is the ability to deal with complex situations in which the interacting objects can change their dynamics while they are occluded. This is achieved by explicitly estimating putative loc...
Conference Paper
Full-text available
Here, a novel and efficient feedback system for moving object segmentation and tracking is proposed. Through the use of non-parametric background-foreground modeling, moving objects are correctly detected in unfavorable situations such as dynamic backgrounds or illumination changes. After detection, objects are tracked by an original multi-object B...
Article
Full-text available
The increasing availability of powerful computers and high quality video cameras has allowed the proliferation of video based systems, which perform tasks such as vehicle navigation, traffic monitoring, surveillance, etc. A fundamental component in these systems is the visual tracking of objects of interest, whose main goal is to estimate the objec...
Conference Paper
Full-text available
Visual tracking with moving cameras is a challenging task. The global motion induced by the moving camera moves the target object outside the expected search area, according to the object dynamics. The typical approach is to use a registration algorithm to compensate the camera motion. However, in situations involving several moving objects, and ba...
Article
Full-text available
Automatic target tracking in airborne FLIR imagery is currently a challenge due to the camera ego-motion. This phenomenon distorts the spatio-temporal correlation of the video sequence, which dramatically reduces the tracking performance. Several works address this problem using ego-motion compensation strategies. They use a deterministic approach...
Conference Paper
Full-text available
People positioning and tracking in 3D indoor environments are challenging tasks due to background clutter and occlusions. Current works are focused on solving people occlusions in low-cluttered backgrounds, but fail in high-cluttered scenarios, specially when foreground objects occlude people. In this paper, a novel 3D people positioning and tracki...
Conference Paper
Full-text available
Video sequences acquired by a camera mounted on a hand held device or a mobile platform are affected by unwanted shakes and jitters. In this situation, the performance of video applications, such us motion segmentation and tracking, might dramatically be decreased. Several digital video stabilization approaches have been proposed to overcome this p...
Conference Paper
Full-text available
Most multi-camera 3D tracking and positioning systems rely on several independent 2D tracking modules applied over individual camera streams, fused using both geometrical relationships across cameras and/or observed appearance of objects. However, 2D tracking systems suffer inherent difficulties due to point of view limitations (perceptually simila...
Article
Full-text available
Motion estimation in video sequences is a classical intensive computational task that is required for a wide range of applications. Many different methods have been proposed to reduce the computational complexity, but the achieved reduction is not enough to allow real time operation in a non-specialized hardware. In this paper an efficient selectio...
Conference Paper
An efficient automatic detection strategy for aerial moving targets in airborne forward-looking infrared (FLIR) imagery is presented in this paper. Airborne cameras induce a global motion over all objects in the image, that invalidates motion-based segmentation techniques for static cameras. To overcome this drawback, previous works compensate the...
Article
Full-text available
Common strategies for detection and tracking of aerial moving targets in airborne Forward-Looking Infrared (FLIR) images offer accurate results in images composed by a non-textured sky. However, when cloud and earth regions appear in the image sequence, those strategies result in an over-detection that increases very significantly the false alarm r...
Conference Paper
Full-text available
An efficient automatic moving target detection and tracking system in airborne forward looking infrared (FLIR) imagery is presented in this paper. Due to camera ego-motion, these detection and tracking tasks are challenging problems. Besides, previously proposed techniques are not suitable for aerial images, as the predominant regions are non-textu...

Network

Cited By