José Ramón Beltrán Blázquez

José Ramón Beltrán Blázquez
University of Zaragoza | UNIZAR · Department of Electrical Engineering and Communications

PhD

About

54
Publications
14,767
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
291
Citations

Publications

Publications (54)
Preprint
Full-text available
In this article, we present musicaiz, an object-oriented library for analyzing, generating and evaluating symbolic music. The submodules of the package allow the user to create symbolic music data from scratch, build algorithms to analyze symbolic music, encode MIDI data as tokens to train deep learning sequence models, modify existing music data a...
Article
Full-text available
This paper presents a new physiological signal acquisition multi-sensory platform for emotion detection: Multi-sensor Wearable Headband (MsWH). The system is capable of recording and analyzing five different physiological signals: skin temperature, blood oxygen saturation, heart rate (and its variation), movement/position of the user (more specific...
Article
Full-text available
The proven ability of music to transmit emotions provokes the increasing interest in the development of new algorithms for music emotion recognition (MER). In this work, we present an automatic system of emotional classification of music by implementing a neural network. This work is based on a previous implementation of a dimensional emotional pre...
Article
Full-text available
Capacitive MEMS accelerometers have a high thermal sensitivity that drifts the output when subjected to changes in temperature. To improve their performance in applications with thermal variations, it is necessary to compensate for these effects. These drifts can be compensated using a lightweight algorithm by knowing the characteristic thermal par...
Preprint
Full-text available
In this paper, we present a new model for Direction of Arrival (DOA) estimation of sound sources based on an Icosahedral Convolutional Neural Network (CNN) applied over SRP-PHAT power maps computed from the signals received by a microphone array. This icosahedral CNN is equivariant to the 60 rotational symmetries of the icosahedron, which represent...
Preprint
Full-text available
Deep learning models are typically evaluated to measure and compare their performance on a given task. The metrics that are commonly used to evaluate these models are standard metrics that are used for different tasks. In the field of music composition or generation, the standard metrics used in other fields have no clear meaning in terms of music...
Article
Full-text available
The analysis of the structure of musical pieces is a task that remains a challenge for Artificial Intelligence, especially in the field of Deep Learning. It requires prior identification of the structural boundaries of the music pieces, whose structural boundary analysis has recently been studied with unsupervised methods and supervised neural netw...
Article
Full-text available
Capacitive MEMS accelerometers have a high thermal sensitivity that drifts the output when subjected to changes in temperature. To improve their performance in applications with thermal variations, it is necessary to compensate for these effects. These drifts can be compensated using a lightweight algorithm by knowing the characteristic thermal par...
Preprint
Full-text available
Generating a complex work of art such as a musical composition requires exhibiting true creativity that depends on a variety of factors that are related to the hierarchy of musical language. Music generation have been faced with Algorithmic methods and recently, with Deep Learning models that are being used in other fields such as Computer Vision....
Preprint
Full-text available
The aim of this work is to define a model based on deep learning that is able to identify different instrument timbres with as few parameters as possible. For this purpose, we have worked with classical orchestral instruments played with different dynamics, which are part of a few instrument families and which play notes in the same pitch range. It...
Article
Full-text available
The application of MEMS capacitive accelerometers isimited by its thermal dependence, and each accelerometer must be individually calibrated to improve its performance. In this work, aight calibration method based on theoretical studies is proposed to obtain two characteristic parameters of the sensor's operation: the temperature drift of bias and...
Article
Full-text available
Automatic music transcription (AMT) is a critical problem in the field of music information retrieval (MIR). When AMT is faced with deep neural networks, the variety of timbres of different instruments can be an issue that has not been studied in depth yet. The goal of this work is to address AMT transcription by analyzing how timbre affect monopho...
Article
Full-text available
The Image Source Method (ISM) is one of the most employed techniques to calculate acoustic Room Impulse Responses (RIRs), however, its computational complexity grows fast with the reverberation time of the room and its computation time can be prohibitive for some applications where a huge number of RIRs are needed. In this paper, we present a new i...
Article
Full-text available
Note Tracking (NT) is a subtask of Automatic Music Transcription (AMT) which is a critical problem in the field of Music Information Retrieval (MIR). The aim of this work is to compare the performance of two models, one for onsets and frames prediction and another one with pitch detection and a note tracking algorithm in order to study the behaviou...
Article
In this paper, we present a new single sound source DOA estimation and tracking system based on the well-known SRP-PHAT algorithm and a three-dimensional Convolutional Neural Network. It uses SRP-PHAT power maps as input features of a fully convolutional causal architecture that uses 3D convolutional layers to accurately perform the tracking of a s...
Article
Este artículo presenta una revisión de librerías de alto nivel que permiten reconocer ciertas emociones en la música (MER). El principal objetivo del trabajo consiste en estudiar y comparar diferentes analizadores de contenido, mostrando sus principales funcionalidades, enfocadas a la extracción de características de la música y su posterior clasif...
Preprint
Full-text available
The analysis of the structure of musical pieces is a task that remains a challenge for Artificial Intelligence, especially in the field of Deep Learning. It requires prior identification of structural boundaries of the music pieces. This structural boundary analysis has recently been studied with unsupervised methods and \textit{end-to-end} techniq...
Preprint
Full-text available
In this paper, we present a new sound source DOA estimation and tracking system based on the well known SRP-PHAT algorithm and a three-dimensional Convolutional Neural Network. It uses SRP-PHAT power maps as input features of a fully convolutional causal architecture that uses 3D convolutional layers to accurately perform the tracking of a sound so...
Article
In multi-source localization systems, a stronger source can hide weaker sources. In this paper, we present a new technique to eliminate the effect of a source in the Generalized Cross-Correlation functions (GCCs) of the signals captured with a broadband sensor array. The proposed method is based on the projection of the GCCs onto a subspace orthogo...
Chapter
The recognition of emotions for annotating large-size music datasets is still an open challenge. The problem lies in that most of the solutions require the audio of the songs and user/expert intervention during certain phases of the recognition process. In this paper, we propose an automatic solution for overcoming these drawbacks. It consists of a...
Conference Paper
This article focuses on the process of designing a prediction system for automatic recognition of emotions in music. One of the main goals of this work is to analyze a prediction solution and some possible variations in its design that allow maximizing the success rate of predictions through a machine learning technique. For the training process a...
Chapter
Full-text available
Mostly all works dealing with ECG signal and Convolutional Network approach use 1D CNNs and must train them from scratch, usually applying a signal preprocessing, such as noise reduction, R-peak detection or heartbeat detection. Instead, our approach was focused on demonstrating that effective transfer learning from 2D CNNs can be done using a well...
Article
Our main goal was studying the effectiveness of transfer learning using 2D CNNs. For this task, we generated spectrograms from ECG segments that were fed to a CNN to automatically extract features. These features are classified by a MLP into arrhythmic or normal rhythm segments, achieving 90% accuracy.
Chapter
This article presents a review of high-level libraries that enable to recognize emotions in digital files of music. The main objective of the work is to study and compare different high-level content-analyzer libraries, showing their main functionalities, focused on the extraction of low and high level relevant features to classify musical pieces t...
Conference Paper
People that practice running use to listen to music during their training sessions. Music can have a positive influence on runners’ motivation and performance, but it requires selecting the most suitable song at each moment. Most of the music recommendation systems combine users’ preferences and context-aware factors to predict the next song. In th...
Preprint
Full-text available
The Image Source Method (ISM) is one of the most employed techniques to calculate acoustic Room Impulse Responses (RIRs), however, its computational complexity grows fast with the reverberation time of the room and its computation time can be prohibitive for some applications where a huge number of RIRs are needed. In this paper, we present a new i...
Chapter
Music can have a positive influence on long-distance runners’ motivation and performance. It requires selecting the most suitable music by considering the runner’s physiological data, the type of training session and the geographical and environmental conditions under which the activity is done. In this context, we are interested in studying the ru...
Article
In this paper, a methodology to implement the Input Shaping control in overhead cranes is presented. The realization of Input Shaping has been done using the Simulink® tools Stateflow and PLC Coder. Seven Input Shaping algorithms have been developed. A test has been designed to obtain the characteristic parameters, natural frequency of oscillation...
Conference Paper
The Steered Response Power with phase transform (SRP-PHAT) is one of the most employed techniques for Direction of Arrival (DOA) estimation with microphone arrays due its robustness against acoustical conditions as reverberation or noise. Among its main drawbacks is the growth of its computational complexity when the search space increases. To solv...
Article
The Steered Response Power with phase transform (SRP-PHAT) is one of the most employed techniques for Direction of Arrival (DOA) estimation with microphone arrays, but its computational complexity grows when the search space increases. To solve this issue, we propose the use of Neural Networks (NN) to obtain the DOA from low-resolution SRP-PHAT pow...
Conference Paper
This paper presents the development of a new tangible tabletop specially designed for interactive audiovisual and musical control. It explores a new interactive space based on 3D active tangible interaction and user's gestures that allows the user to extend the control of the musical events beyond the tabletop surface. It includes a vertical see-th...
Conference Paper
This paper presents ImmertableApp, an innovative multimodal interface based in tangible interaction in which audio edition is managed through physical controllers. The system is composed by two different main components: a tangible tabletop interface in which the sound parameters can be changed by the manipulation of physical controllers; and a gra...
Conference Paper
Full-text available
Este trabajo presenta las acciones que estamos realizando para crear una herramienta de generación y ejecución de entornos virtuales y simulación altamente inmersivos de bajo coste. El objetivo de la herramienta es que usuarios no expertos (en las tecnologías implicadas) definan entornos de simulación combinando aplicaciones de M&S y dispositivos C...
Article
Full-text available
In this work, an improvement of the Complex Wavelet Additive Synthesis (CWAS) algorithm is presented. This algorithm is based on a discrete version of the Complex Continuous Wavelet Transform (CCWT) which analyzes the input signal in a frame-to-frame approach and under variable frequency resolution per octave. After summarizing several Time-Frequen...
Article
Full-text available
In this study, a new method of blind audio source separation (BASS) of monaural musical harmonic notes is presented. The input (mixed notes) signal is processed using a flexible analysis and synthesis algorithm (complex wavelet additive synthesis, CWAS), which is based on the complex continuous wavelet transform. When the harmonics from two or more...
Article
This work presents a method of analyzing and synthesizing audio signals that uses complex wavelets. In the method, the input signal is filtered by a complex bandpass filter bank through a discrete version of the complex continuous wavelet transform. A general theoretical signal with time-dependent amplitude and phase has been analyzed. The analysis...
Article
In this paper, we present the implementation of different reverberation algorithms in the Matlab programming environment. Matlab is a useful tool to analyze the algorithm's behavior under the signal processing point of view. In addition, the possibility of hearing the results is quite simple and fast. The reverberation algorithms are presented in t...
Article
Full-text available
In this work a new multiresolution method to detect and classify edges appearing in images has been proposed. The edge detection and classification schema is based on the analysis of the data obtained by a multiresolution image analysis using Mallat and Zhong's wavelet. Multiresolution analysis allows to detect edges of different relevance at diffe...
Conference Paper
In this paper a new method for medical images analysis has been proposed. It is based in a multiresolution schema in combination with a k-means clustering algorithm. The edge detection and classification schema is based on the analysis of the data obtained by a multiresolution image analysis (MRA) using Mallat and Zhong's wavelet. The edge detectio...
Conference Paper
An inverter topology based on high frequency power conversion and sigma-delta modulation is proposed. Sigma-delta technique is particularly an attractive idea in audio power amplifying area. Analysis of this technique is presented. Power modulation is a way to obtain power amplifiers with better efficiency than conventional linear power amplifiers....
Article
Full-text available
In this paper, we present the implementation of different reverberation algorithms in the Matlab programming environment. This is a useful tool to analyze the algorithms behavior from the signal processing and sounding point of view. With Matlab environment is possible and simple to view the filter characteristics, impulse response, phase response...
Article
Full-text available
This paper describes a highly versatile, low-cost reverberation system comprising two main elements: a computer for building and editing the desired reverberation effect impulse response, and a commercial DSP-based board, to run the algorithm in real-time, allowing the evaluation of the results. The main parameters of the reverberation algorithm ca...
Conference Paper
In this work we present the method we have developed in order to achieve edge detection and classification in gray level images for five different contour types: step, ramp, stair, pulse and noise. The edge detection method is based in a multiresolution analysis using Mallat and Zhong's wavelet, which is compared with the gaussian-based one. The ed...
Article
Two-dimensional discrete wavelet transforms (DWTs) have become a very powerful tool in computer vision. When implementing DWT in hardware, finite precision arithmetic introduces quantization errors. The hardware designer must look for the optimum register length which, while ensuring the minimum accuracy criteria, would also lead to a high-speed im...
Conference Paper
We have developed an improved edge detector and classifier for grey level images using multiresolution wavelet-based analysis, particularly the wavelet introduced by Mallat (see IEEE Trans. on Patt. Anal. and Machine Intell., vol.14, no.7, p.710, 1992), specifically designed for edge detection. The edge detection algorithm has been designed based o...
Article
Full-text available
In this paper a new algorithm to compute an additive synthesis model of a signal is presented. An analysis based on the Con-tinuous Wavelet Transform (CWT) has been used to extract the time-varying amplitudes and phases of the model. A coarse to fine analysis increases the algorithm efficiency. The computation of the transient analysis is performed...
Article
Full-text available
In this paper we present a reverberation system based on a multi-loudspeaker configuration. The aim of this work is to produce a natural sounding reverberation system with a similar pattern to the produced in real rooms. A new method for sound spatialization is presented, and it is used to locate on the virtual room's surfaces the early reflections...
Article
Full-text available
In this paper, a new method of blind source separation of monaural signals is presented. It is based on similarity cri- teria between envelopes and frequency trajectories of the components of the signal, and on its onset and offset times. The main difference with previous works is that in this paper, the input signal has been filtered using a flexi...
Article
Full-text available
The objective of this paper is to show the ability of complex band-pass filterbanks to extract the intermodulation information that ap-pears when two audio signals interact inside the same analysis band. To perform the analysis a sinusoidal model of the signals has been assumed. Three kinds of signals have been analyzed: a sum of two cosines, a sum...

Network

Cited By

Projects

Projects (3)
Project
El objetivo general de la tesis consiste en elaborar un modelo que permita establecer la relación entre las características intrínsecas de la música y las emociones percibidas por el oyente; haciendo uso de elementos y herramientas de computación afectiva, para generar un sistema recomendador musical y emocional.
Project
The general objective of the project is the creation of personalized pervasive gaming experiences that act dynamically with older people promoting active ageing, by increasing their social relations, particularly intergenerational ones, and improving their overall health and wellbeing.
Project
The goal of the project is to design simple microphone array structures to locate multiple sound sources into a closed environment. An efficient wide-band algorithm like SRP is used to obtain real-time response.