Diego Furtado Silva

University of São Paulo | USP · Institute of Mathematical and Computer Sciences (ICMC) (São Carlos)

MSc in Computer Science and Computational Mathematics

About

37
Publications
37,324
Reads
1,963
Citations
Additional affiliations
January 2012 - present
University of São Paulo
Education
August 2012
University of São Paulo
Field of study
  • Computer Science and Computational Mathematics
February 2007 - July 2011
University of São Paulo
Field of study
  • Computer Science

Publications (37)
Conference Paper
Full-text available
There has been a huge increase in interest in time series methods and techniques. Virtually every piece of information collected from human, natural, and biological processes is susceptible to changes over time, and the study of how these changes occur is a central issue in fully understanding such processes. Among all time series mining tasks, classifi...
Conference Paper
Full-text available
Throughout history, insects have had an intimate relationship with humanity, both positive and negative. Insects are vectors of diseases that kill millions of people every year and, at the same time, insects pollinate most of the world's food production. Consequently, there is a demand for new devices able to control the populations of harmful...
Conference Paper
Full-text available
Recognition of isolated spoken digits is the core procedure for a large and important number of applications, mainly in telephone-based services such as dialing, airline reservation, bank transaction and price quotation, using only speech. Spoken digit recognition is generally a challenging task since the signals last for a short period of time and o...
Conference Paper
Full-text available
In the last decade, class imbalance has attracted a huge amount of attention from researchers and practitioners. Class imbalance is ubiquitous in Machine Learning, Data Mining and Pattern Recognition applications; therefore, these research communities have responded to such interest with literally dozens of methods and techniques. Surprisingly, the...
Conference Paper
Photoplethysmography (PPG) is a well-known technique to estimate blood pressure, oxygen saturation, and heart frequency. Recent efforts aim to obtain PPG from wearable and mobile devices, allowing more democratic access. This paper explores the potential of using a smartphone camera as a PPG sensor, getting a time series based on the RGB values of...
Article
Full-text available
Cardiovascular diseases are the leading cause of death in the world. People living in vulnerable and poor places such as slums, rural areas and remote locations have difficulty in accessing medical care and diagnostic tests. In addition, given the COVID-19 pandemic, we are witnessing an increase in the use of telemedicine and non-invasive tools for...
Article
Full-text available
Anemia and jaundice are common health conditions that affect millions of children, adults, and the elderly worldwide. Recently, the pandemic caused by severe acute respiratory syndrome-coronavirus 2 (SARS-CoV-2), the virus that leads to COVID-19, has generated an extreme worldwide concern and a huge impact on public health, education, and economy,...
Article
Full-text available
The pandemic caused by the new coronavirus (SARS-CoV-2) had led to more than two million deaths worldwide by March 2021. The worldwide call to reduce transmission is enormous. Recently, there has been a rapid growth of telemedicine and the use of mobile health (mHealth) in the context of the COVID-19 pandemic. Smartphone accessories such as a fl...
Conference Paper
Full-text available
Labeling a music recording according to its genre is an intuitive and familiar way to describe its content. Music genres are valuable information especially for music organization, personalized listening experience, and playlist generation. Automatically classifying music genres is a challenging endeavor due to the inherent ambiguity and subjectivi...
Article
Full-text available
Anemia is a public health problem that can have different causes, such as iron deficiency, vitamin deficiency, inflammation, hemolytic anemias, and anemias associated with bone marrow disease. Anemia is characterized by a decrease in the concentration of hemoglobin, a pigmented molecule in the erythrocytes. The objectives of this review were to highlight the imp...
Article
Full-text available
Diabetes is a chronic disease and one of the major public health problems worldwide. It is a multifactorial disease, caused by genetic factors and lifestyle habits. Brazil had approximately 16.8 million individuals living with diabetes in 2019 and is expected to reach 26 million people by 2045. There is an increasing global need for the development of noninvasi...
Article
Full-text available
The recently introduced data structure, the Matrix Profile, annotates a time series by recording the location of and distance to the nearest neighbor of every subsequence. This information trivially provides answers to queries for both time series motifs and time series discords, perhaps two of the most frequently used primitives in time series dat...
Article
Full-text available
Dynamic Time Warping (DTW) is a highly competitive distance measure for most time series data mining problems. Obtaining the best performance from DTW requires setting its only parameter, the maximum amount of warping (w). In the supervised case with ample data, w is typically set by cross-validation in the training stage. However, this method is l...
Article
Most algorithms for music data mining and retrieval analyze the similarity between feature sets extracted from the raw audio. A conventional approach to assess similarities within or between recordings is to create similarity matrices. However, this method requires quadratic space for each comparison and typically requires a costly post-processing...
Article
Full-text available
The last decade has seen a flurry of research on all-pairs-similarity-search (or similarity joins) for text, DNA and a handful of other datatypes, and these systems have been applied to many diverse data mining problems. However, there has been surprisingly little progress made on similarity joins for time series subsequences. The lack of progress...
Article
Full-text available
Paper-based devices are an excellent match for low-cost Point-of-Care Testing (POCT) tools. Their user-friendliness, portability, and short analysis time, coupled with ease of local manufacture, make these devices the best option for inexpensive diagnostic testing tools. However, despite all their positive features, these low-cost diagnostic...
Data
The Macro in Microsoft Excel® (m-Accuracy) is highly useful for calculating the accuracy of any medical, chemical, and environmental tests, even if the results show different orders of magnitude, since this tool analyzes the similarity between the two tests. The analysis is performed by a single command (Ctrl + Shift + T). The functioning of m-A...
Conference Paper
Full-text available
While there exist a plethora of classification algorithms for most data types, there is an increasing acceptance that the unique properties of time series mean that the combination of nearest neighbor classifiers and Dynamic Time Warping (DTW) is very competitive across a host of domains, from medicine to astronomy to environmental sensors. While t...
Conference Paper
Full-text available
Time series has attracted much attention in recent years, with thousands of methods for diverse tasks such as classification, clustering, prediction, and anomaly detection. Among all these tasks, classification is likely the most prominent task, accounting for most of the applications and attention from the research community. However, in spite of...
Conference Paper
Full-text available
Data stream classification algorithms for nonstationary environments frequently assume the availability of class labels, instantly or with some lag after the classification. However, certain applications, mainly those related to sensors and robotics, involve high costs to obtain new labels during the classification phase. Such a scenario in which...
Article
Full-text available
Insects have a close relationship with humanity, in both positive and negative ways. Mosquito-borne diseases kill millions of people and insect pests consume and destroy around US $40 billion worth of food each year. In contrast, insects pollinate at least two-thirds of all the food consumed in the world. In order to control populations of dise...
Conference Paper
Full-text available
In the last decade we have witnessed a huge increase of interest in data stream learning algorithms. A stream is an ordered sequence of data records. It is characterized by properties such as the potentially infinite and rapid flow of instances. However, a property that is common to various application domains and is frequently disregarded is the v...
Article
Full-text available
In the last decade, class imbalance has attracted a huge amount of attention from researchers and practitioners. Class imbalance is ubiquitous in Machine Learning, Data Mining and Pattern Recognition applications; therefore, these research communities have responded to such interest with literally dozens of methods and techniques. Surprisingly, the...
Conference Paper
Full-text available
The popularization of music distribution in electronic format has increased the amount of music with incomplete metadata. The incompleteness of data can hamper some important tasks, such as music and artist recommendation. In this scenario, transductive classification can be used to classify the whole dataset considering just a few labeled instances....
Conference Paper
Full-text available
Time series are present in many pattern recognition applications related to medicine, biology, astronomy, economics, and others. In particular, the classification task has attracted much attention from a large number of researchers. In such a task, empirical research has shown that the 1-Nearest Neighbor rule with a distance measure in time domain...
Conference Paper
Full-text available
The choice of the distance measure between time-series representations can be decisive to achieve good classification results in many content-based information retrieval applications. In the field of Music Information Retrieval, two-dimensional representations of the music signal are ubiquitous. Such representations are useful to display patterns o...
Conference Paper
Full-text available
Applications such as intelligent sensors should be able to collect information about the environment and make decisions based on input data. An example is a low-cost sensor able to detect and classify species of insects using a simple laser and machine learning techniques. This sensor is an important step towards the development of intelligent trap...
Article
Full-text available
Recognition of isolated spoken digits is the core procedure for a large number of applications which rely solely on speech for data exchange, as in telephone-based services, such as dialing, airline reservation, bank transaction and price quotation. Spoken digit recognition is generally a challenging task since the signals last for a short period o...
Article
Abstract: One of the major challenges currently faced in the field of Information Technology is providing services that meet the different needs and constraints of increasingly heterogeneous groups. For universal access to these services, a key concept is Accessibility. This article presents the fundamentals...
