Nelson Neto

Nelson Neto
Federal University of Pará | UFPA · Faculty of Computing

PhD

About

53
Publications
20,585
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
238
Citations
Citations since 2016
27 Research Items
140 Citations
20162017201820192020202120220102030
20162017201820192020202120220102030
20162017201820192020202120220102030
20162017201820192020202120220102030
Additional affiliations
July 2011 - present
Federal University of Pará
Position
  • Professor

Publications

Publications (53)
Chapter
There is a need for several applications to interpret the predictions made by machine learning algorithms. In light of this, this paper provides a literature review with the aim of analyzing the use of interpretable frameworks, which are tools coupled to algorithms for a better understanding of output predictions. Altogether, 10 frameworks were cit...
Chapter
Forced phonetic alignment (FPA) is the task of assessing the time boundaries of phonetic units, i.e., calculating when in the speech utterance a certain phoneme starts and ends. This paper describes experiments on FPA for Brazilian Portuguese using Kaldi toolkit. Based on time-delay neural networks (TDNN), several acoustic models were trained on th...
Article
Full-text available
Phonetic analysis of speech, in general, requires the alignment of audio samples to its phonetic transcription. This could be done manually for a couple of files, but as the corpus grows large, it becomes infeasibly time-consuming. This paper describes the evolution process toward creating free resources for phonetic alignment in Brazilian Portugue...
Chapter
Forced phonetic alignment (FPA) is the task of associating a given phonetic unit to a timestamp interval in the speech waveform. Phoneticians are able mark the boundaries with precision, but as the corpus grows it becomes infeasible to do it by hand. For Brazilian Portuguese (BP) in particular, only three tools appear to perform FPA: EasyAlign, Mon...
Chapter
Phonetic analysis of speech, in general, requires the alignment of audio samples to its phonetic transcription. This task could be performed manually for a couple of files, but as the corpus grows large it becomes unfeasibly time-consuming, which emphasizes the need for computational tools that perform such speech-phonemes forced alignment automati...
Article
Full-text available
Tests that evaluate individual strategies of visual exploration may be useful for uncovering deviations from typical development, such as autism spectrum disorder and dyslexia. One subgroup of visual exploration tests, called cancellation tests, requires the identification of specific targets surrounded by distractors. However, the lack of automate...
Article
Full-text available
Kriging is a geostatistical interpolation technique that performs the prediction of observations in unknown locations through previously collected data. The modelling of the variogram is an essential step of the kriging process because it drives the accuracy of the interpolation model. The conventional method of variogram modelling consists of usin...
Conference Paper
Full-text available
Computer systems are approaching human behavior in the sense of performing similar tasks, such as listening, understanding, thinking, and speaking. Although the forms of interaction with these systems have also followed the path of technology evolution, computer mice and keyboards do continue to play the leading role as a bridge between human and c...
Conference Paper
The use of digital games allows the enjoyment of playful moments during the learning process, especially when they are aimed at specific audiences, such as people with disabilities. However, these tools often do not have functionalities that allow the customization of the interaction for each user, according to the needs evaluated by healthcare pro...
Chapter
Kriging is one of the most used spatial estimation methods in real-world applications. Some kriging parameters must be estimated in order to reach a good accuracy in the interpolation process, however, this task remains a challenge. Various optimization methods have been tested to find good parameters of the kriging process. In recent years, many a...
Chapter
Kriging is one of the most used spatial estimation methods in real-world applications. In kriging estimation, some parameters must be estimated in order to reach a good accuracy in the interpolation process, however, this step is still a challenge. Various optimization methods have been tested to find good parameters to this process, however, in re...
Chapter
Full-text available
Text-to-speech (TTS) is currently a mature technology used in many areas such as education and accessibility. Some modules of a TTS system depend on the language and, while there are many public materials for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the developme...
Conference Paper
Full-text available
Tests that evaluate the individual strategies of visual exploration may be useful for characterizing deviations from typical development, such as autism spectrum disorder and dyslexia. One subgroup of visual exploration tests, called cancellation tests, require the identification of specific targets surrounded by distractors. However, the lack of a...
Conference Paper
Coercion is an inherent problem of any Internet election. Because voters can vote from any connected place and there is no voting booth to protect them while voting, coercers can easily force them to select their candidates. Although there is no optimal solution for this problem, modern Internet election systems can mitigate it. In other words, the...
Chapter
Full-text available
Brain computer interface establishes a new model of communication, whereby it is possible to communicate using only cerebral signals, that can be obtained from different kind of cerebral stimuli. By the way, one of the most common stimulus is the motor imagery of the arms. However, since a set of variables leads to different levels of classificatio...
Preprint
Assembling metagenomic data sequenced by NGS platforms poses significant computational challenges, especially due to large volumes of data, sequencing errors, and variations in size, complexity, diversity and abundance of organisms present in a given metagenome. To overcome these problems, this work proposes an open-source, bioinfor-matic tool call...
Conference Paper
Technological developments converge to make people interact with electronic devices in an easy way. For people with disabilities, however, that interaction becomes something more than simple: it becomes possible. The current work presents a proposal of an open-source, low cost universal remote control system that translates user's head poses into c...
Conference Paper
This paper presents the results of usability evaluation of an Information Visualization tool with touchless gestural commands. The tool has well-known visualizations tasks implemented in itself, allowing users to interact on a 3D scatterplot visualization technique. The chosen usability evaluation was the Think Aloud protocol, together with questio...
Conference Paper
Mobile Augmented Reality has become more popular mainly because computational resources available in mobile devices, and in the enhanced view of real world that can be seen by the user. The interaction becomes an important point for success of these applications, featuring a natural and intuitive way for the user, and the chance of one, or two hand...
Conference Paper
Several studies point out the importance of interaction in Information Visualization (InfoVis) field to the success of good data visualization. The interaction researches in InfoVis have encouraged the use of non-conventional interfaces, besides the traditional keyboard and mouse, such as voice commands, gesture controls, among others. This work ai...
Conference Paper
This paper aims to highlight the need for access to assistive technologies focused on augmentative and alternative communication (AAC), especially those available for the Brazilian Portuguese language, and the problems involved, as well as provide answers to these difficulties through the VoxLaps software, a free graphical symbol-based AAC applicat...
Conference Paper
The analysis of the phonetic entities of speech nearly always requires the alignment of an audio file with its phonetic transcription. However, it is an extremely labor-intensive task. An automatic alignment tool has modules that depend on the language and, while there are many public resources for some languages (e.g., English and French), the res...
Conference Paper
Full-text available
Resumo The purpose of this article is to identify the Critical Success Factors (CSF) undergraduate courses in Computer. FCS were searched and grouped into four categories: Institution, Teacher, Student and Institution Support Services. Each category consists of several indicators. We used the five point Likert questionnaires to generate quantitati...
Conference Paper
Active learning is a type of semi-supervised learning in which the training algorithm is able to obtain the labels of a small portion of the unlabeled dataset by interacting with an external source (e.g. a human annotator). One strategy employed in active learning is based on the exploration of the cluster structure in the data, by using the labels...
Article
Full-text available
The automatic syllabification process is an essential prerequisite for speech synthesis systems. However, the task is not trivial, and several techniques have been adopted over the last decade. Furthermore, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still li...
Conference Paper
Full-text available
Advances in speech processing research rely on the availabil-ity of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese...
Conference Paper
Full-text available
Advances in speech processing research rely on the availability of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese...
Article
Full-text available
An automatic speech recognition system has modules that depend on the language and, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the development of resources and free tools for BP speech recognition, consisting of text and au...
Conference Paper
Full-text available
This work is part of the effort to develop a speech recognition system for Brazilian Portuguese. The resources for the training and test stages of this system, such as corpora, pronunciation dictionary, language and acoustic models, are publicly available. Here, an application programming interface is proposed in order to facilitate using the open-...
Article
Full-text available
Text-to-speech (TTS) is currently a mature tech-nology that is used in many applications. Some modules of a TTS depend on the language and, while there are many public resources for English, the resources for some under-represented languages are still limited. This work describes the development of a complete TTS system for Brazilian Portuguese whi...
Conference Paper
Full-text available
This paper reports on recent work in the context of the activities of the PoSTPort project aimed at porting a Broadcast News recognition system originally developed for European Portuguese to other varieties. Concretely, in this paper we have focused on porting to Brazilian Portuguese. The impact of some of the main sources of variability has been...
Conference Paper
This work discusses the integration of available technologies for developing spoken dialog systems in Brazilian Portuguese. As a proof-of-concept, it describes a system for non-visual and on-line Web search on Windows.The prototype system is based on Microsoft's Speech Application Programming Interface (SAPI), which provides an interface that allow...
Conference Paper
Full-text available
Speech processing is a data-driven technology that relies on public corpora and associated resources. In contrast to languages such as English, there are few resources for Brazilian Portuguese (BP). This work describes efforts toward decreasing such gap and presents systems for speech recognition in BP using two public corpora: Spoltech and OGI-22....
Conference Paper
Full-text available
Speech processing is a data-driven technology that relies on public corpora and associated resources. In contrast to languages such as English, there are few resources for Brazilian Portuguese (BP). This work describes efforts toward decreasing such gap and presents systems for speech recognition in BP using two public corpora: Spoltech and OGI-22....
Conference Paper
Full-text available
A conversão de uma seq uência de caracteres em seq uências de fones e um importante pré-requisito para serviços que envolvem reconhecimento e/ou síntese de voz. Contudo, a tarefa nãó e trivial e diversas técnicas de conversão vêm sendo adotadas ao longo dá ultima década. Existe um numero bem menor de estudos ná area dedicados ao Português Brasileir...
Article
Full-text available
Speech is a natural interface for human-computer interaction. Speech (or voice) technology is a well-developed field when one considers the international community. There is a wide variety of academic and industrial software. The majority of them assumes a recognizer or synthesizer is available, and can be programmed through an API. In contrast, th...
Article
Full-text available
This work discusses the integration of available technologies for de-veloping spoken dialog systems in Brazilian Portuguese. As a proof-of-concept, it describes a system for non-visual and on-line Web search on Windows. The prototype system is based on Microsoft's Speech Application Programming In-terface (SAPI), which provides an interface that al...
Article
Full-text available
This paper presents some improvements on an existing set of linguistic rules that is capable of performing the syllabification of Brazilian Portuguese words. An algorithm was also implemented and based on this set, which improvements previously mentioned include new rules that depend on the stressed vowel to achieve the standard syllabification of...
Article
Full-text available
An automatic speech recognition system has modules that depend on the language and, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the development of resources and free tools for BP speech recognition, consisting of an applicat...
Article
Full-text available
Resumo: O presente trabalho apresenta o projeto de uma ferramenta (FFTranscriber) integrada para a transcrição de áudio para fins de fonética forense. O software fornecerá ao usuário uma interface simples e intuitiva, com todos os elementos necessários a um processo de transcrição. A ferramenta apresenta diversas vantagens em relação aos softwares...
Article
Full-text available
1 A Interface de Programação Ao promover o amplo desenvolvimento de aplicações baseadas em reconheci-mento de voz, os autores observaram que não era suficiente apenas tornar os recursos disponíveis, tais como modelos de linguagem. Esses recursos sãó uteis para pesquisadores, mas o que a maioria dos programadores deseja e a prati-cidade oriunda de u...
Conference Paper
Full-text available
Este trabalho compara dois sistemas de reconhecimento de fala que podem ser usados no desenvolvimento de aplicativos para Android: Julius em modo servidor e Google. Parte do suporte a Português Brasileiro para o Julius foi desenvolvido pelos autores no contexto do projeto FalaBrasil. O Julius também utilizou o servidor do FalaBrasil para prover rec...

Network

Cited By