Heliana Mello

Heliana Mello
Federal University of Minas Gerais | UFMG · Faculty of Languages

PhD

About

79
Publications
9,494
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
177
Citations

Publications

Publications (79)
Article
Full-text available
1 O tratamento computacional das línguas naturais O tratamento computacional de dados linguísticos tem estado na agenda de linguistas e cientistas da computação há no mínimo cinco décadas; entretanto, apenas nas últimas duas décadas tal movimento ganhou impulso no cenário brasileiro. Este movimento conta com a adesão de pesquisadores de diversas ár...
Preprint
Full-text available
This paper presents the C-ORAL-ESQ corpus project, which is dedicated to the study of the speech of individuals with schizophrenia. The main aim of the project is to investigate cognitive aspects of individuals with schizophrenia. This investigation is carried through the compilation of a spontaneous speech corpus and its study, which focuses mainl...
Article
Full-text available
The definition of Topic as well as that of information structure in the literature is very broad (cf. BARBOSA, 2005; MELLO; SILVA, 2015). Here we assume the definition as proposed by the Language into Act Theory (CRESTI, 2000), which says that Topic is the textual unit that is performed by an intonational profile of the prefix type ('t HART et al....
Chapter
This chapter explores the multiple ways in which human cultures and languages conceptualize time. We discuss the very notion of time from different perspectives as well as through the comparison of well-studied languages with lesser-known ones. Our focus is on concepts of time in three indigenous languages and cultures of Brazil: Huni Kuĩ, Awety, a...
Preprint
Full-text available
RESUMO/APRESENTAÇÃO: A sociolinguística, ao longo das suas décadas de desenvolvimento, sempre priorizou o estudo de dados linguísticos autênticos, coletados através de diversos instrumentos. Mais recentemente, com o grande desenvolvimento das metodologias computacionais, a linguística de corpus e a ciência da computação vêm agregando novos métodos...
Article
Full-text available
HOW TO CITE MELLO, H.; METTOUCHI, A.; MITHUN, M.; PANUNZI, A.; RASO, T. (2021). ABSTRACT This paper focuses on the experience of spoken corpora compilation and discusses the relevance of prosody in this type of endeavor, as well as in the study of spoken language in its several possibilities. Through the voices of scholars associated with four diff...
Article
Full-text available
Speech and gestures meet at their departure point which is actionality. The same departing point keeps the two channels connected through their execution in the creation of meaning and interactivity. Both speech and gestures require segmentation in order to be studied and understood scientifically, as knowing what the units of analysis are is cruci...
Article
Full-text available
As redações são instrumentos avaliativos muito importantes para os estudantes brasileiros. Mesmo que seja assumido que a subjetividade esteja presente em todo e qualquer texto, espera-se que as correções dessas redações sejam feitas com o mínimo de subjetividade possível. Entretanto, a partir da análise de uma amostra de correções de redação, perce...
Book
What is the best way to analyze spontaneous spoken language? In their search for the basic units of spoken language the authors of this volume opt for a corpus-driven approach. They share a strong conviction that prosodic structure is essential for the study of spoken discourse and each bring their own theoretical and practical experience to the ta...
Preprint
Full-text available
Essays are very important assessment tools for Brazilian students. Therefore, it is expected that the grading of these texts will be made with as little subjectivity as possible. However, in an analysis of a sample of grading sheet comments by evaluators, we have noticed a high degree of subjectivity in these texts. From this first analysis, carrie...
Article
This special issue of JoSS is one of a series of initiatives undertaken by the Lab of Empirical and Experimental Linguistic Studies (LEEL) at the Federal University of Minas Gerais (UFMG) together with several international partners on the important topic of speech segmentation, primarily segmentation applied to spontaneous speech.
Conference Paper
Full-text available
Speech corpora usually serve as the basis for prosodic, pragmatic and syntactic interface studies, as well as phonetic and phonological ones. Good audibility is an essential feature; however, it is not always enough for research purposes. In this article, we present methodological procedures and criteria adopted in the acoustic quality classificati...
Article
Full-text available
O trabalho apresenta a arquitetura e os critérios de compilação de um corpus de fala espontânea do português angolano. Após uma breve contextualização da realidade linguística de Angola, são apresentados em detalhe as modalidades de gravação e o tratamento das diferentes variações sociolinguísticas documentadas, destacando-se a atenção à variação d...
Article
Full-text available
Resumo Este trabalho avaliou a validade concorrente e de face da escala de MacArthur, que busca aferir o status social subjetivo (SSS) na sociedade, na vizinhança e no trabalho. A amostra de 159 adultos, participantes da coorte ELSA-Brasil, em Minas Gerais (2012-2014), foi selecionada e a análise incluiu métodos epidemiológicos, a teoria cognitiva...
Article
Full-text available
The verbal negation system of Brazilian Portuguese (BP) presents three forms: preverbal, double and postverbal negation, as can be seen in the following examples: *MIC: [91] mas / Michael / eunãofalonessesentido // (ii) *DOM: [101] cêsnũlêemissomaisnão // (iii) *RUT: [220] participanão / minhafilha //. The purpose of this paper is to investigate wh...
Article
Full-text available
Este estudo investigou a sensibilidade de aprendizes brasileiros de inglês aos seguintes fatores que influenciam a alternância dativa do inglês: pronominalidade, animacidade, acessibilidade discursiva e extensão dos argumentos recipiente e tema. Utilizando a metodologia da Linguística de Corpus (MCENERY & HARDIE 2012), nossa investigação baseou-se...
Article
Full-text available
Este artigo sistematiza o resultado de investigações exploratórias sobre as cláusulas relativas encontradas no minicorpus de fala espontânea do português do Brasil (PB), etiquetado informacionalmente, extraído do corpus C-ORAL-BRASIL. A seleção dos dados enfocou a identificação dos enunciados nos quais as cláusulas relativas ocorrem. Para essa tare...
Article
Full-text available
The verbal negation system of Brazilian Portuguese (BP) presents three forms: preverbal, double and postverbal negation, as can be seen in following examples: *MIC: [91] mas / Michael / eu não falo nesse sentido // (ii) *DOM: [101] cês nũ lêem isso mais não // (iii) *RUT: [220] participa não / minha filha //. The goal of this paper is to investigat...
Article
Full-text available
This paper focuses on testing a pragmatic hypothesis about verbal negation in Brazilian Portuguese (BP). The BP verbal negation system presents three forms, namely: preverb-al, double, and postverbal negation. Schwenter (2005) claims that there are constraints in the use of the non-canonical forms, i.e. double and postverbal negations. According to...
Article
Full-text available
O artigo analisa estratégias de cortesia no ato de fala da recusa a pedidos em português brasileiro e italiano. Os resultados apontam a relevância do ensino de aspectos pragmáticos a estudantes de LE
Article
Full-text available
Apresentação do número temático Linguística de Corpus
Article
Full-text available
The inception of studies on information structure are usually attributed to the notions of point de départ and but du discours by Weill (1844) and psychological subject and predicate by Paul (1880) and Gabelenz (1891). Later, within the context of the Prague School the notions theme and rheme were introduced (cf. AMMANN, 1928; MATHESIUS, 1929), and...
Article
Full-text available
Dados Técnicos da Revista
Book
The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysi...
Article
Full-text available
In this paper we briefly discuss the history of corpus linguistics and its applications to the study of scientific language. We provide some exemplification for corpora exploitation tools, corpora applications to scientific language studies, data mining and natural language processing.
Conference Paper
Full-text available
Action verbs have many meanings, covering actions in different ontological types. Moreover, each language categorizes action in its own way. The range of variations within and across languages is largely unknown, causing trouble for natural language processing tasks and second language acquisition. IMAGACT is a corpus-based ontology of action conce...
Article
Full-text available
Os estudos baseados em dados que tratam da noção de frame e do seu uso na análise da linguagem adotam unidades de análise que variam desde unidades lexicais até estruturas construcionais, sintáticas e textuais. Os mesmos princípios são utilizados em aplicações de processamento de linguagem natural. Neste artigo discutimos o porquê de se fazer neces...
Article
Full-text available
This paper presents a distributed platform for Natural Language Processing called PyPLN. PyPLN leverages a vast array of NLP and text processing open source tools, managing the distribution of the workload on a variety of configurations: from a single server to a cluster of linux servers. PyPLN is developed using Python 2.7.3 but makes it very easy...
Conference Paper
Full-text available
This article describes the morphosyntactic annotation of the C-ORAL-BRASIL speech corpus, using an adapted version of the Palavras parser. In order to achieve compatibility with annotation rules designed for standard written Portuguese, transcribed words were orthographically normalized, and the parsing lexicon augmented with speech-specific materi...
Conference Paper
The C-ORAL-BRASIL is a Brazilian Portuguese spontaneous speech corpus, representative of the state of Minas Gerais diatopy (primarily from the capital city, Belo Horizonte,metropolitan area). The corpus was compiled following the same architecture and segmentation criteria adopted by the C-ORAL-ROM [1] as well as its alignment software, the WinPitc...
Chapter
Full-text available
A Linguística de Corpus, 1 como área disciplinar que explora corpora computadorizados, embora ainda tímida, tem gradativamente crescido no Brasil nas duas últimas décadas. Apesar de a tradição de estudos linguísticos baseada em dados reais da língua em uso ser muito mais antiga no país, por exemplo, nos estudos da Sociolinguística Variacionista (cf...
Conference Paper
Full-text available
This study investigates the hypothesis that spontaneous speech is organized in pragmatic units, i.e. utterances and informational units, signaled in the speech flow through intonation, according to the Language Into Act Theory. We aim to demonstrate that the segmentation of speech into such units is statistically consistent by means of a quantitati...
Article
Full-text available
Resumo: O presente artigo apresenta proposta de explicação analógica para o surgimento de algumas novas formas lexicais do vernáculo brasileiro. A partir de ocorrências selecionadas on line, constatou-se que há novos modelos de substantivos em uso. Serão analisados especificamente (i) os nomes populares dados aos estádios de futebol terminados em –...
Article
Full-text available
Second language acquisition studies have claimed that feed-back, in the form of recasts, has a positive impact on learners’ L2 develop-ment. " is study aims to examine the e# ectiveness of two corrective feed-back forms, recasts and models, on Brazilian learners of English acquiringtwo language structures and the role of focus attention and noticin...
Article
Full-text available
O presente artigo pretende fazer uma revisão teórica dos principais conceitos relacionados à modalidade, das grandes visões teóricas relacionadas a esse assunto e das principais obras que trataram. Com isso, objetiva-se prover subsídios àqueles que se interessam pela manifestação lingüística da modalidade.
Article
Full-text available
Conference Paper
Full-text available
Esse trabalho analisa a retomada pronominal dita por pronome lembrete em frases como “A Maria, eu gosto muito dela”. A pesquisa baseia-se nos preceitos teórico-metodológicos da Teoria da Língua em Ato (CRESTI, 2000) e em dados do corpus C-ORAL-BRASIL para mostrar as restrições prosódicas e informacionais operantes na retomada por pronome lembrete....
Article
Full-text available
Neste artigo explora-se a noção semântica de modalidade na falaespontânea do português brasileiro (PB) através de um estudo decorpus, cujo objetivo é fazer um levantamento das principaisestratégias modalizadoras na fala espontânea do PB. O substratoteórico que orienta esse estudo é a Teoria da Língua em Ato, deCresti (2000). Buscaram-se os índices...
Article
Full-text available
A Corpus of Brazilian Portuguese (BP) will join C-ORAL-ROM [1] adopting the same corpus design and prosodic annotation schema. The inter-rater agreement concerning the annotation of terminal and non terminal breaks by both experts and non experts is studied and compared with the early C-ORAL-ROM results [2]. Although the overall prominence of proso...
Article
Full-text available
Article
Full-text available
This paper highlights the primary methods employed in the C-ORAL-BRASIL compiling process, i.e, recording, transcribing and segmenting oral texts. The C-ORAL-BRASIL is a Brazilian Portuguese corpus of spontaneous speech, designed for the study of informational structure. It is representative of the diaphasic variation, seeking to cover as many diff...
Article
Full-text available
O português brasileiro coloquial (PBC) apresenta algumas construções de forma [SN V SN] que parecem ser relacionadas a estruturas com a forma [[SN [de SN] ] V], de uso mais geral. Exemplos ilustrativos desse fenômeno podem ser vistos abaixo:1.(a) A Belina deita o banco, sabe? (PBC)(b) O banco da Belina deita, sabe?No par de exemplos, temos padrões...
Article
Full-text available
The advances in the field of second (L2) and foreign (FL) language teaching and learning in the past two decades have been manifold, among these: acquisition theories that have emerged as a consequence of refinements in experimental and methodological tools; the shift of focus to approaches rather than methods in L2 and FL teaching; socio- interact...
Chapter
http://www.casaruibarbosa.gov.br/dados/DOC/artigos/a-j/FCRB_Historia_social_da_lingua_nacional.pdf
Article
Full-text available
Article
Full-text available
In this article we present the first application of the theory of Languagein Act (CRESTI, 2000b) to Brazilian Portuguese. After a summaryof the theory is presented, we show how a text can be divided inutterances and how the utterance can be divided in tone units, throughthe perception of terminal and non terminal prosodic breaks. Basedon dedicated...
Article
Full-text available
This paper focuses on the pertinence of Action Research as a practical tool in the enhancement of language teachers' autonomy. We report on the results achieved after a year of collaborative action research undertaken by teachers enrolled in a Teacher Education Continuing Program and point out the steps taken throughout this initiative. We conclude...
Article
Full-text available
Resumo: O objetivo desta comunicação é relatar os resultados da utilização metodológica dos procedimentos da pesquisa-ação colaborativa no âmbito de um projeto de formação continuada de professores de inglês (EDUCONLE). Tais resultados iluminarão reflexões sobre a aplicabilidade de tal procedimento como instrumento de reflexão e engrandecimento pro...
Article
BOOK NOTICES 695 morphophonemics, and transformational rules. The latter subject is outdated, with mention of three of Chomsky’s books in the references (111), the latest of which is Language and mind (New York: Harcourt Brace and Jovanovich, 1972). Transformational syntax has, needless to say, changed considerably over the past three decades. That...
Article
Full-text available
Article
Full-text available
Article
This article aims to review the major theoretical concepts related to modality, as well as the theoretical views and important books related to th is subject. The objective is to provide information to those interested in the linguistic expression of modality.

Network

Cited By

Projects

Projects (3)
Project
Descrição: compreende a investigação de técnicas de aprendizado de máquina para desidentificação automática de prontuários de pacientes. Dois corpus serão criados utilizando dados em parceria com o Hospital das Clínicas da UFMG. O primeiro corpus será usado no treinamento de algoritmos computacionais e o segundo conterá os dados resultantes do processo de desidentificação. O corpora e as ferramentas criadas, ao longo do projeto, serão disponibilizados de forma gratuita e pública para pesquisa e consulta de profissionais da saúde. Resultados esperados: ao final da realização do trabalho é esperado que o corpus criado contribua em diferentes áreas médicas facilitando a obtenção de dados, fomentando a proteção aos pacientes e impulsionando as pesquisas no país. Em última instância pretende-se promover o acesso à informação médica no Brasil em benefício da sociedade. (texto autoria Guilherme Noronha (autor e executor: guilhermenoronha2001 at gmail.com), doutorando PPG-GOC/UFMG)
Project
Investigate the development of national varieties of a pluricentric language is an important research topic. Portuguese is prototypically a pluricentric language in the sense that it has different national standard varieties (Clyne 1992), namely European Portuguese (EP), Brazilian Portuguese (BP) and other standards in development. We know that there are differences between EP and BP at all levels of linguistic structure. However, we know little about the evolutionary relationship between EP and BP in the recent past. This project aims to investigate the question of whether and how EP and BP converge or diverge over the last 60 years in the domains of lexical variation, constructional variation, and language attitudes. Our previous lexical research (Soares da Silva 2010) involving concepts from football and clothing shows that the hypothesis of divergence is confirmed. This project will examine the extent to which lexical, constructional and attitudinal variables correlate as indicators of divergence between EP and BP. It applies advanced corpus-based and sociolectometrical methods to measure convergence/divergence. This is the first comprehensive attempt to investigate EP-BP convergence/divergence. Within the context of pluricentricity research (Clyne 1992, Soares da Silva 2014), the specificity of the proposal resides in two points. First, we focus on the interplay between conceptual and social aspects of pluricentric variation. Therefore, this project subscribes to the framework of Cognitive Sociolinguistics (Kristiansen&Dirven 2008, Geeraerts et al 2010), an emerging extension of Cognitive Linguistics as a meaning-, usage-based approach. Second, we use sociolectometrical methods that allow linguistic distances to be measured and correlated with all types of sociolinguistic variables. Specifically, we will apply the concept-based, profile-based sociolectometry (Speelman et al 2003), where “profile” stands for the relative frequencies of a set of words or constructions in a conceptual category. The research is concerned with onomasiological variation between semantically equivalent words/constructions (denotational synonyms). The onomasiological method has been adopted to study language-internal variation, since denotational synonyms often display sociolinguistic differences and therefore the competition between language varieties. In addition, looking at alternative expressions of concepts or functions provide us with a reliable control mechanism to avoid thematic and statistical bias. Uniformity, featural and attitudinal measures based on onomasiological profiles quantify convergence and divergence between EP and BP. The data will be extracted from a large corpus of EP-BP texts from the 1950s, 1970s and 2000s, Usenet and spoken usage. Several concepts from several lexical fields and a multitude of morphological and syntactic variables will be analyzed. For the selection of lexical variables, we will apply advanced computational techniques that are based on Semantic Vector Space (SVS) models (Turney & Pantel 2010). These models quantify a functional similarity between pairs of words on the basis of the lexical context, found in large corpora. Applying these fully automated models on a large corpus of EP-BP texts will provide us with a large and unbiased sample of lexical variation on which our study can be based. If necessary, we will also analyze concepts manually selected from politics, health and transport. As for constructional variables we shall proceed in two ways. First, we analyse morphological and syntactic variables studied in the literature as EP-BP variation such as alternate patterns of verbal and nominal agreement, overt/pro subject alternation, impersonal/passive constructions, relative constructions, word order variations. In addition, we will include constructional variables not directly related to EP-BP variation, such as alternate prepositional constructions and patterns of diathesis alternations. Second, we will employ the SVS approach to generate potential syntactic variables. Attitudinal variables include elicitations of attitudinal intentions with regard to words and constructions. Multivariate techniques allow us to compare the impact of lexical and constructional variables, corpus-based and attitudinal variables on national convergence/divergence. This project is important, linguistically and non-linguistically. Linguistically, it allows for the determination of the evolutionary relation between EP and BP and the pluricentric nature of Portuguese. Crucially, the sociolectometrical approach brings a new perspective: instead of looking at the distribution of individual variables, we now look at an aggregate level, which makes it possible to quantify convergence/divergence and stratificational issues. Non-linguistically, it is relevant for official language policies, normative positions and educational practices that may acknowledge and foster the pluricentricity of Portuguese.