Richard Wicentowski
Swarthmore College · Department of Computer Science

PhD

28
Publications
4,512
Reads
835
Citations
Additional affiliations
September 2002 - May 2016
Swarthmore College
Position
  • Professor (Full)

Publications (28)
Article
Full-text available
Objectives: To determine whether data on research studies held by the UK Health Research Authority (HRA) could be summarised automatically with minimal manual intervention. There are numerous initiatives to reduce research waste by improving the design, conduct, analysis and reporting of clinical studies. However, quantitative data on the characteri...
Conference Paper
Roughly once per decade, the ACM and IEEE-Computer Society form a joint task force to produce curricular guidelines for undergraduate computer science programs. The latest guidelines document, Computer Science Curricula 2013 (CS2013), was released in December 2013. CS faculty at many institutions are interested in understanding CS2013, evaluating t...
Conference Paper
Full-text available
In line with institutions across the United States, the Computer Science Department at Swarthmore College has faced the challenge of maintaining a demographic composition of students that matches the student body as a whole. To combat this trend, our department has made a concerted effort to revamp our introductory course sequence to both attract a...
Article
Full-text available
An ensemble of supervised maximum entropy classifiers can accurately detect and identify sentiments expressed in suicide notes. Using lexical and syntactic features extracted from a training set of externally annotated suicide notes, we trained separate classifiers for each of fifteen pre-specified emotions. This formed part of the 2011 i2b2 NLP Sh...
Conference Paper
Full-text available
We present two systems that select the most appropriate Spanish substitutes for a marked word in an English test sentence. These systems were official entries to the SemEval-2010 Cross-Lingual Lexical Substitution task. The first system, Swat-E, finds Spanish substitutions by first finding English substitutions in the English sentence and then tran...
Conference Paper
Full-text available
In the process of automatically annotating songs with descriptive labels, multiple types of input information can be used. These include keyword appearances in web documents, acoustic features of the song's audio content, and similarity with other tagged songs. Given these individual data sources, we explore the question of how to aggregate...
Article
As part of the 2006 i2b2 NLP Shared Task, we explored two methods for determining the smoking status of patients from their hospital discharge summaries when explicit smoking terms were present and when those same terms were removed. We developed a simple keyword-based classifier to determine smoking status from de-identified hospital discharge sum...
Article
Full-text available
In this paper, we describe our two SemEval-2007 entries. Our first entry, for Task 5: Multilingual Chinese-English Lexical Sample Task, is a supervised system that decides the most appropriate English translation of a Chinese target word. This system uses a combination of Naïve Bayes, nearest neighbor cosine, decision lists, and latent semantic a...
Article
Full-text available
We present two systems that pick the ten most appropriate substitutes for a marked word in a test sentence. The first system scores candidates based on how frequently their local contexts match that of the marked word. The second system, an enhancement to the first, incorporates cosine similarity using unigram features. The core of both systems b...
Conference Paper
Full-text available
Conference Paper
Full-text available
As Computer Science educators, we know that students learn more from projects that are fun and challenging, that seem "real" to them, and that allow them to be creative in designing their solutions. When we have students beating down our office doors wanting to show us what they've done, we know we have designed a project that truly meets its pedag...
Article
Full-text available
This paper presents the WordFrame model, a noise-robust supervised algorithm capable of inducing morphological analyses for languages which exhibit prefixation, suffixation, and internal vowel shifts. In combination with a naïve approach to suffix-based morphology, this algorithm is shown to be remarkably effective across a broad range of language...
Article
Full-text available
This paper presents a corpus-based algorithm capable of inducing inflectional morphological analyses of both regular and highly irregular forms (such as brought->bring) from distributional patterns in large monolingual text with no direct supervision. The algorithm combines four original alignment models based on relative corpus frequency, contextu...
Article
Full-text available
This paper presents a novel method for unsupervised word sense disambiguation, which combines multiple information sources, including semantic relations, large unlabeled corpora, and cross-lingual distributional statistics.
Article
This article describes the Johns Hopkins University (JHU) sense-disambiguation systems that participated in seven SENSEVAL2 tasks: four supervised lexical choice systems (Basque, English, Spanish, Swedish), one unsupervised lexical choice system (Italian) and two supervised all-words systems (Czech, Estonian). The common core supervised system util...
Article
Full-text available
This paper describes a system and set of algorithms for automatically inducing stand-alone monolingual part-of-speech taggers, base noun-phrase bracketers, named-entity taggers and morphological analyzers for an arbitrary foreign language. Case studies include French, Chinese, Czech and Spanish. Existing text analysis tools for English are applied...
Article
Full-text available
This paper describes the component models and combination model built as a joint effort between Swarthmore College, Hong Kong PolyU, and HKUST. Though other models described elsewhere contributed to the final combination model, this paper focuses solely on the joint contributions to the "Swat-HK" effort.
Article
Full-text available
Human annotators and natural language applications are able to identify smoking status from discharge summaries with high accuracy when explicit evidence regarding their smoking status is present in the summary. We explore the possibility of identifying the smoking status from discharge summaries when these smoking terms have been removed. We...
Article
Full-text available
This document contains the instructions for preparing a camera-ready manuscript for the proceedings of NAACL HLT 2010. The document itself conforms to its own specifications, and is therefore an example of what your manuscript should look like. Authors are asked to conform to all the directions reported in this document.
Article
Full-text available
This paper presents the Swarthmore College word-sense disambiguation system which was designed for the 2004 SENSEVAL3 competition. Our system participated in five tasks: the lexical sample tasks in Basque, Catalan, Italian, Romanian, and Spanish. For each task, a suite of supervised algorithms were combined using voting to form the final system.
Article
Full-text available
Vita. U.M.I. no. 3068229. Thesis (Ph. D.)--Johns Hopkins University, 2003. Includes bibliographical references (leaves 201-205). Microfilm.