Robyn Speer

Robyn Speer
Luminoso

Master of Engineering

About

27
Publications
7,309
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,824
Citations
Education
September 2002 - August 2007
Massachusetts Institute of Technology
Field of study
  • Computer science

Publications

Publications (27)
Conference Paper
Luminoso participated in the SemEval 2018 task on "Capturing Discriminative Attributes" with a system based on ConceptNet, an open knowledge graph focused on general knowledge. In this paper, we describe how we trained a linear classifier on a small number of semantically-informed features to achieve an $F_1$ score of 0.7368 on the task, close to t...
Article
This paper describes Luminoso's participation in SemEval 2017 Task 2, "Multilingual and Cross-lingual Semantic Word Similarity", with a system based on ConceptNet. ConceptNet is an open, multilingual knowledge graph that focuses on general knowledge that relates the meanings of words and phrases. Our submission to SemEval was an update of previous...
Article
Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and...
Article
Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and...
Article
Full-text available
A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 20...
Conference Paper
The Narratarium Colorizer device receives either keyboard input or speech recognition input and uses natural language processing to extract key terms. The terms are queried for in a knowledge base of words and associated colors, created by leveraging the Open Mind Common Sense database and ConceptNet. The system outputs a continually changing color...
Article
ConceptNet is a knowledge representation project, providing a large semantic graph that describes general human knowledge and how it is expressed in natural language. Here we present the latest iteration, ConceptNet 5, with a focus on its fundamental design decisions and ways to interoperate with it.
Conference Paper
In AI, we often need to make sense of data that can be measured in many different dimensions -- thousands of dimensions or more -- especially when this data represents natural language semantics. Dimensionality reduction techniques can make this kind of data more understandable and more powerful, by projecting the data into a space of many fewer di...
Article
We present Luminoso, a tool that helps researchers to visualize and understand a dimensionality-reduced semantic space based on textual information by exploring it interactively. It streamlines the process of creating such a space by taking input from a directory of text documents, and optionally including common-sense background information. This...
Article
Today millions of web-users express their opinions about many topics through blogs, wikis, fora, chats and social networks. For sectors such as e-commerce and e-tourism, it is very useful to automatically analyze the huge amount of social information available on the Web, but the extremely unstructured nature of these contents makes it a difficult...
Article
Full-text available
Understanding the world we live in requires access to a large amount of background knowledge: the commonsense knowledge that most people have and most computer systems don't. Many of the limitations of artificial intelligence today relate to the problem of acquiring and understanding common sense. The Open Mind Common Sense project began to collect...
Article
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Includes bibliographical references (p. 107-110). In this thesis, I present a system fo...
Conference Paper
Full-text available
We present a game-based interface for acquiring common sense knowledge. In addition to being interactive and en- tertaining, our interface guides the knowledge acquisition process to learn about the most salient characteristics of a particular concept. We use statistical classification methods to discover the most informative characteristics in the...
Conference Paper
We are interested in the problem of reasoning over very large common sense knowledge bases. When such a knowledge base contains noisy and subjective data, it is important to have a method for making rough conclusions based on similarities and tendencies, rather than absolute truth. We present Analogy Space, which accomplishes this by forming the an...
Article
Open Mind Commons is an interface for collecting common sense knowledge from users over the Web. By giving the user many forms of feedback and using inferences by anal-ogy to find appropriate questions to ask, Open Mind Com-mons can learn well-connected structures of common sense knowledge, refine its existing knowledge, and build analo-gies that l...
Article
The Open Mind Common Sense project has been collecting common-sense knowledge from volun-teers on the Internet since 2000. This knowledge is represented in a machine-interpretable seman-tic network called ConceptNet. We present ConceptNet 3, which improves the acquisition of new knowledge in ConceptNet and facilitates turning edges of the network b...
Article
The occurrence of F0 peaks on nonprominent syllables in American English (e.g., -ing or a- in reading again) raises the question of how to label these inflection points. This pattern is not infrequent, as shown by samples from two prosodically labeled corpora of natural speech (ToBI labeled MIT Maptask and BU FM Radio News). The Maptask sample from...
Article
The expression of motion verbs differs between languages. The path of motion, such as crossing or entering, is more promi-nently featured in path-based languages such as Spanish than in manner-based languages such as English. Here, we revisit the data from a study on manner and path biases in verb lexi-calization (Havasi & Snedeker, 2004), and crea...
Article
—Singular value decomposition (SVD) is a powerful technique for finding similarities and patterns in large data sets. SVD has applications in text analysis, bioinformatics, and recommender systems, and in particular was used in many of the top entries to the Netflix Challenge. It can also help generalize and learn from knowledge represented in a sp...

Network

Cited By

Projects

Project (1)