Figure 2 - uploaded by Gregory Grefenstette
Content may be subject to copyright.
Clicking on the tab expands a box that shows the Inria teams that are associated with subjects on the Wikipedia page. For example, one subject mentioned on the page is "Gaussian processes" and 3 Inria teams that work in this domain are listed ASPI, ATHENA, and BIGS, with their expanded team names. Clicking on "Gaussian processes" goes to an ACM 2012 ontology page for this concept. Clicking on a team name goes to a web page from their 2014 annual report where a project involves "Gaussian processes". On this page, the user can find team members who are experts in the area. (See Figures 3 and 4)
Source publication
Finding experts for a given problem is recognized as a difficult task. Even
when a taxonomy of subject expertise exists, and is associated with a group of
experts, it can be hard to exploit by users who have not internalized the
taxonomy. Here we present a method for both attaching experts to a domain
ontology, and hiding this fact from the end use...
Context in source publication
Context 1
... subjects and teams (see Figure 2) are linked to pages outside Wikipedia, so that Wikipedia has become a search engine with the user browsing towards their query (here "Autoregressive Models", with the pull down expert box corresponding the search engine results page, leading to outside content. The user can find the Wikipedia article closest to his or her concern, and use the expert finding tab to find local experts who know about the subjects on the page. ...
Similar publications
Tree-structured data naturally appear in various fields, particularly in biology where plants and blood vessels may be described by trees, but also in computer science because XML documents form a tree structure. This paper is devoted to the estimation of the relative scale of ordered trees that share the same layout. The theoretical study is achie...
Third party tracking is the practice by which third parties recognize users accross different websites as they browse the web. Recent studies show that 90% of websites contain third party content that is tracking its users across the web. Website developers often need to include third party content in order to provide basic functionality. However,...
Multiple approaches to grab comparable data from the Web have been developed up to date. Nevertheless, coming out with a high-quality comparable corpus of a specific topic is not straightforward. We present a model for the automatic extraction of comparable texts in multiple languages and on specific topics from Wikipedia. In order to prove the val...
In this paper, we describe the construction of TeKnowbase, a knowledge-base of technical concepts in computer science. Our main information sources are technical websites such as Webopedia and Techtarget as well as Wikipedia and online textbooks. We divide the knowledge-base construction problem into two parts -- the acquisition of entities and the...
Mined Semantic Analysis (MSA) is a novel distributional semantics approach
which employs data mining techniques. MSA embraces knowledge-driven analysis of
natural languages. It uncovers implicit relations between concepts by mining
for their associations in target encyclopedic corpora. MSA exploits not only
target corpus content but also its knowle...