Carlos Martín DancausaRobert Gordon University | RGU · Institute for Innovation, Design and Sustainability (IDEAS)
Carlos Martín Dancausa
Research Fellow
About
24
Publications
18,922
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
777
Citations
Publications
Publications (24)
This paper identifies the peaks and troughs in Twitter usage during three televised Scottish Independence Referendum debates in Autumn 2014 and identifies the topics that were the foci of such peaks and troughs. We observe that the issues that caught the most attention from the Twitter sample changed from debate to debate, suggesting that viewers w...
Twitter has become a dependable microblogging tool for real time information dissemination and newsworthy events broadcast. Its users sometimes break news on the network faster than traditional newsagents due to their presence at ongoing real life events at most times. Different topic detection methods are currently used to match Twitter posts to r...
Ask two people to describe an event they have both experienced, and you will usually hear two very different accounts. Witnesses bring their own preconceptions and biases which makes objective story-telling all but impossible. Despite this, recent work on algorithmic topic detection, event summarization and content generation often has a stated aim...
Messages from social media are increasingly being mined to extract useful information and to detect trends. These can relate to matters as serious as earthquakes and wars or as trivial as haircuts and cats. Football remains one of the world’s most popular sports, and events within big matches are heavily discussed on Twitter. It therefore provides...
Identifying and verifying new information quickly are key issues for journalists who use social media. This article examines what tools journalists think they need to cope with the growing volume and complexity of news on social media, and what improvements are needed in existing systems. It gives some initial results from a major European Union re...
Social-networking services such as Twitter offer users the potential to participate in public debate. When used whilst watching a television programme, Twitter allows backchannel discussion and debate in real time, which can add a new dimension and pleasure to television watching. When used in conjunction with televised political debates, Twitter c...
Twitter is becoming an ever more popular platform for discovering and sharing information about current events, both personal and global. The scale and diversity of messages makes the discovery and analysis of breaking news very challenging. Nonetheless, journalists and other news consumers are increasingly relying on tools to help them make sense...
Newsworthy stories are increasingly being shared through social networking platforms such as Twitter and Reddit, and journalists now use them to rapidly discover stories and eye-witness accounts.We present a technique that detects “bursts” of phrases on Twitter that is designed for a real-time topic-detection system. We describe a time-dependent va...
We describe the participation of the SocialSensor team in the Retrieving Diverse Social Images Task of MediaEval 2013. We submitted entries for all five runs after developing independent algorithms for visual features, text features and internet features (including local weather data). Our best CR@10 results came in the visual-only run, while the v...
Online social and news media generate rich and timely information about real-world events of all kinds. However, the huge amount of data available, along with the breadth of the user base, requires a substantial effort of information filtering to successfully drill down to relevant topics and events. Trending topic detection is therefore a fundamen...
This paper presents the first steps towards implementing a vision for a real-time system that aims to incorporate emerging knowledge from Social Media. In order to achieve this, proper context-based analysis techniques are being developed in terms of the SocialSensor FP7 project, and are applied on content residing in social networks in order to re...
Focusing on the context of XML retrieval, in this paper we propose a general methodology for managing structured queries (involving both content and structure) within any given structured probabilistic information retrieval system which is able to compute posterior probabilities of relevance for structural components given a non-structured query (i...
The use of structured documents following XML representation allows us to create content and structure (CAS) queries which are more specific for the user's needs. In this paper we are going to study how to enrich this kind of queries with the user feedback in order to get results closer to their needs. More formally, we are considering how to perfo...
We created a corpus consisting of all parliamentary docu- ments from Spain since its rst legislative period in 1977. The documents were collected from the web page of the Spanish Congress http://www.congreso.es and converted into a uniform XML format with extensive metadata in the Dublin Core standard. The collection contains over 50.000 documents...
Relevance Feedback (RF) is a technique allowing to enrich an initial query according to the user feedback in order to get results closer to the user's information need. This paper presents a new RF method for keyword queries (content queries). It is based on the re-weighting of the original query terms plus the addition of new query terms from the...
Purpose
The purpose of this paper is to present an overview of the reorganisation of the Andalusian Parliament's digital library to improve the electronic representation and access of its official corpus by taking advantage of a document's internal organisation. Video recordings of the parliamentary sessions have also been integrated with their cor...
In this work we propose new utility models for the structured information retrieval system Garnata, and expose the results
of our participation at INEX’08 in the AdHoc track using this system.
This paper describes the development of the XML digital library in Spanish from official documents published by Parliament of Andalucía. These documents include discussions about some important matters affecting citizens from the southern Spanish region of Andalucía. The original documents, which are organized around a very well defined structure,...
IntroductionOverviewBayesian networks and information retrievalTheoretical foundationsBuilding the information retrieval systemConclusion
In this paper, an integrated system for searching the official documents published by the Parliament of Andalusia is presented. It uses the internal struc- ture of these documents in order to offer not only complete documents but parts of them given a query. Additionally, as the sessions of the Parliament are recorded in video, jointly to the text,...
The Parliament of Andalusia records all the parliamentary sessions as well as generates files with the exact transcription of the files. With these two types of media, a search engine, starting from a user's query, would return the relevant documents for that query, but also a link to the corresponding portion of the video where the speech is playe...
This paper exposes the results of our participation at INEX’07 in the AdHoc track and the comparison of these results with
respect to the ones obtained last year. Three runs were submitted to each of the Focused, Relevant In Context and Best In
Context tasks, all of them obtained with Garnata, our Information Retrieval System for structured documen...
Resumen La realimentación por relevancia es una técnica que se puede utilizar para refinar una consulta inicial formulada por un usuario a un sistema de recuperación de información, teniendo en cuenta los resultados obtenidos por el propio sistema y evaluados por el usuario para dicha con-sulta. La consulta modificada se construye incluyendo nuevos...