- A preview of this full-text is provided by Springer Nature.
- Learn more
Preview content only
Content available from Data Mining and Knowledge Discovery
This content is subject to copyright. Terms and conditions apply.
Data Min Knowl Disc (2017) 31:1359–1390
DOI 10.1007/s10618-017-0518-x
MixedTrails: Bayesian hypothesis comparison on
heterogeneous sequential data
Martin Becker1·Florian Lemmerich2·
Philipp Singer2·Markus Strohmaier2·
Andreas Hotho1
Received: 18 December 2016 / Accepted: 5 June 2017 / Published online: 7 July 2017
© The Author(s) 2017
Abstract Sequential traces of user data are frequently observed online and offline,
e.g., as sequences of visited websites or as sequences of locations captured by GPS.
However, understanding factors explaining the production of sequence data is a chal-
lenging task, especially since the data generation is often not homogeneous. For
example, navigation behavior might change in different phases of browsing a website
or movement behavior may vary between groups of users. In this work, we tackle
this task and propose MixedTrails, a Bayesian approach for comparing the plausibil-
ity of hypotheses regarding the generative processes of heterogeneous sequence data.
Each hypothesis is derived from existing literature, theory, or intuition and represents
a belief about transition probabilities between a set of states that can vary between
groups of observed transitions. For example, when trying to understand human move-
ment in a city and given some data, a hypothesis assuming tourists to be more likely
Responsible editors: Kurt Driessens, Dragi Kocev, Marko Robnik-Šikonja, Myra Spiliopoulou
BMartin Becker
becker@informatik.uni-wuerzburg.de
Florian Lemmerich
florian.lemmerich@gesis.org
Philipp Singer
me@philippsinger.com
Markus Strohmaier
markus.strohmaier@gesis.org
Andreas Hotho
hotho@informatik.uni-wuerzburg.de
1Data Mining and Information Retrieval Group, University of Würzburg, Würzburg, Germany
2GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany
123
Content courtesy of Springer Nature, terms of use apply. Rights reserved.