Conference PaperPDF Available

Soundbeam: a Platform for Sonifying Web Tracking

Authors:

Abstract and Figures

Government spying on internet traffic has become ubiquitous. Not to be left out, the private sector tracks our online footprint via web beacons and cookies. Web services such as Google track our progress as we surf the net and click on links. The Mozilla plugin, Lightbeam (formerly Collusion), shows the user a visual map of every site to which a surfer's data is sent. An interconnected web of advertisers and other otherwise invisible data-gatherers quickly builds during normal usage. We have modified this plugin so that as the graph builds, its state is broadcast via OSC. We will act as translational agents in a process of live data sonification. The collected data is the material with which we will develop a set of musical gestures based on patterns we may discover. The findings of our data collection and the developed music will be presented in the form of an audiovisual live performance. Snippets of collected text and URLs will both form the basis of our audio interpretation and also be projected onto a screen, so an audience can voyeuristically experience the activities of governments and advertisers.
Content may be subject to copyright.
Soundbeam: a Platform for Sonifying Web Tracking
Charles Celeste
Hutchins
University of Kent
celesteh@gmail.com
Holger Ballweg
BiLE
holger.ballweg@gmail.com
Shelly Knotts
Durham University
michelle.knotts@durham.ac.uk
Jonas Hummel
Manchester Metropolitan
University
mail@jonashummel.de
Antonio Roberts
BiLE
antonio@hellocatfood.com
ABSTRACT
Government spying on internet traffic has become ubiqui-
tous. Not to be left out, the private sector tracks our online
footprint via web beacons and cookies. Web services such
as Google track our progress as we surf the net and click
on links. The Mozilla plugin, Lightbeam (formerly Collu-
sion), shows the user a visual map of every site to which a
surfer’s data is sent. An interconnected web of advertisers
and other otherwise invisible data-gatherers quickly builds
during normal usage. We have modified this plugin so that
as the graph builds, its state is broadcast via OSC.
We will act as translational agents in a process of live data
sonification. The collected data is the material with which
we will develop a set of musical gestures based on patterns
we may discover. The findings of our data collection and
the developed music will be presented in the form of an
audiovisual live performance. Snippets of collected text and
URLs will both form the basis of our audio interpretation
and also be projected onto a screen, so an audience can
voyeuristically experience the activities of governments and
advertisers.
Keywords
big data, Lightbeam, privacy, sonfication
1. BIG DATA
Our original goal in setting out was to create something
that was based on Open Source Software that engaged the
issue of surveillance. We became specifically interested in
big data because of the ”public/private surveillance part-
nership”[9] between big data and governmental bodies and
the collaboration this requires of the most popular web-
sites. The Mozilla Foundation, who do activism around
issues of privacy and the open internet, publish a Firefox
plugin called Lightbeam. This tool, formerly called Collu-
sion, tracks every server that a user visits. This includes
both the URLs that they purposefully click on or type into
the address bar of their browser and also websites that their
intended destination refers to, but that the user does not
connect to directly[8].
The additional connections embedded in a page can be
entirely benign. For example, Facebook uses a dedicated
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwise, to
republish, to post on servers or to redistribute to lists, requires prior specific
permission and/or a fee.
NIME’14, June 30 – July 03, 2014, Goldsmiths, University of London, UK.
Copyright remains with the author(s).
domain name (fbcdn.net) for pictures. However, some third
party sites can and do track users around the web, with the
collaboration of website owners. Sites such as ScoreCard
Research embed invisible images into third party web pages
and use cookies to track browsing habits[1]. These data are
used for advertising and audience analytics[3]. ComScore,
which owns ScoreCard, tracks two million users at a time[1].
Even websites that do not have advertising help gather in-
formation for big data. Many websites are concerned about
Search Engine Optimisation: they want to know which search
terms brought users to their website, so they install Google
Analytics[5]. It is within Google’s power just to share that
information with website owners. However, with Google
Analytics, website owners get a nice interface and can easily
buy things from Google to increase traffic, such as search
keywords[5] or to track repeat users and serve adverts to
them based on past visits[4]. This data-gathering appears
on many websites that do not serve adverts. Both IRCAM
and the University of Kent have Google tracking on their
websites[10, 6].
By design, this tracking is all invisible. The Lightbeam
plugin aims to make this visible by building a table that
users can view, or, more strikingly, a graph showing the in-
terconnected network of trackers and sites that updates as
the user surfs (see Figure 1). While these data are educa-
tional to users and useful for researchers, users must divert
their attention to the plugin to see the data. Users do not
get an immediate sense of the constant growth of the graph.
2. SONIFICATION
Our project builds on Lightbeam to provide a more powerful
illustration of tracking in real time. By sonifying ongoing
data, we give users a stronger sense of just how much track-
ing is going on. Even people who are concerned about the
privacy issues surrounding big data do not tend to have a
sense of just how many calls to third parties are made when
they click an advertising-supported website. With sonifica-
tion, calls to tracking websites can be brought to the user’s
attention in real time.
We forked the Lightbeam plugin[2] to add support for
OSC messaging. Our version of the plugin uses node.js to
send OSC messages to the localhost on port 57120, which is
the default port for SuperCollider. The OSC message tags
are /visited and /unvisited, the latter of which are poten-
tially tracking the user. Additional arguments are the site’s
url, the number of other sites it’s connected to, the number
of times visited, and the number of cookies attached.
The piece we are developing with this, called Soundbeam,
is therefore based on web surfing. Thus, in accord with the
old computer-musician joke, we are checking our email on
stage, but also Facebook, Buzzfeed or whatever other site
Proceedings of the International Conference on New Interfaces for Musical Expression
497
Figure 1: A Lightbeam graph generated by visiting
some of this article’s references.
captures our fancy. While we surf, we sonify the third party
sites we hit. Additionally, to highlight both the ubiquitous-
ness and interconnectedness of tracking, each user also an-
nounces to the LAN the third party sites they encounter.
When a user receives such a notification, if they have also
been tracked by the same site, they also sonify a match.
The score for the piece calls for this sound to be more in
the foreground than third party sites hit individually. The
presence of cookies, the number of connections and number
of times hit may also impact the timbre of the sound event.
In creating this piece, we have researched big data track-
ing companies and are building a table of regular expres-
sions to match URLs. Matches are associated with known
big data companies.
Listing 1: Regular Expressions that match Google
g oo gl e [ ^ \ t \r \ n \v \ f \. ] *\ . c om
g oo gl e [ ^ \ t \r \ n \v \ f \ .] * \. c o \. u k
g st at i c \. c om
[^ \ t\ r \n \ v \f \ . ]* \ . go o gl e [ : wo rd : ]* \. co m
[^ \ t\ r \ n \v \ f \ .] * \. g oo g l ea p is . co m
d ou b l ec l ic k . n et
This is used to determine whether two players are being
tracked by the same company. Also, depending on how
benign we deem a third party site to be (e.g. Facebook’s
image serving site has no nefarious intentions that we know
of), this may change associated sound events.
As with all BiLE pieces, how sound is generated and
played is left up to each player[7]. In order to avoid in-
terfering with web browsing, the score specifies that players
should have only minimal interaction with any musical GUI.
If players want more control over their gestures, therefore,
they will be empowered to use devices such as MIDI faders
to control aspects of their sounds.
3. VISUALISATIONS
Graphical projections will greatly increase the clarity of the
piece to audiences. This part of the piece is very much still
under development, but possible projections could include
the Lightbeam graphs generated by some of the players,
URLs of tracking sites, and text from websites we inten-
tionally visit.
4. OTHER APPLICATIONS
In addition to live performance, this technology could be
well-suited to to interactive installations. Users could be
encouraged to browse the web via a public kiosk. As they
browsed, sound events would alert them to big data com-
panies tracking them. Additionally, the installation could
be fitted with several additional monitors which could dis-
play information about the tracking sites they encounter. If
a user hit a site with five trackers, such as the University
of Kent homepage[10], five extra monitors mounted around
the kiosk could display information about each tracker in-
cluding the owner, their corporate logo, market penetration
and types of tracking employed. While this would be less
musical than a performance, it could potentially be more
educational and make a greater impression on the audience.
5. CONCLUSION
Surreptitious data collection, including via tracking web-
sites is one of the major privacy issues affecting web users
today. We hope that the Soundbeam piece described here
addresses these issues in a way that is both educational and
musical and that the code we develop finds other artistic
uses that engage these issues.
6. REFERENCES
[1] T. Beleaga and J. Geary. Scorecardresearch
(comscore): What is it and what does it do? The
Guardian, April 2012. http://gu.com/p/374xn/tw
[Access date: 30 April 2014].
[2] BiLE. Bilensemble/lightbeam, February 2014.
https://github.com/BiLEnsemble/lightbeam
[Access date: 30 April 2014].
[3] ComScore. Products, 2014.
http://www.comscore.com/Products [Access date: 30
April 2014].
[4] J. Geary. Tracking the trackers: What are cookies?
an introduction to web tracking. The Guardian, April
2012. http://www.theguardian.com/technology/
2012/apr/23/cookies-and-web-tracking-intro
[Access date: 30 April 2014].
[5] Google. Features: Google analytics, n.d. https:
//www.google.com/analytics/features/index.html
[Access date 30 April 2014].
[6] IRCAM. Www ircam, n.d. http://www.ircam.fr/
[Access date 30 April 2014].
[7] S. Knotts. Bile manifesto, 2011.
http://www.bilensemble.co.uk/manifesto/ [Access
date: 1 April 2014].
[8] Mozilla Foundation. Lighbeam for firefox, n.d.
https://www.mozilla.org/en-US/lightbeam/
[Access date: 30 April 2014].
[9] B. Schneier. Don’t listen to google and facebook: The
public-private surveillance partnership is still going
strong. The Atlantic, April 2014.
http://bit.ly/1rutQdN [Access date: 28 April 2014].
[10] The University of Kent. The university of kent, n.d.
http://www.kent.ac.uk/ [Access date: 30 April
2014].
Proceedings of the International Conference on New Interfaces for Musical Expression
498
... Our contribution is a sonification-based approach to raise user awareness of web tracking which extends the possibilities of ex isting approaches like Soundbeam by Hutchins et al. [10]. We describe a framework for live web tracking analysis and conver sion to OSC 1 events, which can be used to monitor web tracking on any network connection -across all kinds of browsers, apps and devices. ...
... Sonification of web tracking can increase this awareness as it provides immediate auditory feedback to the user while he or she is browsing the internet. Soundbeam [10] sonifies third-party connections extracted by Mozilla Lightbeam, a plug-in for the Mozilla Firefox browser. It sends data on intentionally visited websites and unintentionally visited third-parties (e.g., analytics or advertisement providers) to the SuperCollider synthesis engine via OSC. ...
... When user B encounters a third-party element that has been identified by user A before, it is sonified for both users. This is intended to "highlight both the ubiqitousness and interconnectedness of tracking" [10]. ...
Conference Paper
Full-text available
Web tracking is found on 90 % of common websites. It allows online behavioral analysis which can reveal insights to sensitive personal data of an individual. Most users are not aware of the amout of web tracking happening in the background. This paper contributes a sonification-based approach to raise user awareness by conveying information on web tracking through sound while the user is browsing the web. We present a framework for live web tracking analysis, con­ version to Open Sound Control events and sonification. The amount of web tracking is disclosed by sound each time data is exchanged with a web tracking host. When a connection to one of the most prevalent tracking companies is established, this is additionally indicated by a voice whispering the company name. Compared to existing approaches on web tracking sonification, we add the capability to monitor any network connection, including all browsers, applications and devices. An initial user study with 12 participants showed empirical support for our main hypothesis: exposure to our sonification sig­ nificantly raises web tracking awareness.
... Some examples include sonification of soil elements (Suchánek, 2020), georeferenced data (Park et al., 2010), earthquake data (Lindborg, 2017), web tracking (Hutchins et al., 2014), the movement of fish (Mercer-Taylor and Altosaar, 2015), Nuclear Magnetic ...
Thesis
Full-text available
Even though Python is a very popular programming language with a wide range of applications, in the domain of music, specifically electronic music, it is much less used than other languages and programming environments that have been built explicitly for musical creation, such as SuperCollider, Pure Data, Csound, Max, and Chuck. Since 2010 a Python module for DSP called Pyo has been available. This module consists of a complete set of DSP algorithms, Unit Generators, filters, effects, and other tools for the creation of electronic music and sound, yet its community is rather limited. Being part of Python, this module can be combined with a big variety of native and external Python modules for musical or extra-musical tasks, facilitating the realisation of interdisciplinary artworks focusing on music and sound. Starting a creative journey with this module, I was led to more Pythonic techniques for tasks other than music, like mining tweets from Twitter or creating code poetry, which I incorporated into my musical activity. This practice-based research explores the field of the creation of musical works based on Python by focusing on three works. The first one is a live coding poetry opera where the libretto is written in Python. The second one is a live algorithmic composition for an acoustic ensemble based on input from Twitter. The last work is a combination of live coding with live patching on a hardware modular synthesiser system. The main objective of this thesis is to determine the creative potential of Python in music and mixed media art by posing questions that are answered through these works. By doing this, this research aims to provide a conceptual framework for artistic creation that can function as inspiration to other musicians and artists. The title of this thesis is based on one of the most popular lines of the Monty Python comedy troupe, "the Knights who say Ni!", since the initial developer of the Python i programming language, Guido van Rossum, gave this name to this language inspired by Monty Python.
... Moreover it is restricted by information Google provides to third party developers: The Chrome API gives developers access to each time a cookie is inserted, deleted or updated. It does not allow developers to register how often a cookie is read by the browser and the web server, Listening Back is not the first tool of this type: In June 2014, Charles Hutchins from the University of Kent and his working group published a conference paper on Soundbeam: A Platform for Sonifying Web Tracking (Hutchins et al. 2014). Five years later, Otto Lutz (Fraunhofer Institute for Open Communication Systems FOKUS) and Jacob Leon Kröger (Technical University Berlin) submitted a study on Surfing in Sound: Sonification of Hidden Web Tracking (Lutz et al. 2019). ...
... In this context, users are network security specialists using the auditory modality as a supplementary resource to improve pattern, anomaly and intrusion detection. For non-expert users, hidden online tracking has been sonified to convey privacy and security issues in order to raise awareness for these concerns [21,25]. ...
Conference Paper
Despite 2-factor authentication and other modern approaches, authentication by password is still the most commonly used method on the Internet. Unfortunately, as analyses show, many users still choose weak and easy-to-guess passwords. To alleviate the significant effects of this problem, systems often employ textual or graphical feedback to make the user aware of this problem, which often falls short on engaging the user and achieving the intended user reaction, i.e., choosing a stronger password. In this paper, we introduce auditory feedback as a complementary method to remedy this problem, using the advantages of sound as an affective medium. We investigate the conceptual space of creating usable auditory feedback on password strength, including functional and non-functional requirements, influences and design constraints. We present web-based implementations of four sonification designs for evaluating different characteristics of the conceptual space and define a research roadmap for optimization, evaluation and applications.
Scorecardresearch (comscore): What is it and what does it do? The Guardian
  • T Beleaga
  • J Geary
T. Beleaga and J. Geary. Scorecardresearch (comscore): What is it and what does it do? The Guardian, April 2012. http://gu.com/p/374xn/tw [Access date: 30 April 2014].
Tracking the trackers: What are cookies? an introduction to web tracking. The Guardian
  • J Geary
J. Geary. Tracking the trackers: What are cookies? an introduction to web tracking. The Guardian, April 2012. http://www.theguardian.com/technology/ 2012/apr/23/cookies-and-web-tracking-intro [Access date: 30 April 2014].
Don't listen to google and facebook: The public-private surveillance partnership is still going strong. The Atlantic
  • B Schneier
B. Schneier. Don't listen to google and facebook: The public-private surveillance partnership is still going strong. The Atlantic, April 2014. http://bit.ly/1rutQdN [Access date: 28 April 2014].