Conference PaperPDF Available

Towards Using the Potential of OpenStreetMap History for Disaster Activation Monitoring

Authors:
  • Heidelberg Institute for Geospatial Technology (HeiGIT)
  • Heidelberg Institute for Geoinformation Technology

Abstract and Figures

Over the last couple of years, the growing OpenStreetMap (OSM) database repeatedly proved its potential for various use cases, including disaster management. Disaster mapping activations show increasing numbers of contributions, but oftentimes raise questions related to the quality of the provided Volunteered Geographic Information. In order to better monitor and understand OSM mapping and data quality, we developed the ohsome software platform that applies big data technology to OSM full history data. OSM full history data monitoring allows detailed analyses of the OSM data evolution and the detection of remarkable patterns over time. This paper illustrates the specific potential of our platform for disaster activations by means of two case studies. Initial results demonstrate that our flexible and scalable platform structure enables fast and easy information extraction and supports mapping processes and data quality assurance.
Content may be subject to copyright.
Michael Auer et al. OSM History for Disaster Activation Monitoring
Towards Using the Potential of
OpenStreetMap History for
Disaster Activation Monitoring
Michael Auer
GIScience Research Group, Institute of
Geography, Heidelberg University
Melanie Eckle
GIScience Research Group, Institute of
Geography, Heidelberg University
Sascha Fendrich
GIScience Research Group, Institute of
Geography, Heidelberg University
Luisa Griesbaum
GIScience Research Group, Institute of
Geography, Heidelberg University
Fabian Kowatsch
GIScience Research Group, Institute of
Geography, Heidelberg University
Sabrina Marx
GIScience Research Group, Institute of
Geography, Heidelberg University
Martin Raifer
GIScience Research Group, Institute of
Geography, Heidelberg University
Moritz Schott
GIScience Research Group, Institute of
Geography, Heidelberg University
Rafael Troilo
GIScience Research Group, Institute of
Geography, Heidelberg University
Alexander Zipf
GIScience Research Group, Institute of
Geography, Heidelberg University
ABSTRACT
Over the last couple of years, the growing OpenStreetMap (OSM) database repeatedly proved its potential for various
use cases, including disaster management. Disaster mapping activations show increasing numbers of contributions,
but oftentimes raise questions related to the quality of the provided Volunteered Geographic Information. In order
to better monitor and understand OSM mapping and data quality, we developed the ohsome software platform that
applies big data technology to OSM full history data. OSM full history data monitoring allows detailed analyses
of the OSM data evolution and the detection of remarkable patterns over time. This paper illustrates the specific
potential of our platform for disaster activations by means of two case studies. Initial results demonstrate that
our flexible and scalable platform structure enables fast and easy information extraction and supports mapping
processes and data quality assurance.
Keywords
OpenStreetMap, OpenStreetMap History, Disaster management, Street network analyses
INTRODUCTION
Up-to-date geodata is a valuable source of information in the aftermath of a disaster, e.g., to coordinate disaster
response and to support routing and navigation in the aected areas. When using proprietary or authoritative
map data, it is dicult to provide such up-to-date information. In contrast, openly accessible maps such as
OpenStreetMap (OSM) enable collaborative and quick mapping of disaster-aected areas, thereby constituting
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
317
Michael Auer et al. OSM History for Disaster Activation Monitoring
a valuable source of information. For example, after the Nepal earthquake 2015 more than 7 000 volunteers
contributed map data to the OSM database using current satellite imagery (Radford 2015). The Humanitarian
OpenStreetMap Team (HOT) coordinates such mapping activities through organized mapping tasks using the HOT
Tasking Manager (HOT Tasking Manager 2018) and communicating the needs of the relief organizations and the
aected population on the ground.
Over the last couple of years, OSM data repeatedly proved its potential for disaster applications, in the disaster
response as well as the other phases of the disaster cycle (Palen et al. 2015; Dittus et al. 2016b; Eckle et al. 2017).
The mapping activations that are supported by many volunteers with various levels of experience oftentimes raise
questions related to the quality of the provided Volunteered Geographic Information (VGI). Learning about the data
quality that can be expected in a disaster activation helps to evaluate the quality and fitness for purpose of the OSM
data. For example, in the case of disaster routing, missing or inaccurately represented streets directly aect the
routing quality.
Quality assurance mechanisms are directly integrated into the HOT workflow in the form of validations carried out
by experienced volunteers. Furthermore, such evaluations can be carried out extrinsically by measuring the data
quality of OSM in comparison to other data sets, e.g., proprietary or authoritative data (Haklay 2010; Neis and Zipf
2012; Neis, Zielstra, et al. 2013). While this method proved to be useful in providing measurable results, data to
conduct such comparisons are rarely available for the areas of interest (Mooney et al. 2010).
Therefore, data quality assessments need to be conducted intrinsically by making use of the OSM full history
data. These data sets contain all edits (being it additions, deletions or modifications of objects) that were made by
the OSM community since the beginning of OSM in 2004. While these data sets oer a great variety of crucial
information, their analysis is challenging due to the large amount of raw data and the evolving object taxonomy
in OSM. In order to address these challenges, we develop the ohsome software platform that applies big data
technology to the OSM full history data. This paper introduces the platform and illustrates its potential for disaster
activation monitoring by means of two case studies.
RELATED WORK
Monitoring disaster activations has already proved to be of great potential for the mapping community itself, and
for data users to get a better idea about the data quality. For the Nepal earthquake response in 2015, Anhorn
et al. (2016) presented a framework to iteratively validate and update OSM objects. The study focused on the
spatio-temporal dynamics of spontaneous shelter camps in OSM. Additionally, the accuracy of the mapping was
analyzed using a crowdsourcing approach. Dittus et al. (2016a) also presented a monitoring approach analyzing
the dierent levels of engagement of volunteers and the resulting impact on the data and user activity. As disaster
mapping activations are oftentimes criticized for a lack of data maintenance, Quattrone et al. (2016) conducted a
long-term monitoring study and assessed OSM data contributions in dierent countries. Besides monitoring during
disaster activations, several post-hoc analyses and evaluations have been conducted, e.g., for Haiti 2010, Typhoon
Haiyan 2013 (Zook et al. 2010; Soden and Palen 2014; Palen et al. 2015) and for the more recent activation in
Nepal 2015 (Poiani et al. 2016; Soden and Palen 2016).
With the iOSMAnalyzer, Barron et al. (2014) developed a first framework for intrinsic OSM analyses and provided
quality indicators for dierent use cases. Jokar Arsanjani et al. (2013) furthermore evaluated the evolution of
OSM data to investigate specific patterns. First comprehensive intrinsic OSM data analyses related to collaborative
disaster mapping have been conducted by Dittus et al. (2017), providing insights about mapper behavior, the
mapping community and eects on the contributed OSM data.
Openly accessible OSM analysis platforms like OSM Analytics (2018) (a web based interactive visualization of
OSM feature density) and EPIC-OSM (a collection of customize-able statistics over raw OSM history data by
Anderson et al. (2016)) already provide some quantitative information about data contributions and contribution
activity for, e.g., specific HOT projects, but do not provide a full picture of the history of OSM data. Existing
software tools to support intrinsic data quality assessments are typically limited by oering only subsets of analysis
methods, and/or operate only on small database extracts for a defined number of case study regions.
THE OHSOME OSM HISTORY ANALYTICS PLATFORM
OSM oers a large quantity of data that is also quite diverse in respect of feature variety and scale. One aspect
of this is the free taxonomy scheme which is based on tags consisting of key-value pairs that are defined by the
community itself. This causes the task of analyzing how the data evolved over time to be rather complex, also
explaining the lack of general purpose analysis software until now.
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
318
Michael Auer et al. OSM History for Disaster Activation Monitoring
The aim of the ohsome platform (Ohsome 2018) that is presented in this paper is to implement a distributed database
and computing infrastructure that gives access to OSM history data and allows for fast parallel processing. In order
to permit arbitrary spatio-temporal analyses, all information that is present in the original OSM history data set and
all available metadata fields are herein considered. The ohsome platform is co-developed with the OSM History
Database (OSHDB). The contained data is divided into spatio-temporal data cubes which may be distributed over a
computing cluster infrastructure. In order to meet dierent user demands, the platform provides an application
programming interface (API) at each of the following three abstraction levels:
The ohsome-API implements a representational state transfer (REST) web service that permits one to
interactively answer and visualize common predefined research questions and analyses. That is, the analyses
may be conducted in a standard web browser. This API is most suitable for the general public and for
non-programmers.
The OSHDB-API exposes the MapReduce big data programming model (Dean and Ghemawat 2004) in the
Java programming language. The Java-API is more flexible than the REST-API but still accessible with basic
knowledge of the Java programming language. Most of the more advanced analyses can be realized using it.
A low level raw data access gives even more direct access to the OSM data objects stored in the database and
more fine-grained control over the data processing. However, a basic understanding of the internal database
structure is required in order to interpret the results of a data request correctly.
Besides the above-mentioned focus on intrinsic data quality metrics, further application of the platform include
exploratory data analyses such as visualizing OSM contributor activity and examining individual OSM objects,
the analysis of OSM contribution patterns, or general geo-statistics and visual analytics. Possible applications of
these functionalities and their potential are presented in the following two case studies. Their purpose is not to
provide comprehensive scientific insights, but to illustrate the kind of analyses that may be performed for scientific
investigations of OSM history data.
CASE STUDY 1: NEPAL EARTHQUAKE 2015
On April 25 and May 12 of 2015, two major earthquakes with magnitudes of 7.8 (36 km east of Khudi) and 7.2
(19 km southeast of Kodari) and several minor tremors hit the central part of Nepal. In this case study we use the
ohsome platform to facilitate an exploratory analysis of the evolution of the OSM street network in the aftermath of
the Nepal earthquakes 2015 in scope of the disaster mapping activities. Thereby, we focus on the OSM’s fitness for
use for disaster routing applications.
Common intrinsic quality indicators concerning the street network are completeness of the data set, positional and
semantic accuracy and topological correctness (Neis and Zipf 2012; Barron et al. 2014). With respect to routing
applications, topological errors in OSM data are described as one of the major challenges (Neis, Singler, et al. 2010;
Sehra et al. 2016). For instance, junctions cannot be determined if intersecting OSM ways do not share a common
node (Neis and Zipf 2012). Sehra et al. (2017) assess the street navigability of OSM data in Northern India using
heuristic indicators and by analyzing road lengths as well as attribute completeness and semantic completeness.
In this case study the evolution of the OSM street network (OSM tag highway=*) is investigated for the Nepal
earthquake 2015 based on the following quality indicators:
Completeness: (a) street lengths aggregated for street categories and (b) percentage of streets which include
street names (e.g., Sehra et al. (2017)),
Topological correctness: (c) percentage of intersecting OSM ways that do not share a common node (d)
percentage of OSM endpoints close (distance
1 m) but not connected to an OSM way (Jokar Arsanjani
et al. 2013; Neis and Zipf 2012; Neis, Zielstra, et al. 2013),
User activity: (e) Number of users that edited OSM streets (Neis and Zipf 2012).
We examine the topological correctness in monthly time steps for the year 2015, the completeness and user activity
is investigated on a daily basis in order to evaluate the direct response phase. The results are presented in Fig. 1.
Volunteers started to map streets in OSM right after the April 2015 earthquake. Within seven days 3619 OSM users
added about 310 000 km to the OSM street network. The maximum number of contributors was reached two days
after the event. Thereafter, the number of mappers decreased from more than 1 000 on April 27, to about 100 per
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
319
Michael Auer et al. OSM History for Disaster Activation Monitoring
0.3302
0.7926
1.2551
1e8 hwy=*
2.79
7.84
12.90
highway=road
5.92
11.34
16.77
hwy with name
0.83
2.21
3.59
junction without node
0.14
0.87
1.60
endpoint next to way
0
539
1078
number of contributors
Jan 1 Feb 1 Mar 1 Apr 1 May 1 June 1 July 1 Aug 1 Sept 1 Oct 1 Nov 1 Dec 1
0.008
0.013
0.019
impassable hwys
count % of all jcts. % of all hwys
% of all hwys
% of all endpts.
% of all hwys length [m]
2015
Figure 1. Intrinsic quality indicators, Nepal earthquake response 2015.
day one month later and about 25 contributors per day at the end of June. The Kathmandu district and surrounding
regions are the areas with the highest number of contributions. Fig. 2(left) shows the spatial distribution of the
mapping activities and the area for which specific road mapping tasks have been defined via the HOT tasking
manager.
Not only did the number of mapped streets increase, but also the number of topological errors (Fig. 2, right). A total
of 2 018 intersecting OSM streets without a common node have been detected before the earthquake on April 1,
increasing up to 6 332 on May 1. The percentage of such errors was relatively constant before the earthquake.
However, topological correctness of the data set has been continually deteriorated after the event. Regarding
OSM street categories we focused on the tag highway=road. This tag is considered to be temporary and is used
for unknown or unverified roads (OSM Wiki: Nepal 2018). During the direct disaster response of the OSM
community, the percentage of streets tagged as roads doubled from 6% to 12% within three days. This indicates
possible semantic inaccuracies in the post-disaster OSM data set. However, the percentage of streets with the
tag highway=road went back to 6% two weeks after the earthquake, meaning that the streets have been further
edited and validated. Furthermore, we assessed the attribute completeness for the OSM key name. The percentage
of streets with a name tag decreased from 13% down to 8% within 10 days after the earthquake on April 25. In
contrast to the streets tagged as highway=road, not much editing was done in the post-disaster data set and the
percentage of streets with a name attribute remained low throughout the year 2015.
Disaster relief routing applications rely on up-to-date OSM information. Thus, this information is crucial to
eectively support relief organizations. This exploratory analysis shows that the collaborative mapping eorts
provided a large amount of street network data after the Nepal earthquake 2015. In the direct-response phase this
was achieved at the expense of data quality. However, contributors continued their mapping activities and corrected,
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
320
Michael Auer et al. OSM History for Disaster Activation Monitoring
Figure 2. Evolution of the OSM street network in Nepal 2015; the extent of the HOT tasks which focused on
street mapping is marked in red (left). Heatmap showing the density of crossing OSM ways without common node
(right).
e.g., the possible semantic inaccuracies due to streets tagged as roads to a certain amount. The presented quality
indicators were calculated in a few minutes, providing high potential for near-real time quality analyses and, thus,
facilitating feedback mechanisms to the OSM mapping community that can help to address issues in a timely
manner. This can help to improve data quality and, thus, applications like OSM routing services to support disaster
response.
CASE STUDY 2: COMPARATIVE STUDY OF HOT ACTIVATIONS
As described above, HOT was activated for several disasters and provided their support with an increasing number
of mapping volunteers over the last years. In each of these activations, main contributions are provided in the
response phase of the disasters which generally lasts for more than four weeks after the launch of an activation.
While the numerous contributions proved to be of great value for the coordination of disaster management, the VGI
character of the data and the dierent levels of experience of the contributing mappers cause uncertainty regarding
the usability of the data in the field.
These kind of questions can be addressed using the ohsome platform, e.g., by conducting a follow-up analysis to
assess further changes and, therefore, validations that had to be conducted. To achieve this, we extract OSM objects
that were added during the direct disaster response period and analyze changes that were applied to the data after the
activation period. Because information about buildings and roads are often the most valuable for disaster response,
we focus our analyses thereon. The three HOT disaster activations that were selected are the Haiti earthquake
activation 2010, the Typhoon Haiyan activation 2013 and the Nepal earthquake activation 2015. While the Typhoon
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
321
Michael Auer et al. OSM History for Disaster Activation Monitoring
3 6 9 12 15 18 21 24 27 30
0
5
10
15
20
25
30
Haiti Philippines Nepal
months after the disaster response
% of created objects changed
Figure 3. Comparison of changes to data that was created during activations (accumulated).
Haiyan and Nepal activation were among the largest in sense of contributors and scope, comparing these activations
to the earliest activation in Haiti enables assessing possible developments of the mapping community over time.
Similar to the event-response on the ground, mapping also undergoes dierent stages of activity: pre-event, event,
event-response and post-event. The three events are characterized through some key statistics for the respective
periods. While the pre-event period shows average numbers of 0.94 (Haiti), 675.18 (Philippines) and 369.46 (Nepal)
total contributions per day, these numbers rise to maxima of 12 156, 76 047 and 186 854, respectively, shortly after
the event. However this has to be seen in relation with the total area of the countries (Haiti: 27 750 km
2
, Philippines:
300 000 km
2
, Nepal: 147 181 km
2
(Central Intelligence Agency 2018) as well as the amount of OSM-Data available
at the start of the activation (Haiti: 1 653 elements, Philippines: 460 079, Nepal: 1 357336). The three events are
furthermore characterized through their respective activation outreach with Haiti showing a mean of 33 unique
mappers per day during the event-response while the Philippines and Nepal responses both have averages nearly
three times as high (Philippines: 96; Nepal: 81).
This exemplary follow-up study, run within minutes and with only little coding eort on the ohsome platform, can
help to estimate, e.g., the quality of the added objects and the maintenance of the added data. Quality is high if
the data does not need to be changed shortly after the event while maintenance increases the number of changes
on the long term. Figure 3shows the percentage of OSM objects created during the event-response period and
changed during the post-event period for our three case-studies. This indicates that quality of the added data is high
because a maximum of 15% of the objects were changed until one year after the event. The percentages in the more
recent Philippines and Nepal activations indicate even higher data quality. At the same time maintenance seems
to take place because these numbers still rise two years after the event. Speaking in total numbers 18 185 objects
were changed in Haiti (Philippines: 42 747, Nepal: 46 720) within 30 months after the disaster activations. These
changes were contributed by 233 unique mappers in Haiti while the Philippines were corrected and maintained by
943 and Nepal by 1 639 unique mappers. Hence while data quality rises, the number of mappers also rises whereby
the mean contribution-share of each mapper decreases.
Furthermore, the ohsome platform was used to detect the type of edits that have been applied to the OSM data
(see Fig. 4). Thereby possible patterns, the evolution of the data and dierences between the three events can be
investigated. Comparing the activations, variations in type of edits become apparent. Figures 4a and 4c show that
most edits after the Haiti earthquake were tag changes, while most edits after the Nepal earthquake consisted in
geometry changes. These patterns could represent the ad hoc character of the Haiti activation that caused a lack
of coordination and structure, e.g., regarding tagging schemes, that had to be corrected afterwards. Less changes
in the later activations could show experience in activation coordination and training, and therefore less need for
corrections. On the other hand, frequently updated satellite data, used as mapping-aid can also lead to a high
number of geometry changes if the alignment of imagery varies.
While ohsome cannot directly provide information about all factors influencing mapping in OSM, it enables
detecting patterns that can then be further analyzed through qualitative assessments of the data and the activation
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
322
Michael Auer et al. OSM History for Disaster Activation Monitoring
3 6 9 12 15 18 21 24 27 30
0
20,000
40,000
60,000
80,000
months after the disaster response
number of changes made by category
(a) Haiti
3 6 9 12 15 18 21 24 27 30
0
20,000
40,000
60,000
80,000
(b) Philippines
3 6 9 12 15 18 21 24 27 30
0
20,000
40,000
60,000
80,000
(c) Nepal
3
9
15
21
27
0
60000
other changes
tag changes
geometry changes
deletions
Figure 4. Number of changes by type for the three selected disaster responses (accumulated).
itself. Its combination with other data sets and contextual information can thereby in the future deliver further
insights.
CONCLUSION AND OUTLOOK
We presented two case studies which demonstrate the potential of the ohsome OSM history analytics platform to
extract valuable information from the OSM history data, including mapping processes and data quality related to
disaster management. Due to its flexibility and scalability, the platform facilitates these kind of analyses of the
OSM data evolution and the detection of contribution patterns in VGI data.
The exploratory analysis of the street network in the aftermath of the Nepal earthquake 2015 shows that the
collaborative mapping eorts could provide a large amount of data within a short time. However, this was at first
achieved at the expense of data quality. Additionally, the evolution of OSM objects that were added during the very
active direct event-response phase was analyzed for Haiti 2010, Haiyan 2013 and Nepal 2015. The results indicate
that maintenance of the added OSM data takes places as the total number of follow-up edits increases in the long
term. Comparing the three activations reveals a higher percentage of OSM objects changed in the aftermath of the
Haiti activation. This could represent the ad hoc character of the first HOT activation, whereas the more recent
activations indicate a higher data quality.
The ohsome OSM history analytics platform is not limited to post-hoc disaster response analyses but can also be
applied to gain deeper insights into OSM data at all four stages of disaster management (mitigation, preparedness,
event, response). Likewise to disasters, OSM data itself changes during these phases. Such patterns and changes
can be analyzed using our platform and enable addressing further questions related to recovery, preparedness and
mitigation: Is the OSM data that was added and/or changed during the response period of an event maintained in
and after disaster recovery? What influence does preparedness mapping have for disaster response and data quality?
Further extensions of the database and data analysis framework will make it possible to analyze the OSM data in
respect to its metadata such as changeset attributes or user-to-user communication via OSM notes and changeset
discussions. While in this study we have shown the potential of the OSM history platform for post-hoc analyses,
future work will also facilitate monitoring OSM data contributions during disaster activation in near-real time.
Quasi-live OSM data updates (by implementing a mechanism that applies OSM’s minutely data updates to our
database) will make it possible to create new tools that help to actively improve OSM data quality by providing
feedback-loops between data producers and consumers. Timely data quality assessment during disaster response
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
323
Michael Auer et al. OSM History for Disaster Activation Monitoring
can help to align training materials, coordination of mapping activities as well OSM-based applications like disaster
relief routing.
ACKNOWLEDGMENTS
This work is supported by the Klaus Tschira Foundation (KTS), Heidelberg. We thank the anonymous reviewers
for their helpful suggestions.
REFERENCES
Anderson, Jennings, Soden, Robert, Anderson, Kenneth M., Kogan, Marina, and Palen, Leysia (2016). “EPIC-
OSM: A Software Framework for OpenStreetMap Data Analytics”. In: Proceedings of the 49th Annual Hawaii
International Conference on System Sciences (HICSS). Piscataway, NJ, USA, pp. 5468–5477.
Anhorn, Johannes, Herfort, Benjamin, and Albuquerque, João Porto de (2016). “Crowdsourced validation and
updating of dynamic features in OpenStreetMap an analysis of shelter mapping after the 2015 Nepal earth-
quake”. In: Proceedings of the 13th International Conference on Information Systems for Crisis Response and
Management (ISCRAM). Rio de Janeiro, Brazil.
Barron, Christopher, Neis, Pascal, and Zipf, Alexander (2014). “A Comprehensive Framework for Intrinsic
OpenStreetMap Quality Analysis”. In: Transactions in GIS 18.6, pp. 877–895.
Central Intelligence Agency (2018). The World Factbook.url:https://www.cia.gov/library/publications/the-world-
factbook/(visited on 03/20/2018).
Dean, Jerey and Ghemawat, Sanjay (2004). “MapReduce: Simplified Data Processing on Large Clusters”. In:
Symposium on Operating System Design and Implementation (OSDI). San Francisco, CA, USA, pp. 137–150.
Dittus, Martin, Quattrone, Giovanni, and Capra, Licia (2016a). “Analysing Volunteer Engagement in Humanitarian
Mapping: Building Contributor Communities at Large Scale”. In: Proceedings of the 19th ACM Conference on
Computer-Supported Cooperative Work &Social Computing (CSCW). San Francisco, CA, USA, pp. 108–118.
Dittus, Martin, Quattrone, Giovanni, and Capra, Licia (2016b). “Social Contribution Settings and Newcomer
Retention in Humanitarian Crowd Mapping”. In: Social Informatics. Ed. by Emma Spiro and Yong-Yeol Ahn.
Cham: Springer International Publishing, pp. 179–193.
Dittus, Martin, Quattrone, Giovanni, and Capra, Licia (2017). “Mass Participation During Emergency Response:
Event-centric Crowdsourcing in Humanitarian Mapping”. In: Proceedings of the 20th ACM Conference on
Computer-Supported Cooperative Work &Social Computing (CSCW). Portland, OR, USA, pp. 1290–1303.
Eckle, Melanie, Herfort, Benjamin, Yan, Yingwei, Kuo, Chiao-Ling, and Zipf, Alexander (2017). “Towards using
Volunteered Geographic Information to monitor post-disaster recovery in tourist destinations”. In: Proceedings
of the 14th International Conference on Information Systems for Crisis Response And Management (ISCRAM).
Xanthi, Greece, pp. 1036–1047.
Haklay, Mordechai (2010). “How Good is Volunteered Geographical Information? A Comparative Study of
OpenStreetMap and Ordnance Survey Datasets”. In: Environment and Planning B: Planning and Design 37.4,
pp. 682–703.
HOT Tasking Manager (2018). HOT Tasking Manager.url:https://tasks.hotosm.org (visited on 01/25/2018).
Jokar Arsanjani, Jamal, Helbich, Marco, Bakillah, Mohamed, and Loos, Lukas (2013). “The emergence and
evolution of OpenStreetMap: A cellular automata approach”. In: International Journal of Digital Earth 8.1,
pp. 76–90.
Mooney, Peter, Corcoran, Padraig, and Winstanley, Adam C. (2010). “Towards Quality Metrics for OpenStreetMap”.
In: Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information
Systems (GIS). San Jose, CA, USA, pp. 514–517.
Neis, Pascal, Singler, Peter, and Zipf, Alexander (2010). “Collaborative mapping and emergency routing for disaster
logistics–case studies from the Haiti earthquake and the UN Portal for Afrika”. In: GI Forum. Salzburg, Austria.
Neis, Pascal, Zielstra, Dennis, and Zipf, Alexander (2013). “Comparison of Volunteered Geographic Information
Data Contributions and Community Development for Selected World Regions”. In: Future Internet 5.2, pp. 282–
300.
Neis, Pascal and Zipf, Alexander (2012). “Analyzing the Contributor Activity of a Volunteered Geographic
Information Project The Case of OpenStreetMap”. In: ISPRS International Journal of Geo-Information 1.3,
pp. 146–165.
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
324
Michael Auer et al. OSM History for Disaster Activation Monitoring
Ohsome (2018). Ohsome OpenStreetMap History Analytics Platform.url:http://www.ohsome.org (visited on
03/13/2018).
OSM Wiki: Nepal (2018). OpenStreetMap Wiki Nepal/Roads.url:https://wiki.openstreetmap.org/wiki/Nepal/
Roads (visited on 01/25/2018).
OSM Analytics (2018). OSM Analytics Tool.url:http://osm-analytics.org (visited on 01/25/2018).
Palen, Leysia, Soden, Robert, Anderson, T. Jennings, and Barrenechea, Mario (2015). “Success & Scale in a
Data-Producing Organization: The Socio-Technical Evolution of OpenStreetMap in Response to Humanitarian
Events”. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI).
Seoul, Republic of Korea, pp. 4113–4122.
Poiani, Thiago Henrique, Rocha, Roberto dos Santos, Degrossi, Livia Castro, and Albuquerque, Joao Porto de
(2016). “Potential of Collaborative Mapping for Disaster Relief: A Case Study of OpenStreetMap in the Nepal
Earthquake 2015”. In: Proceedings of the 49th Annual Hawaii International Conference on System Sciences
(HICSS). Piscataway, NJ, USA, pp. 188–197.
Quattrone, Giovanni, Dittus, Martin, and Capra, Licia (2016). “Exploring Maintenance Practices in Crowd-
Mapping”. In: Proceedings of the 27th ACM Conference on Hypertext and Social Media. HT ’16. Halifax, Nova
Scotia, Canada: ACM, pp. 285–290.
Radford, Tyler (2015). Nepal Earthquake: A note of thanks to HOT’s aerial imagery providers.url:https :
//www.hotosm.org/updates/2015-07-14_nepal_earthquake_a_note_of_thanks_to_hot%E2%80%99s_aerial_
imagery_providers (visited on 01/25/2018).
Sehra, Sukhjit Singh, Singh, Jaiteg, and Rai, Hardeep Singh (2016). “Analysing OpenStreetMap data for topological
errors”. In: International Journal of Spatial, Temporal and Multimedia Information Systems 1.1, p. 87.
Sehra, Sukhjit Singh, Singh, Jaiteg, and Rai, Hardeep Singh (2017). “Assessing OpenStreetMap Data Using
Intrinsic Quality Indicators: An Extension to the QGIS Processing Toolbox”. In: Future Internet 9.2, p. 15.
Soden, Robert and Palen, Leysia (2014). “From Crowdsourced Mapping to Community Mapping: The Post-
earthquake Work of OpenStreetMap Haiti”. In: Proceedings of the 11th International Conference on the Design
of Cooperative Systems (COOP). Nice, France, pp. 311–326.
Soden, Robert and Palen, Leysia (2016). “Infrastructure in the Wild: What Mapping in Post-Earthquake Nepal
Reveals About Infrastructural Emergence”. In: Proceedings of the 2016 Conference on Human Factors in
Computing Systems (CHI). San Jose, California, USA, pp. 2796–2807.
Zook, Matthew, Graham, Mark, Shelton, Taylor, and Gorman, Sean (2010). “Volunteered Geographic Information
and Crowdsourcing Disaster Relief: A Case Study of the Haitian Earthquake”. In: World Medical &Health
Policy 2.2, pp. 6–32.
WiPe Paper Geospatial Technologies and Geographic Information Science for Crisis Management (GIS)
Proceedings of the 15th ISCRAM Conference Rochester, NY, USA May 2018
Kees Boersma and Brian Tomaszewski, eds.
325
... Em um primeiro momento, foram levantados os dados de entrada: a base cartográfica da Cidade da Beira foi adquirida junto ao Instituto Nacional de Gestão de Calamidades (INGC) (INGC, 2019b). Os dados do OpenStreetMap foram adquiridos a partir da Ohsome API (AUER et al., 2018). Foram selecionadas as feições referentes às vias (key:highway) e edificações (key:building), além do número de contribuidores, para o período entre 01 de janeiro de 2015 a 01 de julho de 2019. ...
Article
Full-text available
Em 2019, a Cidade da Beira em Moçambique foi assolada pela passagem do ciclone tropical Idai, deixando 90% da cidade destruída com mais de 600 vítimas fatais e milhares de pessoas necessitando de serviços essenciais de saúde. O mapeamento sistemático do país está desatualizado e com cobertura desigual, o que torna difícil a tomada de decisão face à vulnerabilidade constante do país a eventos como este. O mapeamento colaborativo do OpenStreetMap (OSM) tem sido considerado como alternativa em caso de ausência de mapas oficiais. Em caso de desastres, os contribuidores da organização Humanitarian OpenStreetMap Team (HOT) se unem para gerar dados geoespaciais de modo imediato para apoiar o resgate e assistência às populações desalojadas. Moçambique foi beneficiado com 64 campanhas de mapeamento humanitário, que produziram dados vitais para a preparação para os próximos desastres. Este artigo apresenta um método para avaliar os parâmetros intrínsecos de qualidade dos dados OSM produzidos entre 2015 e 2019 para a Cidade da Beira e analisar a influência das campanhas da HOT nas atividades relacionadas ao ciclone Idai. A análise dos resultados mostra que, apesar do grande volume de dados obtido, a maior parte deles foi gerada por usuários de outros países, carecendo de toponímia e da inclusão das comunidades locais. Esta experiência mostra a importância do estabelecimento de comunidades locais de mapeadores como a rede YouthMappers, para que seja fomentada a criação e uso de dados espaciais colaborativos para a preparação e resposta a desastres em contraponto ao colonialismo de dados.
... Using intrinsic quality indicators based on OSM history, Sehra et al. [159] assessed OSM evolution in India. OSM historical information was also used as a means to characterize the contributors' response in the aftermath of natural disasters, e.g., in terms of frequency of updates and types of contributors (novice vs. experienced OSM users) [35,94,15]. ...
Thesis
Full-text available
Data stored in OpenStreetMap can be more accurate or more recently updated than authoritative sources in some areas, thus requiring to look at other approaches than evaluating OSM comparing it against other databases: methods to perform intrinsic quality analysis are then needed. As this topic is so wide, it has been decided to focus on a specific and less investigated aspect: temporal accuracy. Comparing different areas or features within OSM can provide a relative measurement of temporal accuracy and up-to-dateness, which has been done using standard tools. After mapping existing solutions and evaluating new approaches to gather and process historical data, it has been hypothesized that it could have been worth trying to develop a new software, targeting both OSM contributors and researchers, with a simple interface, reasonable speed and modest technical requirements. This goal has been achieved to a good extent with the development of "Is OSM up-to-date?", after a long process defined by numerous changes of both approaches and libraries, in an increasingly stronger effort to process larger areas more quickly and more efficiently.
... Thus, OSM not only caters to the needs of the general user community (by supporting common infrastructure networks such as road and street networks), but also supports special interest groups' mapping needs (maps such as wheelchair routing, drinking water sources and humanitarian relief 2 ). While the support for such diverse content is a strength of OSM (discussed across diverse areas such as national security (Papapesios et al. 2018), improving accessibility (Zipf et al. 2016) and disaster monitoring (Auer et al. 2018)), the usability and reliability of its data quality have also been questioned (Salk et al. 2016, Hashemi andAli Abbaspour 2015). OSM mandates manual content creation and curation by users (using armchair and outdoor mapping) 3 . ...
Article
Full-text available
Assessing OpenStreetMap (OSM) data quality against authoritative data sources may not always be viable. This is primarily because of the multi-dimensional nature and heterogeneity of the maps, yet the activity is pivotal for targeted data cleansing and quality enhancement undertakings in these data sets. A salient facet of OSM, allowing contributors to flag potential problems encountered during the mapping process, is the FIXME tag. In this article, we examine and discuss OSM data quality through the vast expanse of issues (knowledge) documented via FIXME. We present a classification and analysis of these quality issues, exposed as topic models and grounded in the ISO-19157 standard, across USA and Australia. Regional distributions of these topics are further qualitatively analyzed to ascertain the variation of key issues in OSM. We also present a comparison of the intrinsic issue classification against those identified in an issue corpus of an authoritative map data source. Due to the considerable heterogeneity in user mapping and reporting, OSM issue detection and classification remains problematic. This research presents a flexible and intrinsic data-mining approach, linking established ISO data quality standards to OSM issue categorization. Our work, thus informs the development of automated error correction methods for VGI datasets.
... Route calculation is used in the wide field of disaster risk and humanitarian response in applications ranging from disaster risk management activation [29], logistics [30] to accessing people in need [31]. Besides flooding, emergency response accessibility calculations are also conducted for other disasters such as earthquakes, e.g., by Ertugay et al. [26] and Toma-Danila [32]. ...
Article
Full-text available
Emergency management services, such as firefighting, rescue teams and ambulances, are all heavily reliant on road networks. However, even for highly industrialised countries such as Germany, and even for large cities, spatial planning tools are lacking for road network interruptions of emergency services. Moreover, dependencies of emergency management expand not only on roads but on many other systemic interrelations, such as blockages of bridges. The first challenge this paper addresses is the development of a novel assessment that captures systemic interrelations of critical services and their dependencies explicitly designed to the needs of the emergency services. This aligns with a second challenge: capturing system nodes and areas around road networks and their geographical interrelation. System nodes, road links and city areas are integrated into a spatial grid of tessellated hexagons (also referred to as tiles) with geographical information systems. The hexagonal grid is designed to provide a simple map visualisation for emergency planners and fire brigades. Travel time planning is then optimised for accessing city areas in need by weighing impaired areas of past events based on operational incidents. The model is developed and tested with official incident data for the city of Cologne, Germany, and will help emergency managers to better device planning of resources based on this novel identification method of critical areas.
... The use of VGI data for productive research in the fields of routing [1], three-dimensional (3D) modeling [2], land-cover and land-use analysis [3,4], disaster monitoring [5], urban modeling [6,7], and environmental visualization [8] was widely documented in recent years. Assessing the trust and reliability of diverse user-generated OSM data is equally important to support these investigations. ...
Article
Full-text available
Volunteered geographic information (VGI) encourages citizens to contribute geographic data voluntarily that helps to enhance geospatial databases. VGI’s significant limitations are trustworthiness and reliability concerning data quality due to the anonymity of data contributors. We propose a data-driven model to address these issues on OpenStreetMap (OSM), a particular case of VGI in recent times. This research examines the hypothesis of evaluating the proficiency of the contributor to assess the credibility of the data contributed. The proposed framework consists of two phases, namely, an exploratory data analysis phase and a learning phase. The former explores OSM data history to perform feature selection, resulting in “OSM Metadata” summarized using principal component analysis. The latter combines unsupervised and supervised learning through K-means for user-clustering and multi-class logistic regression for user classification. We identified five major classes representing user-proficiency levels based on contribution behavior in this study. We tested the framework with India OSM data history, where 17% of users are key contributors, and 27% are unexperienced local users. The results for classifying new users are satisfactory with 95.5% accuracy. Our conclusions recognize the potential of OSM metadata to illustrate the user’s contribution behavior without the knowledge of the user’s profile information.
... Using intrinsic quality indicators based on OSM history, Sehra et al. [60] assessed OSM evolution in India. OSM historical information was also used as a means to characterize the contributors' response in the aftermath of natural disasters, e.g. in terms of frequency of updates and types of contributors (novice vs. experienced OSM users) [61][62][63]. ...
Article
Full-text available
OpenStreetMap (OSM) is a well-known crowdsourcing project which aims to create a geospatial database of the whole world. Intrinsic approaches based on the analysis of the history of data, i.e. its evolution over time, have become an established way to assess OSM quality. After a comprehensive review of scientific as well as software applications focused on the visualization, analysis and processing of OSM history, the paper presents “Is OSM up-to-date?”, an open source web application addressing the need of OSM contributors, community leaders and researchers to quickly assess OSM intrinsic quality based on the object history for any specific region. The software, mainly written in Python, can be also run in the command line or inside a Docker container. The technical architecture, sample applications and future developments of the software are also presented in the paper.
... The versatility of the OSHDB is shown by the fact that it is already used in a variety of applications: in ongoing research projects about intrinsic data quality assessment [44,45], for scientific studies related to disaster management [46] and data quality [47], or to run a public web API [48,49] that provides statistics about the evolution of the OSM data. ...
Article
Full-text available
OpenStreetMap (OSM) is a collaborative project collecting geographical data of the entire world. The level of detail of OSM data and its data quality vary much across different regions and domains. In order to analyse such variations it is often necessary to research the history and evolution of the OSM data. The OpenStreetMap History Database (OSHDB) is a new data analysis tool for spatio-temporal geographical vector data. It is specifically optimized for working with OSM history data on a global scale and allows one to investigate the data evolution and user contributions in a flexible way. Benefits of the OSHDB are for example: to facilitate accessing OSM history data as a research subject and to assess the quality of OSM data by using intrinsic measures. This article describes the requirements of such a system and the resulting technical implementation of the OSHDB: the OSHDB data model and its application programming interface.
... Like Wikipedia, the OpenStreetMap (OSM) project offers FHD. OSM's FHD is considered in various research works overviewed in (Auer et al., 2018). The authors of the mentioned work propose "the OHSOME software platform that applies big data technology to the OSM full history data." ...
Chapter
Full-text available
Many methods for intrinsic quality assessment of spatial data are based on the OpenStreetMap full-history dump. Typically, the high-level analysis is conducted; few approaches take into account the low-level properties of data files. In this chapter, a low-level data-type analysis is introduced. It offers a novel framework for the overview of big data files and assessment of full-history data provenance (lineage). Developed tools generate tables and charts, which facilitate the comparison and analysis of datasets. Also, resulting data helped to develop a universal data model for optimal storing of OpenStreetMap full-history data in the form of a relational database. Databases for several pilot sites were evaluated by two use cases. First, a number of intrinsic data quality indicators and related metrics were implemented. Second, a framework for the inventory of spatial distribution of massive data uploads is discussed. Both use cases confirm the effectiveness of the proposed data-type analysis and derived relational data model.
Chapter
In disaggregate travel demand modeling, we often create an artificial population from a sample of surveyed households and individuals. This synthetic population encompasses the same mobility behaviors as the real one and allows dealing with confidentiality. In this process, Crowdsourcing and Volunteered Geographic Information (VGI) represent very useful data sources. The classical approaches of population synthesis, like synthetic reconstruction and combinatorial optimization, cannot be adapted to manage this huge data. A learning approach is then more suited for the synthesizer to improve the goodness-of-fit of its artificial population as Crowdsourcing data becomes richer. To satisfy such learning requirement, we introduce an evolutionary algorithm for population synthesis. Our results confirm that we can gain incrementality without losing goodness-of-fit.
Chapter
Cardiovascular disease is the leading cause of death around the world and its early detection is a key to improving long-term health outcomes. To detect possible heart anomalies at an early stage, an automatic method enabling cardiac health low-cost screening for the general population would be highly valuable. By analyzing the phonocardiogram (PCG) signals, it is possible to perform cardiac diagnosis and find possible anomalies at an early-term. Accordingly, the development of intelligent and automated analysis tools of the PCG is very relevant.
Article
Full-text available
OpenStreetMap (OSM) is a recent emerging area in computational science. There are several unexplored issues in the quality assessment of OSM. Firstly, researchers are using various established assessment methods by comparing OSM with authoritative dataset. However, these methods are unsuitable to assess OSM data quality in the case of the non-availability of authoritative data. In such a scenario, the intrinsic quality indicators can be used to assess the quality. Secondly, a framework for data assessment specific to different geographic information system (GIS) domains is not available. In this light, the current study presents an extension of the Quantum GIS (QGIS) processing toolbox by using existing functionalities and writing new scripts to handle spatial data. This would enable researchers to assess the completeness of spatial data using intrinsic indicators. The study also proposed a heuristic approach to test the road navigability of OSM data. The developed models are applied on Punjab (India) OSM data. The results suggest that the OSM project in Punjab (India) is progressing at a slow peace, and contributors' motivation is required to enhance the fitness of data. It is concluded that the scripts developed to provide an intuitive method to assess the OSM data based on quality indicators can be easily utilized for evaluating the fitness-of-use of the data of any region.
Article
Full-text available
OpenStreetMap is producing huge spatial data contributed by users of different backgrounds and varying level of mapping experiences. Due to this generated map data may be topologically incorrect, which explicitly expresses the spatial relationship between features. To make the map data navigable, it is important that data is free from topological errors. The current work has been conducted to detect topological errors in OpenStreetMap data. OpenStreetMap data of Punjab (India) has been taken as test data for finding topological errors. For cleaning the topological errors, map data has been processed using different algorithms of open source geographic information systems, and it has been concluded that OpenStreetMap data is not free from topological errors and need a thorough preprocessing before being used for navigation purposes.
Conference Paper
Full-text available
Disasters and their impacts have unavoidable spatial characteristics. As such, maps are necessary and omnipresent features of the information landscapes that surround and support disaster response. Professional and volunteer GIS services are increasingly in demand to support map-based information visualization during crises. This paper investigates the work of mapmakers working on the response to the 2015 Nepal earthquakes. In comparison to prior events, we found significantly more collaboration and spatial data sharing took place between map producers working across humanitarian organizations and parts of the Nepal government. Collaboration between mapping practitioners was supported by a complex and emergent information infrastructure composed of social and technical elements, some of which were brought through experience with prior disaster events, and some which were shaped anew by the availability and acceptance of open data sources. Our research investigates these elements of the spatial information infrastructure in post-earthquake Nepal to consider infrastructural emergence.
Conference Paper
Full-text available
The paper presents results from a validation process of OpenStreetMap (OSM) rapid mapping activities using crowdsourcing technology in the aftermath of the Gorkha earthquake 2015 in Nepal. We present a framework and tool to iteratively validate and update OSM objects. Two main objectives are addressed: first, analyzing the accuracy of the volunteered geographic information (VGI) generated by the OSM community; second, investigating the spatio-temporal dynamics of spontaneous shelter camps in Kathmandu. Results from three independent validation iterations show that only 10 % of the OSM objects are false positives (no shelter camps). Unexpectedly, previous mapping experience only had a minor influence on mapping accuracy. The results further show that it is critical to monitor the temporal dynamics. Out of 4,893 identified shelter camps, 54% were already empty/closed six days after the first mapping. So far, updating geographical features during humanitarian crisis is not properly addressed by the existing crowdsourcing approaches.
Conference Paper
Crowdsourcing platforms have become important information providers after disaster events. While they can build on some prior experiences, it is not yet well understood how contributor capacity for such activities is constituted. To what extent are initiatives building a dormant task force that springs to action when it is needed? Alternatively, do they mainly rely on the recruitment of new contributors during disaster events, possibly at the expense of contribution quality? We seek to develop a better understanding of these relationships, using the example of the Humanitarian OpenStreetMap Team. In a large-scale quantitative study, we assess the outcomes of 26 campaigns with almost 20,000 participants. We find that event-centric campaigns can be significant recruiting and reactivation events, however that this is not guaranteed. Our analytical methods provide a means of interpreting key differences in outcomes. We close with recommendations relating to the promotion and coordination of event-centric campaigns in HOT and related platforms.
Conference Paper
Organisers of crowd mapping initiatives seek to identify practices that foster an active contributor community. Theory suggests that social contribution settings can provide important support functions for newcomers, yet to date there are no empirical studies of such an effect. We present the first study that evaluates the relationship between colocated practice and newcomer retention in a crowd mapping community, involving hundreds of first-time participants. We find that certain settings are associated with a significant increase in newcomer retention, as are regular meetings, and a greater mix of experiences among attendees. Factors relating to the setting such as food breaks and technical disruptions have comparatively little impact. We posit that successful social contribution settings serve as an attractor: they provide opportunities to meet enthusiastic contributors, and can capture prospective contributors who have a latent interest in the practice.
Conference Paper
Crowd-mapping is a form of collaborative work that empowers users to gather and share geographic knowledge. OpenStreetMap is one of the most successful examples of such paradigm, where the goal of building a global map of the world is collectively performed by over 2M contributors. Despite geographic information being intrinsically evolving, little research has so far gone into analysing maintenance practices in these domains. In this paper, we perform a preliminary exploration to quantitatively capture maintenance dynamics in geographic crowd-sourced datasets, in terms of: the extent to which different maintenance actions are taking place, the type of spatial information that is being maintained, and who engages in these practices. We apply this method to 117 countries in OSM, over one year of mapping activity. Our findings reveal that, although maintenance practices vary substantially from country to country in terms of how widespread they are, strong commonalities exist in terms of what metadata is being maintained and by whom.
Conference Paper
Organisers of large-scale crowdsourcing initiatives need to consider how to produce outcomes with their projects, but also how to build volunteer capacity. The initial project experience of contributors plays an important role in this, particularly when the contribution process requires some degree of expertise. We propose three analytical dimensions to assess first-time contributor engagement based on readily available public data: cohort analysis, task analysis, and observation of contributor performance. We apply these to a large-scale study of remote mapping activities coordinated by the Humanitarian OpenStreetMap Team, a global volunteer effort with thousands of contributors. Our study shows that different coordination practices can have a marked impact on contributor retention, and that complex task designs can be a deterrent for certain contributor groups. We close by providing recommendations about how to build and sustain volunteer capacity in these and comparable crowdsourcing systems.