ArticlePDF Available

Crowdsourcing architectural beauty: Online photo frequency predicts building aesthetic ratings

PLOS
PLOS One
Authors:

Abstract and Figures

The aesthetic quality of the built environment is of paramount importance to the quality of life of an increasingly urbanizing population. However, a lack of data has hindered the development of comprehensive measures of perceived architectural beauty. In this paper, we demonstrate that the local frequency of geotagged photos posted by internet users in two photo-sharing websites strongly predict the beauty ratings of buildings. We conduct an independent beauty survey with respondents rating proprietary stock photos of 1,000 buildings across the United States. Buildings with higher ratings were found more likely to be geotagged with user-uploaded photos in both Google Maps and Flickr. This correlation also holds for the beauty rankings of raters who seldom upload materials to the internet. Objective architectural characteristics that predict higher average beauty ratings of buildings also positively covary with their internet photo frequency. These results validate the use of localized user-generated image uploads in photo-sharing sites to measure the aesthetic appeal of the urban environment in the study of architecture, real estate, urbanism, planning, and environmental psychology.
This content is subject to copyright.
RESEARCH ARTICLE
Crowdsourcing architectural beauty: Online
photo frequency predicts building aesthetic
ratings
Albert Saiz
1
, Arianna Salazar
1
*, James Bernard
2
1Urban Studies and Planning Department, Massachusetts Institute of Technology, Cambridge,
Massachusetts, United States of America, 2Economics Department, Brown University, Providence, Rhode
Island, United States of America
*ariana@mit.edu
Abstract
The aesthetic quality of the built environment is of paramount importance to the quality of life
of an increasingly urbanizing population. However, a lack of data has hindered the develop-
ment of comprehensive measures of perceived architectural beauty. In this paper, we dem-
onstrate that the local frequency of geotagged photos posted by internet users in two photo-
sharing websites strongly predict the beauty ratings of buildings. We conduct an indepen-
dent beauty survey with respondents rating proprietary stock photos of 1,000 buildings
across the United States. Buildings with higher ratings were found more likely to be geo-
tagged with user-uploaded photos in both Google Maps and Flickr. This correlation also
holds for the beauty rankings of raters who seldom upload materials to the internet. Objec-
tive architectural characteristics that predict higher average beauty ratings of buildings also
positively covary with their internet photo frequency. These results validate the use of local-
ized user-generated image uploads in photo-sharing sites to measure the aesthetic appeal
of the urban environment in the study of architecture, real estate, urbanism, planning, and
environmental psychology.
Introduction
Our ability to understand the effect of a city’s environment on social outcomes has been lim-
ited by a lack of data on the physical perception of urban space, such as the subjective beauty
of buildings, the appearance of neighborhoods, and the emotional cues that these convey. New
data tools make it increasingly possible to measure these perceptual attributes, which may be
useful for urban planners and scientists looking to explain the social consequences of urban
development.
In this paper, we validate the use of large amounts of geotagged user-generated images to
measure the average subjective beauty of buildings in a scalable, comparable, and systematic
way. We use the geographic location of all images obtained from two different photo-sharing
websites (Flickr
1
and Panoramio
1
), and we estimate the frequency of photos posted in the
vicinity of each building in a catalog of 206,216 properties in the United States. To validate
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 1 / 15
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Saiz A, Salazar A, Bernard J (2018)
Crowdsourcing architectural beauty: Online photo
frequency predicts building aesthetic ratings. PLoS
ONE 13(7): e0194369. https://doi.org/10.1371/
journal.pone.0194369
Editor: Tobias Preis, University of Warwick,
UNITED KINGDOM
Received: April 27, 2017
Accepted: March 1, 2018
Published: July 25, 2018
Copyright: ©2018 Saiz et al. This is an open
access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: All relevant data are
available from the Harvard Dataverse at the
following DOI: 10.7910/DVN/OQVBDF.
Funding: The author(s) received no specific
funding for this work.
Competing interests: The authors have declared
that no competing interests exist.
online posting frequency as a metric for building beauty, we conducted a survey where 605
respondents ranked a subsample of 1,000 buildings according to their perceived beauty using
separate professional stock photos. We show that all of our measures of localized internet
image uploads correlate strongly and positively with building beauty. The relationship holds
across sub-groups of respondents who have different demographic characteristics. Both image
uploads and building beauty measurements display similarly-signed covariances with groups
of observable building characteristics, such as height, age, and architectural style. Importantly,
the conditional correlation between image uploads and building beauty vanishes for photos
taken more than 50 meters from the building of interest, which negates alternative explana-
tions based on spurious or contextual effects.
Our research informs a growing body of literature that explores the connection between
social science, urban form, architecture, and city planning. Prior research has focused on how
the built environment impacts outcomes such as the quality of life, psychological well-being,
and social connections. For example, [1] hypothesized that urban design could influence
the level of crime by creating “defensible space.” An extensive literature in environmental psy-
chology has demonstrated strong connections between the visual appearance of a city’s neigh-
borhoods and the behavior and health of its citizens (e.g., [2]). [3], for example, show that
individuals exposed to more scenic environments report better health across urban, suburban,
and rural areas. In fact, [4] find that models including crowdsourced data from Flickr and
OpenStreetMap can generate more accurate estimates of “scenicness” than models that con-
sider only basic census measurements. Our paper focuses on building-level characteristics and
complements existing evidence that has focused on area-level attributes.
Efforts to quantify neighborhood appearance in architecture have used visual perception
surveys, in which people were asked to rate or compare images. These methods have been
used to create perceptual maps of cities ([5]; [6]; [7]). However, the manual nature of tradi-
tional data collection processes meant that they could not be deployed over large geographical
areas or different time periods. Therefore, the recent data explosion can help advance our
understanding of urban perception. Research by [8] shows how user ratings of online images
can be combined with machine-learning methods to measure perceptions of street safety.
Complementary research by [9] opens up further avenues for generating aesthetics estimates
by using deep learning techniques to generate ratings for photographs of the built environ-
ment that survey participants have not previously rated.
A number of papers have previously used existing internet image geolocations to study pat-
terns of city attractiveness. For example, [10] compare statistics of the spatial distribution of
user-uploaded photos across ten cities, showing that European cities have more centralized
patterns of photo uploads. [11] use unsupervised methods to extract representative views and
images for landmarks. [12] show that tourists and native city-dwellers display similar photo
uploading patterns and that tourists display strong seasonality, and [13] uses photo upload fre-
quencies to identify high-amenity areas in central Berlin and London. Other researchers have
analyzed user behavior on photo sharing services. [14], for example, identify a broad range of
motivational factors that are associated with participation in online photo-sharing communi-
ties, and [15] show that web applications, such as Flickr, can serve as reliable, universal sources
of spatial content.
Our research also complements papers that have used online user-generated content to
extract time-series data about consumer behavior ([16]), health ([17]; [18]), or finance ([19]),
or to obtain cross-sectional socioeconomic data ([20]). A growing literature in urban tomogra-
phy ([21]) is demonstrating that adding geographical identification to such methods can
improve research and practice in urban planning, urban sciences, environmental science or
psychology, and architecture. For example, [22] shows the conditions under which user-
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 2 / 15
generated opinions can be deemed reliable for planning decisions. [23] and [24] show how the
local frequency of online Flickr photo tags –short texts accompanying the images– can be used
ingeniously to describe relevant qualitative attributes of urban environments as perceived by
posters.
Our results indicate that the upload frequency of images can be used to create scalable
quantitative measures of aesthetic perception of specifically-targeted buildings. The localized
frequency of image uploads provides a readily available beauty metric that can be useful for
practitioners and researchers seeking to link urban perception with other social, political,
economic, and cultural aspects of cities. Moreover, we also examine the relative impact of
architectural styles on building beauty, thus informing our understanding of the urban
environment.
The rest of the paper is organized as follows. In section 2 we construct the image upload
measurements using two different photo-sharing websites. Section 3 turns to our building sur-
vey structure, building beauty measurements for the survey subsample, and the validation of
image uploads as an alternate beauty proxy. Section 4 concludes and suggests future research
directions and applications.
Localized internet photo frequencies
Measurement
We first obtained data from Flickr and Panoramio (the source of the images in Google Maps
1
until November 4, 2016). These photo-sharing websites contain millions of geotagged photos
contributed by people from all around the world. We used the Flickr Application Program-
ming Interface (API) to download all of the approximately 13 million open-access photos that
were geotagged by users at the maximum accuracy level in the United States between February
1, 2004, and January 1, 2014. The maximum accuracy corresponds to photos which already
contain GPS coordinates from the camera or to those where the user has presumably zoomed
into the relevant street in order to pin down the photo to its location. To ensure consistency of
results, we followed a similar collection process for Panoramio at two separate points in time.
We thus obtained information on approximately 800,000 photos posted in 2011 (prior to our
survey) and approximately 3 million photos posted in 2014 (after we completed our survey but
before the data analysis).
To count the number of photos in the vicinity of each building, we took the following steps.
First, we combined both Panoramio and Flickr data with a proprietary dataset produced by
Emporis
1
, which contains information on the exact location and characteristics (such as
height, age, and architectural style) of 206,216 major buildings across the United States. Sec-
ond, we used GIS software to assign the user-generated images to each building on the basis of
the distance between their spatial location. Third, we counted the number of photos within
each annuli (two-dimensional “donuts”) of different lengths around each building in our sam-
ple: starting at 0–50 meters, 50 to 100 meters, 100 to 250 meters, and ending at 250 to 500
meters. For robustness, we also counted the number of pictures using 10-meter intervals that
started from 0 to 10 meters, and continued up to 490 to 500 meters. We refer to these several
measures as “Flickr or Panoramio image uploads”.
Table 1 summarizes the image upload measures. We focus on the number of pictures taken
within 50 meters of a building in Panoramio, which corresponds to the main distance used in
the empirical analysis. Column 1 shows that on average, in 2011, there were 0.242 pictures
taken within 50 meters of the buildings in this sample (standard deviation = 0.652). This num-
ber rose to 0.612 in 2014 (standard deviation = 4.15), which presumably reflects the increased
access to technology and social media in recent years. As expected, the distribution is very
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 3 / 15
skewed: 87 percent of the buildings had no image assignment in 2014. The Flickr data up until
2014 shows an average of 2.070 pictures posted within 50 meters of the buildings in our sam-
ple, with a large standard deviation of 46.25. Column 2 repeats the exercise for the subset of
999 buildings that were effectively rated by our sample respondents. One of the photos in our
original survey sample of 1000 buildings was never randomly assigned to any rater; thus, our
usable sample consists of 999 buildings. In columns 3 through 5 we separate the sample of
buildings into those that have no pictures uploaded to them and those that have one or more
photos in the 2011 vintage of Panoramio. This breakdown shows the strong correlation
between image uploads in both Panoramio vintages and Flickr (2014).
Fig 1 plots the relationship between the number of photos uploaded to the two Panoramio
vintages and Flickr. In Fig 1(a) we group buildings on the horizontal axis on the basis of the
number of photos uploaded within 50 meters of them on Panoramio in 2011. Within each
x-axis bin, we then calculate the average number of photos assigned by the 2014 vintage of
Panoramio to the same buildings, which we plot in the y-axis. The pattern is consistent with
photos that accumulate at an approximately proportional percentage rate, even though
approximately 10 percent of the photos uploaded in 2011 were removed from the site by users
in 2014. Buildings that had zero photos in 2011 had smaller chances of attracting large num-
bers of image uploads in 2014. Fig 1(b) plots the average number of photos uploaded to Flickr
(vertical axis) by groups of buildings sorted according to the number of photos assigned using
Panoramio’s 2014 vintage. Reassuringly, image upload measures are strongly correlated over
time and across social media sites.
To illustrate the spatial distribution of our intended proxy for beauty value, we map the fre-
quency of photos within a 50 meter radius of each building (image uploads) in the Emporis sam-
ple in four selected cities. As shown in Fig 2, despite a tendency for metropolitan centers and
major cities to reveal overall higher number of uploaded photos, our maps show that less touristic
Table 1. Summary statistics.
BUILDINGS BY NUMBER OF IMAGE UPLOADS IN PANORAMIO 2011
All Buildings Buildings with no photos Buildings with photos
N= 206,216 N= 999 N= 173,146 N= 607 N= 33,070 N= 392
Panel A. Image uploads
Panoramio image uploads within 50 meters (2014) 0.612
[4.156]
1.942
[5.813]
0.284
[2.634]
0.873
[4.348]
2.329
[8.240]
3.609
[7.249]
Panoramio image uploads within 50 meters (2011) 0.242
[0.652]
0.622
[0.970]
0.000
[0.000]
0.000
[0.000]
1.507
[0.862]
1.594
[0.928]
Flickr image uploads within 50 meters 2.070
[46.250]
3.751
[14.033]
1.585
[49.085]
2.873
[14.456]
4.610
[26.773]
5.121
[13.250]
Mean survey score 5.343
[1.083]
5.219
[1.090]
5.536
[1.045]
Panel B. Other Covariates
Building year 1954.57
[41.72]
1959.53
[70.21]
1959.04
[41.17]
1961.59
[85.43]
1933.69
[37.74]
1956.32
[35.02]
Building height 86.55
[88.30]
198.27
[133.51]
76.12
[74.40]
174.20
[113.24]
145.81
[128.53]
235.37
[152.70]
Notes: The table presents the sample means and standard deviation (in brackets) of the main variables used in our empirical analysis. Panel A summarizes image upload
variables; Panel B summarizes other covariates. The first column summarizes the variables for our sample of 206,216 buildings; the second column shows the same
statistics for the subsample of 999 that were rated by our survey respondents; the remaining columns present the summary statistics separately for buildings that had
online images associated to them and those that were not geotagged with online pictures in the 2011 vintage of Panoramio (alternating again full and survey
subsamples). Note that not all building characteristics are available for the whole sample.
https://doi.org/10.1371/journal.pone.0194369.t001
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 4 / 15
cities, such as Sacramento and Dallas, also encounter a relatively high frequency of uploaded
photos. Moreover, there is considerable variation in the distribution of photos within cities. S2
Fig in the appendix section maps the wide distribution of photo uploads throughout the U.S.
Having described how we created the frequency of image upload measurements for each
building, we turn to the validation of such measurements as proxies for building beauty.
Validation
Many factors may affect the propensity of internet users to upload photos. Some buildings
acquire iconic status, regardless of their potential aesthetic appeal to observers who are
Fig 1. Relational scatter plots of the average number of photos within 50 meters 138uploaded to the two Panoramio data vintages (left panel)
and Panoramio and Flickr 139 (right panel).
https://doi.org/10.1371/journal.pone.0194369.g001
Fig 2. 2014 image uploads within 50 meters of every building in four selected cities.
https://doi.org/10.1371/journal.pone.0194369.g002
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 5 / 15
unaware of their social importance. Others happen to be in high-traffic (e.g., touristic) areas or
cities. Finally, internet users may upload photos for idiosyncratic reasons. Such reasons, which
could amount to random noise in the data, need not invalidate the use of photo frequencies as
proxies for average subjective beauty ratings. However, one can think of scenarios where: i)
the noise to signal ratio is too high for image uploads to be practical; ii) idiosyncratic noise is
correlated negatively with the latent variable of interest (e.g., if sharing photos of the most aes-
thetically unpleasant buildings was more entertaining to web users); iii) the aesthetic tastes of
people actively sharing online content are substantially different from those of the public at
large. Therefore, the use of geotagged image frequencies as proxies for the human appreciation
of urban environmental features needs to be validated. To do this, we conducted a survey of
the perception of beauty for a sub-sample of 1,000 buildings. The study (Protocol number:
1301005482) was reviewed and approved by the Committee on the Use of Humans as Experi-
mental Subjects (COUHES) at MIT. We randomly selected buildings in the Emporis database
in a way designed to oversample those tagged with at least one internet upload using the 2011
Panoramio vintage. Out of the initial 206,216 buildings, we settled on a sample size of 1,000,
and randomly targeted to select 40 percent of the observations within the subsample of 33,070
buildings with any geotagged photo in 2011. Conversely, we targeted for the random selection
of about 600 observations from the rest of our edifice population. Having a picture posted was
a relatively rare occurrence; consequently, a powered research design required oversampling.
In addition, the design allows for enough variance to trace the relationship between the num-
ber of photos taken for each building—not just whether a building has photos or not— and
our survey-based measure of beauty. One of the randomly selected buildings happened to be
the Empire State in New York City (an obviously iconic building with an outsized number of
posted photos), for which we collected survey data, but which we have removed from the sam-
ple as an outlier. In a similar manner, we removed another two iconic buildings that were also
found to be outliers (without substantial change to key results).
One could be concerned that a selection of images with more artistic flair might be more
likely to be uploaded by internet users, and receive higher ratings independent of their intrin-
sic architectural value. Therefore, the stock images used in the survey were taken by profes-
sional photographers as provided by Emporis. 605 respondents ranked these buildings
according to how beautiful they perceived them. The survey service was provided by Qual-
trics
1
, conducted online, and attracted respondents from the 50 states in the U.S. and the
District of Columbia, of which half reported living in metropolitan areas. Other collected
demographic information included level of education, race/ethnicity, age, gender, and likeli-
hood of posting public content on the internet. It is worth noting that average survey responses
don’t vary significantly by age, gender, or location.
In the first stage of our survey, respondents were shown stock photographs of five distinct
buildings (chosen randomly from three initial sets of five photos). They were then given up to
30 seconds to consider this set of photos side-by-side, which allowed them to get an idea of the
range of the buildings that would be shown later in the survey. Providing these initial photo
sets also allows us to have some consistency across respondents concerning the initial stimuli
to which they were exposed, and it also makes it possible to test for potential statistical differ-
ences across starting frames, to which there were none. During the next 10 minutes, after see-
ing the five initial photographs side-by-side, respondents were shown a random sequence of
single images of up to 1,000 buildings. To avoid framing effects [25], we did not provide any
additional context to the photos. Respondents ranked each photo shown to them on a scale of
1 (ugliest) to 10 (most beautiful). After they had entered each image’s valuation or after 30 sec-
onds (whichever occurred first), another photo appeared. We allowed participants to rank as
many photos as possible during the survey’s 10-minute duration. To measure the consistency
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 6 / 15
of respondents, two of the five initial practice photographs were shown to respondents again,
which ensured that these pictures were ranked twice by each respondent.
Respondents provided an average of 107.76 image evaluations, rating one photo each 5.5
seconds. The maximum number of rated pictures provided by a single user was 365. Scientists
still do not have a full understanding of the neural, affective, and cultural processes behind
subjective beauty assessments [26]. However, the pace taken by even the fastest rater ensures
that the two different types of brain networks involved in such judgments were likely engaged,
since a 1.5 second interval is sufficient to activate both of them [27]. On the other hand, our
design did not allow for very careful reflections on each image, since it has been shown that
longer looking times are unlikely to change reported aesthetic preferences [28]. The mean sur-
vey score for ranked buildings was 5.342 and the standard deviation was 1.086.
To study whether buildings with higher image uploads in our survey were also considered
more beautiful by respondents, we estimate the following baseline regression model:
Scorebi ¼aiþbImageUploadsbþεbð1Þ
Here Score
bi
is the score that survey respondents iassigned to building b.ImageUploads
b
is
one of the measurements of the number of internet user-uploaded pictures geotagged at differ-
ent distances from building b, and βis the coefficient of interest, which captures whether these
measures are related to the subjective survey scores. Finally, ε
b
is an error term. Since image
uploads are building specific, we cluster standard errors at the building level.
The first panel in Table 2 displays OLS estimates of Eq (1) using 2014 Panoramio as the
image uploads measure. Column 1 presents results using internet pictures geotagged within 50
meters of a given building; it includes no other controls or rater fixed effects. Our estimate
implies that for every ten additional pictures, the average score obtained by a given building in
our survey increased by 0.435 (standard error of the estimate = 0.103). Column 2 presents the
most parsimonious specification with fixed effects for each of the 605 raters. Survey respon-
dent dummies take account of systematic differences across individuals or in the environments
under which the surveys were conducted. This specification also shows that for every ten addi-
tional pictures posted online, the average score obtained by a given building in our survey
increased by 0.443 (standard error of the estimate = 0.102). To avoid framing effects, in col-
umn 3 we also control for a full set of dummies that capture the order in which a particular
building appeared in each respondent’s session. In line with the random ordering of buildings
in the survey, these controls have little impact on our estimates. In column 4 we address poten-
tial respondent inconsistency. By design, respondents were required to rank exactly 2 of the
buildings twice during their session. For each respondent we compute the average absolute dif-
ference between rankings given to the same buildings. We then use the inverse of one plus the
average differences to reweigh observations in the regressions. This procedure grants less
weight to respondents who behaved inconsistently. Reassuringly, results remain unchanged.
In column 5 we augment our baseline specification by adding the number of pictures taken
within 50-100, 100-250, and 250- 500 meters as controls. Estimates show that only pictures
taken within 50 meters of a building can marginally predict the survey’s building beauty. The
coefficients for the number of pictures taken further away are small, are close to zero, and are
not statistically significant. These results suggest that image uploads in previous specifications
were not proxying for broader neighborhood and regional differences but instead captured
very localized effects.
Fig 3 elaborates on these findings graphically using data from the 2014 vintage of Panora-
mio. In particular, we estimate similar models to those in the last column of Table 2, but allow
pictures taken within each 10-meter range to take on a different coefficient. We then plot these
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 7 / 15
Table 2. ESTIMATES OF THE RELATIONSHIP BETWEEN IMAGE UPLOADS AND BUILDING BEAUTY: OLS AND PCF ESTIMATES.
DEPENDENT VARIABLE: AVERAGE SURVEY SCORE
(1) (2) (3) (4) (5)
I. Panoramio photo uploads (2014)
Tens of photos within 0–50 meters 0.435
(0.103)
0.443
(0.102)
0.442
(0.099)
0.464
(0.104)
0.290
(0.105)
Tens of photos within 50-100 meters 0.066
(0.056)
Tens of photos within 100-250 meters 0.014
(0.011)
Tens of photos within 250–500 meters -0.003
(0.005)
Observations 65021 65021 65021 64507 65021
Clusters 996 996 996 996 996
R-squared 0.01 0.29 0.30 0.32 0.30
II. Flickr photo uploads
Tens of photos within 0–50 meters 0.125
(0.037)
0.123
(0.037)
0.123
(0.036)
0.123
(0.039)
0.107
(0.041)
Tens of photos within 50-100 meters 0.015
(0.014)
Tens of photos within 100-250 meters 0.001
(0.003)
Tens of photos within 250–500 meters -0.000
(0.003)
Observations 65021 65021 65021 64507 65021
Clusters 996 996 996 996 996
R-squared 0.00 0.29 0.30 0.31 0.30
III. Pcf photo uploads
Tens of photos within 0–50 meters 0.251
(0.048)
0.252
(0.048)
0.252
(0.046)
0.258
(0.049)
0.199
(0.055)
Tens of photos within 50-100 meters 0.053
(0.060)
Tens of photos within 100-250 meters 0.057
(0.052)
Tens of photos within 250–500 meters -0.028
(0.077)
Observations 65021 65021 65021 64507 65021
Clusters 996 996 996 996 996
R-squared 0.01 0.29 0.30 0.32 0.30
Covariates and weighting:
Rater effects
Photo order effects
Weighting by consistency (dif)
Notes: The number of photos within each annuli is shown in tens. The top two panels present OLS estimates of the relationship between image uploads and building
beauty; the bottom panel presents PCF (principal Component Factor) estimates constructed using the common variation in Flickr and Panoramio 2014. The left-hand
side variable is building beauty and the main explanatory variable image uploads. Observations are building and rater specific. Each column presents a different
specification, and the bottom rows describe the covariates in each model. Below each of our estimates and in parentheses, we report standard errors that are robust to
heteroskedasticity and clustered at the building level.
 denotes a coefficient significant at the 1% level,
 at the 5% level, and
at the 10% level.
https://doi.org/10.1371/journal.pone.0194369.t002
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 8 / 15
coefficients and their confidence intervals, together with a smoothed moving average line and
linear and quadratic fits. The left panel shows a marked decay in the information conveyed by
pictures taken further away from a given building. Only the groups of pictures taken within 60
meters of a building predict its average score in our survey; while the remaining pictures have
essentially zero additional predictive power. The right panel zooms in to illustrate the marked
decay. S1 Fig in the online appendix displays a similar pattern using the Flickr data.
Panel B in Table 2 presents an analogous exercise to Panel A but uses Flickr photo uploads
as the outcome of interest. Overall, we find a similar pattern as the one uncovered for photos
uploaded to Panoramio. Column 3 shows that for every ten additional pictures, the average
score obtained by a given building in our survey increases by 0.123 (standard error of the esti-
mate = 0.036) and is statistically significant at the 1 percent level.
To exploit the common variation in Flickr and Panoramio, in Panel C we compute their
principal component factor and use it as an alternative measure of image uploads. Column 3
shows that for an additional standard deviation in the number of photo uploads, the average
score obtained by a given building in our survey increases by 0.252, which constitutes a sub-
stantial one-fifth of the standard deviation in building beauty across images.
Coincident Covariation with Other Correlates of building beauty
Our previous results establish that buildings with higher image uploads also display stronger
building beauty. We now explore whether observable physical characteristics of buildings
Fig 3. Estimates survey score marginal gains from pictures in range.
https://doi.org/10.1371/journal.pone.0194369.g003
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 9 / 15
drive part of this relationship. That is, do taller or more modern buildings consistently display
higher building beauty? If so, for image upload frequency to be a good proxy, it should yield
similar correlations. Alternatively, image uploads could capture unobserved dimensions of
beauty beyond the measurable characteristics that are known to have an aesthetic impact.
We hence use the most commonly available building attributes in the Emporis database
(height, year of construction, and dummies for architectural styles) as independent variables
in an OLS regression in which we explain a building’s beauty as a linear function of these char-
acteristics. This specification is available in Table Coincident Covariation with Other Corre-
lates of building beauty of the online appendix section. The number of observations here
corresponds to the number of buildings. Using the estimated coefficients of this regression, we
separate building beauty as the sum of the component “predicted” by this OLS model and its
“residual” component. In Table 3, column 1, we show results from another OLS regression
using Panoramio image uploads 2014 as the dependent variable and the “predicted” compo-
nent of building beauty as the main explanatory regressor. Reassuringly, a linear combination
of building characteristics that are associated with higher building beauty is also a predictor of
image uploads. Therefore, the image upload and building beauty metrics are not only corre-
lated with each other; but they also covary in the same direction with observables, which is to
be expected if they are capturing a similar phenomenon. Nevertheless, in column 2, we see
that “residual” building beauty also correlates positively with image uploads, which suggests
that the proxy contains information about beauty above and beyond that captured by the
observed building characteristics that we were able to include in the model. For robustness, in
columns 3 and 4 we also present negative binomial models that account for the discrete nature
of the dependent variable. Reassuringly, we find similar results.
Image uploads predicts building beauty for demographic groups not active
online
One potential concern of using image uploads as proxies for aesthetic value is that these mea-
sures could be capturing preferences exclusive to the population uploading content in photo-
Table 3. COVARIANCE BETWEEN THE PREDICTED AND RESIDUAL COMPONENTS OF BUILDING BEAUTY AND IMAGE UPLOADS: OLS AND NEGATIVE BINOMIAL ESTIMATES.
DEPENDENT VARIABLE: 2014 PANORAMIO UPLOADS (50 METERS)
OLS NEGATIVE BINOMIAL
(1) (2) (3) (4)
observable beauty 1.646
(0.418)
1.737
(0.423)
0.842
(0.146)
0.855
(0.135)
unobservable beauty 0.944
(0.256)
0.389
(0.082)
Observations 976 976 976 976
Clusters
R-squared 0.03 0.05
Notes: The table presents OLS and Negative Binomial estimates of regressions with Panoramio 2014 50-meter image uploads as the dependent variable. Independent
variables are what we denominate “predicted” and “residual” beauty. Predicted beauty results from a linear combination of building characteristics (height, year of
construction, and dummies for 26 architectural style types) fitted from OLS regression explaining building beauty. Residual beauty are the residuals from an OLS
regression of building characteristics on building beauty. Each column presents a different specification. Below each of our estimates and in parentheses, we report
standard errors that are robust to heteroskedasticity and clustered at the building level.
 denotes a coefficient significant at the 1% level,
 at the 5% level, and
at the 10% level.
https://doi.org/10.1371/journal.pone.0194369.t003
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 10 / 15
sharing websites. For instance, users of social media sites tend to be younger and more likely
to live in urbanized areas. If planning boards used the covariates of image uploads to classify
buildings “of interest”, such decisions might rely on a relatively narrow view of aesthetic
quality. To address this concern, we asked survey respondents whether they had regularly
uploaded content online. We now use this binary qualitative variable to estimate the following
OLS model:
Scorebi ¼aiþbNImageUploadsbNonposteri
þbPImageUploadsbPosteriþεbi:ð2Þ
Here, Poster
i
and Nonposter
i
are dummy variables that indicate, respectively, whether
respondent ireports regularly posting content online or not. 54.56 percent of the people in our
sample are posters.
Columns 1 and 3 of Table 4 present the results of explaining building beauty using both
Panoramio and -separately- Flickr image uploads interacted with the rater’s poster status.
Again, standard errors are clustered at the building level. The coefficients for posters and non-
posters in columns 1 and 3 are similar, which suggests that the frequency of image uploads
and perception of beauty do not differ significantly for people with different posting habits. Of
course, poster status is not a very fine-tuned variable for capturing differences across groups
Table 4. ESTIMATES BY DEMOGRAPHIC GROUPS,PANORAMIO 2014 AND FLICKR: OLS ESTIMATES.
DEPENDENT VARIABLE: AVERAGE SURVEY SCORE
PHOTOS FROM PANORAMIO 2014 PHOTOS FROM FLICKR
BY POSTER STATUS BY LIKELIHOOD OF POSTING BY POSTER STATUS BY LIKELIHOOD OF POSTING
(1) (2) (3) (4)
photos x non-posters 0.440
(0.099)
0.133
(0.032)
photos x posters 0.445
(0.102)
0.117
(0.040)
photos x quartile 1 0.442
(0.103)
0.154
(0.036)
photos x quartile 2 0.515
(0.117)
0.158
(0.039)
photos x quartile 3 0.476
(0.101)
0.117
(0.041)
photos x quartile 4 0.358
(0.092)
0.073
(0.030)
Observations 64577 63251 64577 63251
Clusters 996 996 996 996
R-squared 0.30 0.29 0.29 0.29
Covariates:
Rater effects
Photo order effects
Notes: The table presents OLS estimates of the demographic groups using Panoramio 2014 and Flickr. Each column presents a different specification, and the bottom
rows describe the covariates and sample restrictions on each model. The Posters is an indicator for persons that responded “yes” to posting public content on the
Internet for other people to use. Columns 1 and 3 present our results by poster status for Panoramio and Flickr, respectively; while columns 2 and 4 present our results
by likelihood of posting. Below each of our estimates and in parentheses, we report standard errors that are robust against heteroskedasticity and clustered on buildings.
 denotes a coefficient significant at the 1% level,
 at the 5% level, and
at the 10% level.
https://doi.org/10.1371/journal.pone.0194369.t004
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 11 / 15
that have different online behaviors. Therefore, we next estimate a logit model that predicts
the individual likelihood of posting content online based on demographic characteristics:
gender, education, race, a dummy for metropolitan status, and age. We then estimate Eq [2],
this time interacting image uploads with dummies for each of the quartiles of predicted poster
likelihood (i.e., quartiles of the propensity score from the logit estimation) (ImageUploads
b
).
Quartile 4 comprises the 25 percent of the survey respondents who have the highest estimated
propensity score. Columns 2 and 4 show these estimates. In line with our previous results, the
estimated coefficients for respondents in different quartiles are all similar and do not differ sta-
tistically from one another. This pattern suggests that image upload measures, crowdsourced
exclusively from internet-posting users, are equivalently good predictors of building beauty for
individuals who post less or no contents online.
Discussion
Millions of internet users post content online. A growing scientific literature makes use of
such online behavior to extract information about social trends. Here we have focused on web-
sites that allow users to share images about the built environment. We have shown that the fre-
quency at which photos are geotagged around a building predicts its average subjective beauty
ratings. To validate this correlation we run a rating experiment with professional stock photos
that are separate from online content.
The partial correlation between building beauty ratings and the frequency of image uploads
posted around its geo-coordinates is strongest for images that are geotagged within 50 meters
of a building’s address. Flickr users typically download the coordinates of the location where
the photo was taken from their smart-phones, whereas Panoramio’s application makes it easier
for users to locate the photo by identifying the feature at which the camera was pointing. Empir-
ically, however, a 0–50 meter radius better captures image uploads as a proxy for building
beauty for both data sets. Interesting differences between the two data sources in this regard are
uncovered in the SI section.
The results validate the use of very localized image upload frequencies as an index to quan-
tify architectural beauty for applications in architecture, urban planning, urban sciences, social
psychology, and environmental studies. They therefore complement the existing literature on
the use of online, geotagged user-generated content in those contexts (e.g. [21,23,24]).
Several current limitations of using image uploads as building beauty proxies are worth
noting: i) The spatial distribution of uploaded photos is very skewed, with a large number of
buildings in our sample of relatively notable ones displaying zero photos; image upload is
therefore more likely to capture the right tail of the beauty distribution; ii) while predictive
of average beauty survey ratings, most of the variance in subjective aesthetic evaluations
between individuals and buildings remains to be explained; the use of image uploads is there-
fore not suitable for assessing or comparing specific buildings; it should therefore be con-
strained to statistical applications where the laws of large numbers apply; iii) if/as image
upload measures become more popular, prospective use may be contaminated by strategic
image uploads made to influence such measures; iv) other factors aside from beauty are also
likely to impact image uploads; for instance, high pedestrian traffic or touristic areas are
more likely to be pictured; note that such factors were made irrelevant in our research design
due to the randomization of buildings and respondents in the survey, the anonymization of
contextual building information (we did not provide information about city, neighborhood,
or building names), and the focus on building beauty ratings for the buildings as such; never-
theless, confounding correlates of image uploads above and beyond building beauty will cer-
tainly be an issue in observational studies; good applications of image uploads as proxies for
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 12 / 15
building beauty will therefore account for potential confounders (e.g., controlling for pedes-
trian traffic in a regression that tries to explain retail sales in buildings or city areas as a func-
tion of environmental image uploads).
The frequency of images assigned to each building using user volunteered geographic infor-
mation will necessarily be a noisy measure of the actual images that refer to the specific build-
ings. This noise introduces a downward bias to our estimates of the relationship between
image uploads and building beauty. In our particular setting this may be less of a problem
because, despite the noise in our measures, the number of image uploads presumably preserves
the relative ranking of buildings with respect to beauty. While we discuss this issue in more
detail in the appendix, the fact that we obtain precise and positive estimates of the relationship
between image uploads and building beauty suggests that there is enough signal in our mea-
sure to be of practical use to researchers.
On the other hand, some interesting issues are worth studying further: i) the issue of other
potential correlates of image uploads is of intrinsic interest because it may lead to a better
understanding of the behavioral patterns of internet users beyond their assessment of environ-
mental beauty; ii) many applications of image uploads (like the extant [13] or [9]) use photo
frequencies within larger geographical areas; it will be interesting to study the importance of
contextual effects: is the sum of individual image uploads of buildings a worse or better statistic
for the building beauty of larger areas? In order to better understand image uploads and build-
ing beauty dynamics across clusters, researchers could conduct similar rating surveys of urban
environments using geographically-clustered images.
Supporting information
S1 Fig. Estimated survey score marginal gains from pictures in range (Flickr).
(TIF)
S2 Fig. Distribution of photo uploads throughout the U.S.
(TIF)
S1 File. Supplemental information.
(PDF)
Author Contributions
Conceptualization: Albert Saiz, Arianna Salazar.
Data curation: Arianna Salazar.
Formal analysis: Arianna Salazar.
Methodology: Albert Saiz, Arianna Salazar.
Project administration: Albert Saiz, James Bernard.
Resources: James Bernard.
Visualization: Arianna Salazar.
Writing original draft: Arianna Salazar.
Writing review & editing: Albert Saiz, Arianna Salazar.
References
1. Newman O. Defensible space: crime prevention through urban space. Collier Books; 1973.
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 13 / 15
2. Martin PR, Cheung FM, Knowles MC, Kyrios M, Littlefield L, Overmier JB, et al. IAAP Handbook of
Applied Psychology. Martin PR, Cheung FM, Knowles MC, Kyrios M, Overmier JB, Prieto JM, editors.
Oxford, UK: Wiley-Blackwell; 2011. Available from: http://doi.wiley.com/10.1002/9781444395150.
3. Seresinhe CI, Preis T, Moat HS. Quantifying the Impact of Scenic Environments on Health. Scientific
Reports. 2015; 5:1–9. https://doi.org/10.1038/srep16899
4. Seresinhe CI, Moat HS, Preis T. Quantifying scenic areas using crowdsourced data. Environment and
Planning B: Urban Analytics and City Science. 2017; p. 026581351668730.
5. Lynch K. The image of the city. MIT Press; 1960.
6. Milgram S. Psycholohgical Maps of Paris; 1977.
7. Nasar JL. The Evaluative Image of the City. Journal of the American Planning Association. 1990; 56
(1):41–53. https://doi.org/10.1080/01944369008975742
8. Dubey A, Naik N, Parikh D, Raskar R, Hidalgo CA. Deep learning the city: Quantifying urban perception
at a global scale. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial
Intelligence and Lecture Notes in Bioinformatics). 2016; 9905 LNCS:196–212.
9. Seresinhe CI, Preis T, Moat HS. Using deep learning to quantify the beauty of outdoor places. Royal
Society Open Science. 2017; 4(7):170170. https://doi.org/10.1098/rsos.170170 PMID: 28791142
10. Paldino S, Kondor D, Bojic I, Sobolevsky S, Gonza
´lez MC, Ratti C. Uncovering Urban Temporal Pat-
terns from Geo-Tagged Photography. PLOS ONE. 2016; 11(12):1–14. https://doi.org/10.1371/journal.
pone.0165753
11. Kennedy LS, Naaman M. Generating Diverse and Representative Image Search Results for Land-
marks. In: Proceedings of the 17th International Conference on World Wide Web. WWW’08. New York,
NY, USA: ACM; 2008. p. 297–306. Available from: http://doi.acm.org/10.1145/1367497.1367539.
12. Paldino S, Bojic I, Sobolevsky S, Ratti C, Gonza
´lez MC. Urban magnetism through the lens of geo-
tagged photography. EPJ Data Science. 2015; 4(1):5. https://doi.org/10.1140/epjds/s13688-015-
0043-3
13. Ahlfeldt GM. Urbanity. 2013;.
14. Nov O, Naaman M, Ye C. Analysis of Participation in an Online Photo-sharing Community: A Multidi-
mensional Perspective. J Am Soc Inf Sci Technol. 2010; 61(3):555–566.
15. Antoniou V, Morley J, Haklay M. Web 2.0 geotagged photos: Assessing the spatial dimension of the
phenomenon. Geomatica. 2010; 64:99–110.
16. Varian H, Choi H. Predicting the Present with Google Trends. 2012.
17. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidem-
ics using search engine query data. Nature. 2008; 457:1012. https://doi.org/10.1038/nature07634
18. Lazer D, Kennedy R, King G, Vespignani A. The Parable of Google Flu: Traps in Big Data Analysis. Sci-
ence. 2014; 343(6176):1203–1205. https://doi.org/10.1126/science.1248506 PMID: 24626916
19. Antweiler W, Frank MZ. Is All That Talk Just Noise? The Information Content of Internet Stock Message
Boards. The Journal of Finance. 2004; 59(3):1259–1294. https://doi.org/10.1111/j.1540-6261.2004.
00662.x
20. Saiz A, Simonsohn U. PROXYING FOR UNOBSERVABLE VARIABLES WITH INTERNET DOCU-
MENT-FREQUENCY. Journal of the European Economic Association. 2013; 11(1):137–165. https://
doi.org/10.1111/j.1542-4774.2012.01110.x
21. Krieger MH, Ra MR, Paek J, Govindan R, Evans-Cowley J. Urban Tomography. Journal of Urban Tech-
nology. 2010; 17(2):21–36. https://doi.org/10.1080/10630732.2010.515087
22. Brown G. A Review of Sampling Effects and Response Bias in Internet Participatory Mapping (PPGIS/
PGIS/VGI). Transactions in GIS. 2017; 21(1):39–56. https://doi.org/10.1111/tgis.12207
23. Feick R, Robertson C. A multi-scale approach to exploring urban places in geotagged photographs.
Computers, Environment and Urban Systems. 2015; 53:96–109. https://doi.org/10.1016/j.
compenvurbsys.2013.11.006
24. Dunkel A. Research Paper: Visualizing the perceived environment using crowdsourced photo geodata.
Landscape and Urban Planning. 2015; 142(Special Issue: Critical Approaches to Landscape Visualiza-
tion):173–186. https://doi.org/10.1016/j.landurbplan.2015.02.022
25. Silveira S, Fehse K, Vedder A, Elvers K, Hennig-Fast K. Is it the picture or is it the frame? An fMRI study
on the neurobiology of framing effects. Frontiers in Human Neuroscience. 2015; 9:528. https://doi.org/
10.3389/fnhum.2015.00528 PMID: 26528161
26. Nadal M, Munar E, Capo
´MÀ, Rossello
´J, Cela-Conde CJ. Towards a framework for the study of the
neural correlates of aesthetic preference. Spatial Vision. 2008; 21(3):379–396. https://doi.org/10.1163/
156856808784532653. PMID: 18534110
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 14 / 15
27. Cela-Conde CJ, Garcia-Prieto J, Ramasco JJ, Mirasso CR, Bajo R, Munar E, et al. Dynamics of brain
networks in the aesthetic appreciation. Proceedings of the National Academy of Sciences. 2013; 110
(Supplement_2):10454–10461. https://doi.org/10.1073/pnas.1302855110
28. Isham EA, Geng JJ. Looking Time Predicts Choice but Not Aesthetic Value. PLOS ONE. 2013; 8(8):1–
7. https://doi.org/10.1371/journal.pone.0071698
Crowdsourcing architectural beauty
PLOS ONE | https://doi.org/10.1371/journal.pone.0194369 July 25, 2018 15 / 15
... This period also saw research that utilizes VR, eye-trackers, and other physiological measurement devices to evaluate people's reactions to design (Zeile and Resch, 2018;Hollander et al., 2020;Fisher-Gewirtzman, 2019;Wang et al., 2023a;Sakhaei et al., 2023;Chinazzo et al., 2021). Geotagged images and SVI were also used by many studies; for instance, evaluation of soundscape (Zhao et al., 2023), assessment of beauty and color quality in buildings (Saiz et al., 2018;Wan et al., 2022). ...
Article
Full-text available
Visual characteristics of the built environment affect how people perceive and experience cities. For a long time, many studies have examined visual perception in cities. Such efforts have accelerated in recent years due to advancements in technologies and the proliferation of relevant data (e.g., street view imagery, geo-tagged photos, videos, virtual reality, and aerial imagery). There has not been a comprehensive systematic review paper on this topic to reveal an overarching set of research trends, limitations, and future research opportunities. Such omission is plausibly due to the difficulty in reviewing a large number of relevant papers on this popular topic. In this study, we utilized machine learning techniques (i.e., natural language processing and large language models) to semi-automate the review process and reviewed 393 relevant papers. Through the review, we found that these papers can be categorized into the physical aspects of cities: greenery and water, street design, building design, landscape, public space, and the city as a whole. We also revealed that many studies conducted quantitative analyses with a recent trend of increasingly utilizing big data and advanced technologies, such as combinations of street view imagery and deep learning models. Limitations and research gaps were also identified as follows: (1) a limited scope in terms of study areas, sample size, and attributes; (2) low quality of subjective and visual data; and (3) the need for more controlled and sophisticated methods to infer more closely * Corresponding author. examined impacts of visual features on human perceptions. We suggest that future studies utilize and contribute to open data and take advantage of existing data and technologies to examine the causality of visual features on human perception. The approach developed to accelerate this review proved to be accurate, efficient, and insightful. Considering its novelty, we also describe it to enable replications in the future.
... Their work evaluated the effects of physical disorder, such as litter, graffiti, and building conditions, on people's feelings, providing a method to measure the "sense of place" in expansive urban areas. Similarly, Saiz, Salazar, and Bernard (2018) harnessed the widespread sharing of photographs online to gauge how people value the aesthetic aspects of their environment. They demonstrated that street-level imagery could be a scalable tool for measuring subjective attractiveness, enhancing our comprehension of how individuals perceive urban spaces. ...
Article
Full-text available
The visual dimension of cities has been a fundamental subject in urban studies since the pioneering work of late-nineteenth- to mid-twentieth-century scholars such as Camillo Sitte, Kevin Lynch, Rudolf Arnheim, and Jane Jacobs. Several decades later, big data and artificial intelligence (AI) are revolutionizing how people move, sense, and interact with cities. This article reviews the literature on the appearance and function of cities to illustrate how visual information has been used to understand them. A conceptual framework, urban visual intelligence, is introduced to systematically elaborate on how new image data sources and AI techniques are reshaping the way researchers perceive and measure cities, enabling the study of the physical environment and its interactions with the socioeconomic environment at various scales. The article argues that these new approaches would allow researchers to revisit the classic urban theories and themes and potentially help cities create environments that align with human behaviors and aspirations in today’s AI-driven and data-centric era.
... Users who posted pictures consistently for at least one week during this period were considered tourists, while others were considered residents. This approach uses the timing of photos to distinguish between residents and tourists, a method used in previous research studies (Ahlfeldt, 2012;Saiz et al., 2018;Gaigné et al., 2022). ...
Preprint
Full-text available
Tourism is an essential sector of the global economy, contributing significantly to GDP and employment. Despite its importance, our understanding of its impact on urban economic activity remains limited. This paper aims to fill this gap by examining the impact of tourism on urban transformation using a dataset of hotel openings in Madrid from 2001-2010. I show that hotel openings positively impact the number of establishments and employment by using the number of protected buildings as an instrumental variable to account for the non-random distribution of hotel openings. Interestingly, hotel openings contribute to changes in the composition of the economic activities and the business structures, enhancing tourist-oriented corporate-owned businesses over other individual-owned companies. Finally, economic effects extend to the real estate market, increasing rental prices and residential investment.
... In particular, the study explores how street features affect human perceptions and also measures whether the physical disorder of a place (measured using litter, graffiti, and poorly maintained buildings as proxies) has a negative effect on people's feelings, providing an effective tool to evaluate the "sense of place" of large-scale urban areas. Saiz et al. (2018) uses the ubiquitous posting of millions of photographs online to understand how people value the aesthetic dimension of the physical environment. They show that street-level imagery offers a scalable way to measure subjective attractiveness across and within cities, enabling us to build a more comprehensive understanding of how people perceive their surroundings. ...
Preprint
Full-text available
The visual dimension of cities has been a fundamental subject in urban studies, since the pioneering work of scholars such as Sitte, Lynch, Arnheim, and Jacobs. Several decades later, big data and artificial intelligence (AI) are revolutionizing how people move, sense, and interact with cities. This paper reviews the literature on the appearance and function of cities to illustrate how visual information has been used to understand them. A conceptual framework, Urban Visual Intelligence, is introduced to systematically elaborate on how new image data sources and AI techniques are reshaping the way researchers perceive and measure cities, enabling the study of the physical environment and its interactions with socioeconomic environments at various scales. The paper argues that these new approaches enable researchers to revisit the classic urban theories and themes, and potentially help cities create environments that are more in line with human behaviors and aspirations in the digital age.
... It is well known that measuring the quality of amenities is a hard task. In this paper, we use a proxy suggested by Ahlfeldt (2013) and Saiz et al. (2018): the number of outside geocoded pictures taken by residents at a certain location. One key advantage of this LQI is that it lets consumers choose the aesthetic quality of buildings and locations they like best by "voting with their clicks" (Carlino and Saiz, 2019). ...
Article
We develop a new model of a “featureful” city in which locations are differentiated by two attributes, that is, the distance to employment centers and the accessibility to given amenities. The residential equilibrium involves the spatial separation of households sharing similar incomes. Under Stone-Geary preferences, amenities and commuting are subsumed into a location-quality index. Hence, the assignment of households to locations becomes one-dimensional. Since residential choices are driven by the location-quality index, the income mapping may be fully characterized. Using a rich micro-dataset on the Netherlands, we show that household income sorting is indeed driven by amenities and commuting times.
... Google Street View images and photos voluntarily uploaded by individual users to social media and photo-sharing websites and have created new opportunities for planners and designers to evaluate the quality of urban space. Using photos posted on Flickr and Panoramio for hundreds of thousands of properties across the US, researchers found that the local volume of user-contributed, geotagged photos predicted ratings of the built environment's aesthetic quality (Saiz et al., 2018). Computer vision models and artificial intelligence techniques have been applied on images for various tasks related to urban design. ...
... Google Street View images and photos voluntarily uploaded by individual users to social media and photo-sharing websites and have created new opportunities for planners and designers to evaluate the quality of urban space. Using photos posted on Flickr and Panoramio for hundreds of thousands of properties across the US, researchers found that the local volume of user-contributed, geotagged photos predicted ratings of the built environment's aesthetic quality (Saiz et al., 2018). Computer vision models and artificial intelligence techniques have been applied on images for various tasks related to urban design. ...
Preprint
Full-text available
Urban analytics combines spatial analysis, statistics, computer science, and urban planning to understand and shape city futures. While it promises better policymaking insights, concerns exist around its epistemological scope and impacts on privacy, ethics, and social control. This chapter reflects on the history and trajectory of urban analytics as a scholarly and professional discipline. In particular, it considers the direction in which this field is going and whether it improves our collective and individual welfare. It first introduces early theories, models, and deductive methods from which the field originated before shifting toward induction. It then explores urban network analytics that enrich traditional representations of spatial interaction and structure. Next it discusses urban applications of spatiotemporal big data and machine learning. Finally, it argues that privacy and ethical concerns are too often ignored as ubiquitous monitoring and analytics can empower social repression. It concludes with a call for a more critical urban analytics that recognizes its epistemological limits, emphasizes human dignity, and learns from and supports marginalized communities.
... Google Street View images and photos voluntarily uploaded by individual users to social media and photo-sharing websites and have created new opportunities for planners and designers to evaluate the quality of urban space. Using photos posted on Flickr and Panoramio for hundreds of thousands of properties across the US, researchers found that the local volume of user-contributed, geotagged photos predicted ratings of the built environment's aesthetic quality (Saiz et al., 2018). Computer vision models and artificial intelligence techniques have been applied on images for various tasks related to urban design. ...
Preprint
Full-text available
Urban analytics combines spatial analysis, statistics, computer science, and urban planning to understand and shape city futures. While it promises better policymaking insights, concerns exist around its epistemological scope and impacts on privacy, ethics, and social control. This chapter reflects on the history and trajectory of urban analytics as a scholarly and professional discipline. In particular, it considers the direction in which this field is going and whether it improves our collective and individual welfare. It first introduces early theories, models, and deductive methods from which the field originated before shifting toward induction. It then explores urban network analytics that enrich traditional representations of spatial interaction and structure. Next it discusses urban applications of spatiotemporal big data and machine learning. Finally, it argues that privacy and ethical concerns are too often ignored as ubiquitous monitoring and analytics can empower social repression. It concludes with a call for a more critical urban analytics that recognizes its epistemological limits, emphasizes human dignity, and learns from and supports marginalized communities.
Article
Full-text available
Tourism is an essential sector of the global economy, contributing significantly to GDP and employment. Despite its importance, our understanding of its impact on urban economic activity remains limited. This paper aims to fill this gap by examining the impact of tourism on urban transformation using a dataset of hotel openings in Madrid from 2001–2010. I show that hotel openings positively impact the number of establishments and employment by using the number of protected buildings as an instrumental variable to account for the non-random distribution of hotel openings. Interestingly, hotel openings contribute to changes in the composition of the economic activities and the business structures, enhancing tourist-oriented corporate-owned businesses over other individual-owned companies. Finally, economic effects extend to the real estate market, increasing rental prices and residential investment.
Preprint
Urban human-made environments present a range of potential benefits for wellbeing through their design. However, there is a lack of comprehensive organization on this topic. To this end, we performed a scoping review to provide an overview of how the urban environment, particularly its designed components, has been previously studied in relation to aesthetics and wellbeing. A total of 255 articles related to urban aesthetics were identified, of which 122 were also related to wellbeing. The results showed the frequency (most and least studied) and the diversity in the man-made components, aesthetic and wellbeing dimensions studied in relation to the urban environment within two decades. Our review highlights the need for consensus in terminology regarding the distinction between aesthetics and wellbeing concepts, and which terms/measures ought to be implemented more in environmental psychology research. All in all, we provide the basis for (1) researchers from various fields, who can use our findings to plan future studies, and (2) urban planners and designers alike, who can use our review to assess the aesthetic and wellbeing potential of urban elements for their designs; so that they can make our cities more aesthetically pleasing, better for wellbeing, and, fundamentally, better places to live.
Article
Full-text available
Beautiful outdoor locations are protected by governments and have recently been shown to be associated with better health. But what makes an outdoor space beautiful? Does a beautiful outdoor location differ from an outdoor location that is simply natural? Here, we explore whether ratings of over 200 000 images of Great Britain from the online game Scenic-Or-Not, combined with hundreds of image features extracted using the Places Convolutional Neural Network, might help us understand what beautiful outdoor spaces are composed of. We discover that, as well as natural features such as ‘Coast’, ‘Mountain’ and ‘Canal Natural’, man-made structures such as ‘Tower’, ‘Castle’ and ‘Viaduct’ lead to places being considered more scenic. Importantly, while scenes containing ‘Trees’ tend to rate highly, places containing more bland natural green features such as ‘Grass’ and ‘Athletic Fields’ are considered less scenic. We also find that a neural network can be trained to automatically identify scenic places, and that this network highlights both natural and built locations. Our findings demonstrate how online data combined with neural networks can provide a deeper understanding of what environments we might find beautiful and offer quantitative insights for policymakers charged with design and protection of our built and natural environments.
Article
Full-text available
For centuries, philosophers, policy-makers and urban planners have debated whether aesthetically pleasing surroundings can improve our wellbeing. To date, quantifying how scenic an area is has proved challenging, due to the difficulty of gathering large-scale measurements of scenicness. In this study we ask whether images uploaded to the website Flickr, combined with crowdsourced geographic data from OpenStreetMap, can help us estimate how scenic people consider an area to be. We validate our findings using crowdsourced data from Scenic-Or-Not, a website where users rate the scenicness of photos from all around Great Britain. We find that models including crowdsourced data from Flickr and OpenStreetMap can generate more accurate estimates of scenicness than models that consider only basic census measurements such as population density or whether an area is urban or rural. Our results provide evidence that by exploiting the vast quantity of data generated on the Internet, scientists and policy-makers may be able to develop a better understanding of people's subjective experience of the environment in which they live.
Article
Full-text available
We live in a world where digital trails of different forms of human activities compose big urban data, allowing us to detect many aspects of how people experience the city in which they live or come to visit. In this study we propose to enhance urban planning by taking into a consideration individual preferences using information from an unconventional big data source: dataset of geo-tagged photographs that people take in cities which we then use as a measure of urban attractiveness. We discover and compare a temporal behavior of residents and visitors in ten most photographed cities in the world. Looking at the periodicity in urban attractiveness, the results show that the strongest periodic patterns for visitors are usually weekly or monthly. Moreover, by dividing cities into two groups based on which continent they belong to (i.e., North America or Europe), it can be concluded that unlike European cities, behavior of visitors in the US cities in general is similar to the behavior of their residents. Finally, we apply two indices, called “dilatation attractiveness index” and “dilatation index”, to our dataset which tell us the spatial and temporal attractiveness pulsations in the city. The proposed methodology is not only important for urban planning, but also does support various business and public stakeholder decision processes, concentrated for example around the question how to attract more visitors to the city or estimate the impact of special events organized there.
Conference Paper
Full-text available
Computer vision methods that quantify the perception of urban environment are increasingly being used to study the relationship between a city's physical appearance and the behavior and health of its residents. Yet, the throughput of current methods is too limited to quantify the perception of cities across the world. To tackle this challenge, we introduce a new crowdsourced dataset containing 110,988 images from 56 cities, and 1,170,000 pairwise comparisons provided by 81,630 online volunteers along six perceptual attributes: safe, lively, boring, wealthy, depressing, and beautiful. Using this data, we train a Siamese-like convolutional neural architecture, which learns from a joint classification and ranking loss, to predict human judgments of pairwise image comparisons. Our results show that crowdsourcing combined with neural networks can produce urban perception data at the global scale.
Article
Full-text available
Few people would deny an intuitive sense of increased wellbeing when spending time in beautiful locations. Here, we ask: can we quantify the relationship between environmental aesthetics and human health? We draw on data from Scenic-Or-Not, a website that crowdsources ratings of “scenicness” for geotagged photographs across Great Britain, in combination with data on citizen-reported health from the Census for England and Wales. We find that inhabitants of more scenic environments report better health, across urban, suburban and rural areas, even when taking core socioeconomic indicators of deprivation into account, such as income, employment and access to services. Our results provide evidence in line with the striking hypothesis that the aesthetics of the environment may have quantifiable consequences for our wellbeing.
Article
Full-text available
Using functional magnetic resonance imaging (fMRI) we investigated whether a culturally defined context modulates the neurocognitive processing of artworks. We presented subjects with paintings from the Museum of Modern Art (MoMA) in New York, and labeled them as being either from the MoMA or from an adult education center. Irrespective of aesthetic appreciation, we found higher neural activation in the left precuneus, superior and inferior parietal cortex for the MoMA condition compared to the control label condition. When taking the aesthetic preference for a painting into account, the MoMA condition elicited higher involvement of right precuneus, bilateral anterior cingulate cortex (ACC), and temporoparietal junction (TPJ). Our findings indicate that mental frames, in particular labels of social value, modulate both cognitive and affective aspects of sensory processing.
Article
Springer International Publishing AG 2016. Computer vision methods that quantify the perception of urban environment are increasingly being used to study the relationship between a city’s physical appearance and the behavior and health of its residents. Yet, the throughput of current methods is too limited to quantify the perception of cities across the world. To tackle this challenge, we introduce a new crowdsourced dataset containing 110,988 images from 56 cities, and 1,170,000 pairwise comparisons provided by 81,630 online volunteers along six perceptual attributes: safe, lively, boring, wealthy, depressing, and beautiful. Using this data, we train a Siamese-like convolutional neural architecture, which learns from a joint classification and ranking loss, to predict human judgments of pairwise image comparisons. Our results show that crowdsourcing combined with neural networks can produce urban perception data at the global scale.
Book
Product Information About The Product The IAAP Handbook of Applied Psychology, an up-to-date and authoritative reference, provides a critical overview of applied psychology from an international perspective. • Brings together articles by leading authorities from around the world • Provides the reader with a complete overview of the field and highlights key research findings • Divided into three parts: professional psychology, substantive areas of applied psychology, and special topics in applied psychology • Explores the challenges, opportunities, and potential future developments in applied psychology • Features comprehensive coverage of the field, including topics as diverse as clinical health psychology, environmental psychology, and consumer psychology.
Article
Global interest in participatory mapping described as public participation GIS (PPGIS), participatory GIS (PGIS), and volunteered geographic information (VGI) continues to grow, but systematic study of spatial data quality and sampling effects is limited. This paper provides a review and meta-analysis of internet-based PPGIS studies conducted 2006–2015 (n=26) to answer the following research questions: (1) How does mapping effort, as a proxy measure for spatial data quality, differ by sampling group? (2) Does the purpose and context of PPGIS influence mapping results? (3) What is the potential for mapping bias through sampling design? (4) Given the results, what should be the focus of future PPGIS research? Mapping effort was highest in sampling groups whose livelihoods were closely related to the purpose of the study, there was greater mapping effort in household sampling groups compared to volunteer groups, and participant domicile had strong effects on mapped results through spatial discounting. The use of online internet panels provides higher response rates but lower spatial data quality. Future research should focus on increasing sampling response rates, assessing social trade-offs using alternative spatial weighting schemes, and examining the capacity of the public to select land use alternatives as a complement to traditional expert-driven planning systems.