Conference PaperPDF Available

Abstract

In this paper, we present an ongoing work that will ultimately result in a movie recommender system based on the Mise-en-Scène characteristics of the movies. We believe that the preferences of users on movies can be well described in terms of the mise-en-scène, i.e., the design aspects of movie making influencing aesthetic and style. Examples of mise-en-scène characteristics are Lighting, colors, background, and movements. Our recommender system opens new opportunities in the design of new user interfaces able to offer a personalized way to search for interesting movies through the analysis of film styles rather than using the traditional classifications of movies based on explicit attributes such as genre and cast.
Recommending Movies Based on
Mise-en-Sc`
ene Design
Yashar Deldjoo
Politecnico di Milano
yashar.deldjoo@polimi.it
Franca Garzotto
Politecnico di Milano
franca.garzotto@polimi.it
Mehdi Elahi
Politecnico di Milano
mehdi.elahi@polimi.it
Pietro Piazzolla
Politecnico di Milano
pietro.piazzolla@polimi.it
Paolo Cremonesi
Politecnico di Milano
paolo.cremonesi@polimi.it
Permission to make digital or hard copies of part or all of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for profit or commercial advantage and that copies bear this notice and the full citation
on the first page. Copyrights for third-party components of this work must be honored.
For all other uses, contact the owner/author(s). Copyright is held by the
author/owner(s).
CHI’16 Extended Abstracts, May 7–12, 2016, San Jose, CA, USA.
ACM 978-1-4503-4082-3/16/05.
http://dx.doi.org/10.1145/2851581.2892551
Abstract
In this paper, we present an ongoing work that will ulti-
mately result in a movie recommender system based on
the Mise-en-Scène characteristics of the movies. We be-
lieve that the preferences of users on movies can be well
described in terms of the mise-en-scène, i.e., the design
aspects of movie making influencing aesthetic and style.
Examples of mise-en-scène characteristics are Lighting,
colors, background, and movements. Our recommender
system opens new opportunities in the design of new user
interfaces able to offer a personalized way to search for in-
teresting movies through the analysis of film styles rather
than using the traditional classifications of movies based on
explicit attributes such as genre and cast.
Author Keywords
movie recommendation, film making, video processing
Introduction
Recommender Systems (RSs) are applications, that are ca-
pable of filtering large information and selecting the items
that are likely to be attractive for users [18]. Particularly,
they play an important role in video-on-demand web ap-
plications (e.g., YouTube and Netflix) characterized by a
huge catalogs of movies: the ultimate aim of RSs is to find
and recommend to users the movies that are most likely
to be attractive for them. However, RSs cannot make rele-
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1540
vant recommendation of movies, before some information is
available on these movies.
Recommendations are typically made using implicit and ex-
plicit preferences of users on movies’ attributes (e.g., genre,
director, and actors)[6]. However, user’s preferences can
be also described by the mise-en-scène characteristics of
movies [8, 7], i.e., the design aspects of a movie produc-
tion used to classify aesthetic and style. Lighting, colors,
background, and movements in a movie are all examples
of mise-en-scène features. Although viewers may not con-
sciously notice movie style, it still influences the viewer’s
experience of the movie. The mise-en-scène highlights sim-
ilarities in the narratives, as movie makers typically relate
the overall movie style to reflect the story, and can be used
to categorize movies at a finer level compared to the tradi-
tional movie features [7].
In this research work, we propose the exploitation of auto-
matically extracted design visual features of movies based
on mise-en-scène design characteristics, in the context
of recommender systems. We propose a novel recom-
mender system that automatically analyze video contents
and extracts a set of representative stylistic visual features
grounded on Applied Media Aesthetic [22], i.e., the theory
that is concerned with the relation of aesthetic media at-
tributes (e.g., light, camera movements, and colors) with the
perceptual reactions they are able to evoke in consumers of
media communication, particularly movies.
Our results poses new challenges in the design of user in-
terfaces able to integrate the stylistic features of movies into
a comprehensive and practical recommender system, as
the perceived quality of a recommender system is deter-
mined by its algorithm as well as by its usability [5, 4, 3].
This is a novel and multidisciplinary approach, from both
design and engineering perspectives, toward video rec-
ommendation systems. It can build huge influence on the
research area, and revolutionize the industry, e.g., social
video sharing.
More specifically, in this work we have conducted a prelimi-
nary data analysis in order to investigate two conjectures:
(i) if stylistic visual features extracted from trailers are a
good representation of the corresponding features extracted
from the original full-length movies, and (ii) if the stylistic
visual features are informative indicators of the movies.
We briefly present the results of the analysis and provide
a discussion on the ultimate goals we pursue. An extended
version of this work has been published in [7].
Technical Background
A prerequisite for RSs is the availability of information about
“explicit” content features of the items. In movie RSs, such
features are associated to the items as structured meta-
information (e.g., movie genre, director, and cast) or un-
structured meta-information (e.g., plot, tags and textual
reviews). In contrast, we propose a stylistic-based movie
recommendation technique that exploits “implicit” content
characteristics of items, i.e., features that are “encapsu-
lated" in the items and must be computationally “extracted”
from them.
For example, two movies may be from the same genre, but
they can be different based on the movie style. “The Fifth
Element" and the “War of the Worlds" are both sci-fi movies
about an alien invasion. However, they are shot completely
different, with Luc Besson (The Fifth Element) using bright
colors while Steven Spielberg (War of the Worlds) preferring
dark scenes. Although a viewer may not consciously notice
the two different movie styles, they still affect the viewer’s
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1541
experience of the movie. There are countless ways to cre-
ate a movie based on the same script simply by changing
the mise-en-scène [11].
Furthermore, mise-en-scène characteristics of the movies
can bring additional benefits to RSs. For example, mise-
en-scène can be used to tackle with the Cold Start problem
which occurs when the system is unable to accurately rec-
ommend a new item to the existing users [10]. This is a
situation that typically occurs in social movie-sharing web
applications (e.g., YouTube) where every day, hundred mil-
lions of hours of videos are uploaded by users and may
contain no meta-data and no user preference. Traditional
techniques would neglect to consider these new items even
if they may be relevant for recommendation purposes, as
the recommender has no content to analyze but video files.
To the best of my knowledge, this problem has not been yet
effectively solved [19].
Figure 1: above. Out of the past
(1947) an example of highly
contrasted lighting. below. The
wizard of OZ (1939) flat lighting
example.
Figure 2: above. An image from
Django Unchained (2012). The red
hue is used to increase the scene
sense of violence. below. An
image from Lincoln (2012). Blue
tone is used to produce the sense
of coldness and fatigue
experienced by the characters.
Artistic Motivation
In this section, we describe the artistic background to the
idea of stylistic visual features for movie recommendation.
We do this by describing the stylistic visual features from
an artistic point of view and explaining the relation between
these visual features and the corresponding aesthetic vari-
ables in movie-making domain.
The study on aesthetic elements and how their combination
contributes to establish the meaning conveyed by an artistic
work is the subject of different disciplines such as semi-
otics, and traditional aesthetic studies. The shared notion
is that humans respond to certain stimuli in ways that are
predictable, up to a given extent. One of the consequences
of the above notion is that similar stimuli are expected to
provoke similar reactions, and this as the result may allow
to group similar works of art together by the reaction they
are expected to provoke.
Among these disciplines, Applied Media Aesthetic [22], par-
ticularly, is concerned with the relation between a number
of media elements, such as light, camera movements, col-
ors, with the perceptual reactions they are able to evoke
in consumers of media communication, mainly videos and
films. Such media elements, that together build the visual
images composing the media, are investigated following a
rather formalistic approach that suits the purposes of this
paper. By an analysis of cameras, lenses, lighting, etc., as
production tools as well as their aesthetic characteristics
and uses, Applied Media Aesthetic tries to identify patterns
in how such elements operate to produce the desired effect
in communicating emotions and meanings.
The image elements that are usually addressed as funda-
mental in the literature, e.g. in [9], even if with slight differ-
ences due to the specific context, are lights and shadows,
colors, space representation, motion. It has been proved,
e.g. in [17][1], that some aspects concerning these ele-
ments can be computed from the video data stream as sta-
tistical values. We call these computable aspects as fea-
tures.
We will now look into closer details of the features, investi-
gated for content-based video recommendation in this pa-
per to provide a solid overview on how they are used to
producing perceptual reaction in the audience.
Lighting
There are at least two different purposes for lighting in
movies chiaroscuro and f lat lighting. While, the first
is a lighting technique characterized by high contrast be-
tween light and shadow areas that puts the emphasis on
an unnatural effect, the latter instead is a neutral, realistic,
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1542
way of illuminating, whose purpose is to enable recognition
of stage objects. Figure 1 illustrates the difference between
these two alternatives.
Colors
The expressive quality of colors is closely related to that
of lighting, sharing the same ability to set or magnify the
feeling derived by a given situation. Even if an exact cor-
relation between colors and the feeling they may evoke is
not currently supported by enough scientific data, colors
nonetheless, have an expressive impact that has been in-
vestigated thoroughly, e.g. in [20]. An interesting metric to
quantify this impact has been proposed in [21] as perceived
color energy, a quantity that depends on a color’s satura-
tion, brightness and the size of the area the color covers
in an image. Also the hue plays a role as if it tends toward
reds, the quantity of energy is more, while if it tends more
on blues, it is less. These tendencies are shown in exam-
ples of Figure 2.
Motion
The illusion of movement given by screening a sequence of
still frames in rapid succession is the very reason of cinema
existence. In a video or movie, there are different types of
motions to consider:
Profilmic movements: Every movement that con-
cerns elements, shot by the camera, falls in this cate-
gory, e.g. performers motion, or vehicles. The move-
ment can be real or perceived. By deciding the type
and quantity of motion an ‘actor’ has, considering as
actor any possible protagonist of a scene, the director
defines, among others, the level of attention to, or ex-
pectations from, the scene. As an example, the hero
walking slowly in a dark alley, or a fast car chasing.
Camera movements: are the movements that al-
ter the point of view on the narrated events. Camera
movements, such as the pan, truck, pedestal, or dolly,
can be used for different purposes. Some usages are
descriptive, to introduce landscapes or actors, to fol-
low performers actions, and others concern the nar-
ration, to relate two or more different elements, e.g.,
anticipating a car’s route to show an unseen obstacle,
to move toward or away from events.
Sequences movements: As shots changes, using
cuts or other transitions, the rhythm of the movie
changes accordingly. Generally, a faster rhythm is
associated with excitement, and a slower rhythm sug-
gests a more relaxed pace [2].
In this paper, we followed the approach in [17], considering
the motion content of a scene as a feature that aggregate
and generalize both profilmic and camera movements.
Research Objectives
There are a number of objectives that are expected to achieve
at the end of this on-going research work:
development and evaluation of a novel movie recom-
mendation system, based on automatic extraction of
visual stylistic features from the multimedia content;
the extracted features represent Mise-en-Scène char-
acteristics of the movies;
as a broader goal, design and development of a novel
video retrieval platform, including the HCI, that im-
proves searching and recommendation capabilities,
based on aesthetic attributes (i.e., visual features)
derived from movie styles as determined by movie
maker professionals, and accurately match viewers’
perceptions;
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1543
extraction of audio features that can effectively de-
scribe the movies in the audio feature space and will
be used together with visual features to improve the
representation model.
Visual features
While stylistic visual features have been marginally ex-
plored in the community of recommender systems, they
have been extensively studied in other fields such as Com-
puter Vision and Video Retrieval [17, 14]. By reviewing the
state-of-the-art works in these disciplines, we have iden-
tified and selected five visual features that have shown
promising results in representing the movie contents and
being the most informative and distinctive visual features:
(1) Average Shot Length, (2) Color Variance, (3) Average
Motion, (4) Motion Variation, and (5) Lighting Key.
Average Shot Length: a single camera action is named
ashot and the total number of shots in a video can be in-
dicative of the pace at which the movie is being created.
For example, action movies typically contain quick move-
ments of the camera in comparison to drama movies. Hence,
in action movies, average shot length is expected to be high
and in drama to be low. Color Variance: it is known that
variance of colors in movies is highly correlated with their
corresponding genre. Indeed, directors tend to use a large
variety of bright colors for comedy movies and darker com-
bination of colors for horror movies. For each key frame
represented in LUV color space, we compute the gener-
alized color variance [17], which is indicative of the color
variation in that key frame. Average and Variation of Mo-
tion in a video can be caused either as the result of the
camera movements (camera motion) or movements of the
objects being filmed (object motion). While measuring the
average shot length may focuses on the former, it is also
desired that the latter type to be captured accurately. For
this purpose, motion features are extracted. We used op-
tical flow [13], indicative of motion, as a robust estimate of
the pixel velocities over a sequence of images being filmed.
Lighting is considered as a discriminating factor among
movie genres and shall be effectively measured as a key
playing factor to control the type of emotion induced to a
movie consumer. For example, comedy movies often con-
tain abundance of light with a low key-to-fill ratio, i.e., a low
ratio between the brightest and dimmest light. This concept
in cinematography is known as high-key lightening. On the
other hand, horror or noir movies exploit low-key lighten-
ing, i.e. low amount of light and a high key-to-fill ratio.
We have extracted these visual features, automatically, from
each video and used them for recommendation generation.
We conducted a preliminary analysis which is described in
the next section.
Preliminary Analysis
We have conducted a preliminary experiment using a dataset
of 167 movies sampled randomly from 4 main genres, i.e.,
Action, Comedy, Drama, and Horror. Some of the movies
were from mixed genres. The dataset consisted of both full-
length movies and their trailers. Almost 95% of the movies
are recent (year of production between 1990 and 2015).
Only 5% of the movies were produce before the 90s.
In this preliminary experiment, we are interested in inves-
tigating (i) if visual features extracted from trailers are, in
general, a good approximation of the corresponding fea-
tures extracted from the original full-length movies, and (ii) if
the visual features are informative indicators of the movies.
We have computed the similarity between the visual fea-
tures extracted from the full-length movies and the trailers.
The similarity values have been computed using the well
known Cosine similarity metric [15, 16]. The average sim-
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1544
ilarity is 0.78 out of 1 (median is 0.80). More than 75% of
the movies have a similarity greater than 0.7 between the
full-length movie and trailer. Moreover, less than 3% of the
movies have a similarity below 0.5.
Overall, the cosine similarity shows a substantial correla-
tion between the full-length movies and trailers. This is an
interesting outcome that basically indicates that the trailers
of the movies can be considered as good representatives of
the corresponding full-length movies.
We have obtained high correlation between all visual fea-
tures, except with feature 2 (color variance) and 4 (object
motion): the average similarity values are 0.71, 0.57, 0.76,
0.56, and 0.92 for the first to fifth visual feature, respec-
tively. Features 2 and 4 show less similarity, comparing the
full-length movies and trailers, suggesting that their adop-
tion, if extracted from trailers, should provide less accurate
recommendations.
We have also performed a Wilcoxon significance test com-
paring features extracted from the full-length movies and
trailers. The results show that no significant difference ex-
ists between the features average motion and lighting key,
which clearly shows that the full-length movies and trailers
are highly correlation with respect to these two features.
For the other features, significant differences have been
obtained. This basically states that some of the extracted
features may be either less correlated or not very informa-
tive.
In order to identify the visual features that are more use-
ful in terms of recommendation quality, we have computed
Entropy as a measure [12] of the informativeness of the
data. Our results show that the entropy scores of almost
all visual stylistic features are large, meaning that the infor-
mative content is rich: the entropy values are 0.83, 0.61,
0.70, 0.76, and 0.93 for the first to fifth visual feature, re-
spectively. The most informative feature, in terms of entropy
score is feature 5, i.e., lighting key, and the least informative
feature is the feature 2, i.e., color variance. This observa-
tion is in the full consistency with the other findings, that
we have obtained from, e.g. Wilcoxon test and correlation
analysis (similarity of between features)
Conclusion
In this paper, we present an ongoing work of building a
video recommender system that uses Mise-en-Scène char-
acteristics of movies in order to generate recommendations.
Our recommender system will encompass a technique to
automatically analyze video contents and to extract a set of
representative stylistic features, i.e., Average Shot Length,
Color Variance, Average Motion, Motion Variation, and
Lighting Key. We present a preliminary results of analysis
that show (i) the trailers of the movies are well representa-
tive of the full-length movies (ii) the stylistic visual features
are well informative of the movie content.
For future work, we plan to design and develop an online
web application with a novel HCI that will provide a per-
sonalized way to search for interesting movies through the
analysis of film styles rather than using the traditional clas-
sifications of movies based on explicit attributes such as
genre and cast. We plan to design and conduct a real user
study in order to evaluate the quality of the recommenda-
tion as well as the usability of the system.
Acknowledgements
This work is supported by Telecom Italia S.p.A., Open Inno-
vation Department, Joint Open Lab S-Cube, Milan.
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1545
REFERENCES
1. Warren Buckland. 2008. What Does the Statistical
Style Analysis of Film Involve? A Review of Moving into
Pictures. More on Film History, Style, and Analysis.
Literary and Linguistic Computing 23, 2 (2008),
219–230. DOI:
http://dx.doi.org/10.1093/llc/fqm046
2. Kazimierz Choro´
s. 2009. Video Shot Selection and
Content-Based Scene Detection for Automatic
Classification of TV Sports News. In Internet 
Technical Development and Applications, Ewaryst
Tkacz and Adrian Kapczynski (Eds.). Advances in
Intelligent and Soft Computing, Vol. 64. Springer Berlin
Heidelberg, 73–80.
3. Dan Cosley, Shyong K Lam, Istvan Albert, Joseph A
Konstan, and John Riedl. 2003. Is seeing believing?:
how recommender system interfaces affect users’
opinions. In Proceedings of the SIGCHI conference on
Human factors in computing systems. ACM, 585–592.
4. Paolo Cremonesi, Franca Garzotto, and Roberto Turrin.
2012a. Investigating the persuasion potential of
recommender systems from a quality perspective: An
empirical study. ACM Transactions on Interactive
Intelligent Systems (TiiS) 2, 2 (2012), 11.
5. Paolo Cremonesi, Franca Garzottto, and Roberto
Turrin. 2012b. User effort vs. accuracy in rating-based
elicitation. In Proceedings of the sixth ACM conference
on Recommender systems. ACM, 27–34.
6. Marco de Gemmis, Pasquale Lops, Cataldo Musto,
Fedelucio Narducci, and Giovanni Semeraro. 2015.
Semantics-Aware Content-Based Recommender
Systems. In Recommender Systems Handbook.
Springer, 119–159.
7. Yashar Deldjoo, Mehdi Elahi, Paolo Cremonesi, Franca
Garzotto, Pietro Piazzolla, and Massimo Quadrana.
2016. Content-based Video Recommendation System
based on Stylistic Visual Features. Journal on Data
Semantics Special Issue on Recommender Systems
(2016).
8. Yashar Deldjoo, Mehdi Elahi, Massimo Quadrana,
Paolo Cremonesi, and Franca Garzotto. 2015. Toward
Effective Movie Recommendations Based on
Mise-en-Scène Film Styles. In Proceedings of the 11th
Biannual Conference on Italian SIGCHI Chapter. ACM,
162–165.
9. Chitra Dorai and Svetha Venkatesh. 2001.
Computational Media Aesthetics: Finding Meaning
Beautiful. IEEE MultiMedia 8, 4 (Oct. 2001), 10–12.
DOI:http://dx.doi.org/10.1109/93.959093
10. Mehdi Elahi, Francesco Ricci, and Neil Rubens. 2013.
Active learning strategies for rating elicitation in
collaborative filtering: a system-wide perspective. ACM
Transactions on Intelligent Systems and Technology
(TIST) 5, 1 (2013), 13.
11. J. Gibbs. 2002. Mise-en-scène: Film Style and
Interpretation. Wallflower.
https://books.google.it/books?id=j4dqY_phZlEC
12. Isabelle Guyon, Nada Matic, Vladimir Vapnik, and
others. 1996. Discovering Informative Patterns and
Data Cleaning. (1996).
13. Berthold K Horn and Brian G Schunck. 1981.
Determining optical flow. In 1981 Technical Symposium
East. International Society for Optics and Photonics,
319–331.
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1546
14. Weiming Hu, Nianhua Xie, Li Li, Xianglin Zeng, and
Stephen Maybank. 2011. A survey on visual
content-based video indexing and retrieval. Systems,
Man, and Cybernetics, Part C: Applications and
Reviews, IEEE Transactions on 41, 6 (2011), 797–819.
15. Pasquale Lops, Marco De Gemmis, and Giovanni
Semeraro. 2011. Content-based recommender
systems: State of the art and trends. In Recommender
systems handbook. Springer, 73–105.
16. Michael J. Pazzani and Daniel Billsus. 2007. The
Adaptive Web. Springer-Verlag, Berlin, Heidelberg,
Chapter Content-based Recommendation Systems,
325–341. http:
//dl.acm.org/citation.cfm?id=1768197.1768209
17. Zeeshan Rasheed, Yaser Sheikh, and Mubarak Shah.
2005. On the use of computable features for film
classification. Circuits and Systems for Video
Technology, IEEE Transactions on 15, 1 (2005), 52–64.
18. Francesco Ricci, Lior Rokach, and Bracha Shapira.
2011. Introduction to recommender systems handbook.
In Recommender Systems Handbook, Francesco Ricci,
Lior Rokach, Bracha Shapira, and Paul Kantor (Eds.).
Springer Verlag, 1–35.
19. Neil Rubens, Mehdi Elahi, Masashi Sugiyama, and
Dain Kaplan. 2015. Active Learning in Recommender
Systems. In Recommender Systems Handbook -
chapter 24: Recommending Active Learning. Springer
US, 809–846.
20. Patricia Valdez and Albert Mehrabian. 1994. Effects of
color on emotions. Journal of Experimental
Psychology: General 123, 4 (1994), 394.
21. Hee Lin Wang and Loong-Fah Cheong. 2006. Affective
understanding in film. Circuits and Systems for Video
Technology, IEEE Transactions on 16, 6 (June 2006),
689–704. DOI:
http://dx.doi.org/10.1109/TCSVT.2006.873781
22. Herbert Zettl. 2002. Essentials of Applied Media
Aesthetics. In Media Computing, Chitra Dorai and
Svetha Venkatesh (Eds.). The Springer International
Series in Video Computing, Vol. 4. Springer US, 11–38.
Late-Breaking Work: Engineering of Interactive Systems
#chi4good, CHI 2016, San Jose, CA, USA
1547
... Filmic spaces might have the potential to provide creative stimuli to architectural representation as a new approach to problem identification in designing. Filmic elements such as shooting, montage, and mise-en-scène are the design methods for visualizing aesthetics and style [40]. These elements provide a new opportunity to design a student interface that can provide a personalized way to find interesting films for the analysis of filmic spaces instead of using the traditional classifications of film based on explicit attributes such as genre and cast. ...
... The program was designed in accordance with the accreditation standards of the procedures and conditions of the Korea Architecture Accreditation Board (KAAB) [46]. The Korea Institute of Architecture Education Accreditation specifies the core competencies that each university/graduate architecture education program must meet to obtain KAAB certification, and its corresponding student performance criteria (SPC) [40]. Out of the 26 criteria of SPC, the Visual Communication Design Studio set its curriculum centered on five criteria. ...
... Table 1. Student performance criteria (SPC) [40]. ...
Article
Full-text available
Representing visual experiences is an essential part of architectural design education for creativity. The representation of creative ideas relates to the ability to communicate spatial design concepts. This study examined whether filmic spaces could function as visual communication to enhance students’ creative thinking in architecture. It explored how creativity can be supported throughout an architectural design studio with a conceptual tool that translates filmic spaces into spatial design. To investigate the ways to translate filmic space into spatial design tools for creative thinking, we conducted a design studio with first-year university students. Focusing on using various elements of film, including movement, frame, montage, light, and color, and scene changes to represent architectural languages, a curriculum was developed and implemented in a Visual Communication Design Studio for one semester, stimulating students to engage in expressing their ideas in three-dimensional spaces. The overall results suggested that the design education method that used the filmic space as a stimulating tool for creative thinking, emphasizing the role of visual communication, could enhance students’ creative thinking, leading to improved creative design processes.
... Filmic spaces might have the potential to provide creative stimuli to architectural representation as a new approach to problem identification in designing. Filmic elements such as shooting, montage, and mise-en-scène are the design methods for visualizing aesthetics and style [40]. These elements provide a new opportunity to design a student interface that can provide a personalized way to find interesting films for the analysis of filmic spaces instead of using the traditional classifications of film based on explicit attributes such as genre and cast. ...
... The program was designed in accordance with the accreditation standards of the procedures and conditions of the Korea Architecture Accreditation Board (KAAB) [46]. The Korea Institute of Architecture Education Accreditation specifies the core competencies that each university/graduate architecture education program must meet to obtain KAAB certification, and its corresponding student performance criteria (SPC) [40]. Out of the 26 criteria of SPC, the Visual Communication Design Studio set its curriculum centered on five criteria. ...
... Student performance criteria (SPC)[40]. ...
Article
Full-text available
Representing visual experiences is an essential part of architectural design education for creativity. The representation of creative ideas relates to the ability to communicate spatial design concepts. This study examined whether filmic spaces could function as visual communication to enhance students’ creative thinking in architecture. It explored how creativity can be supported throughout an architectural design studio with a conceptual tool that translates filmic spaces into spatial design. To investigate the ways to translate filmic space into spatial design tools for creative thinking, we conducted a design studio with first-year university students. Focusing on using various elements of film, including movement, frame, montage, light, and color, and scene changes to represent architectural languages, a curriculum was developed and implemented in a Visual Communication Design Studio for one semester, stimulating students to engage in expressing their ideas in three-dimensional spaces. The overall results suggested that the design education method that used the filmic space as a stimulating tool for creative thinking, emphasizing the role of visual communication, could enhance students’ creative thinking, leading to improved creative design processes.
... We might expect these to be more variable, however, as the extent of expository narrative will vary between productions. More ambitious, though potentially rewarding, could be to combine textual with visual features, as lighting and motion to give a more holistic approach to genre representation (e.g., Deldjoo, 2016). In terms of genre prediction, while we have shown that topic features can be good predictors of certain genres for the plot summary dataset, it will be instructive to undertake sensitivity analysis with topic models using different topic numbers and hyperparameter values, in order to see how term-topic and topic-document complexity affects prediction. ...
Article
Full-text available
Genre plays an important role in the description, navigation, and discovery of movies, but it is rarely studied at large scale using quantitative methods. This allows an analysis of how genre labels are applied, how genres are composed and how these ingredients change, and how genres compare. We apply unsupervised topic modeling to a large collection of textual movie summaries and then use the model's topic proportions to investigate key questions in genre, including recognizability, mapping, canonicity, and change over time. We find that many genres can be quite easily predicted by their lexical signatures and this defines their position on the genre landscape. We find significant genre composition changes between periods for westerns, science fiction and road movies, reflecting changes in production and consumption values. We show that in terms of canonicity, canonical examples are often at the high end of the topic distribution profile for the genre rather than central as might be predicted by categorization theory.
... Simply speaking, this concept covers everything that appears on the scene; e.g., characters, props, backgrounds, etc. Although a few existing studies [1,4] have been conducted on the mise-en-scène in films, they focus on mise-en-scène as physical expressions; e.g., lighting, composition, camera work, etc. These studies can not cover meanings of the expressions, which can be metaphorical symbols located by directors. ...
Conference Paper
This study aims to analyze mise-en-scène in movies. Although the mise-en-scène includes all the entities that appear on the scene, we focus on metaphorical meanings of the entities. When entities are expressions of the same metaphorical symbol, lexical meanings of the entities will be relevant to each other. Also, if a metaphorical symbol is significant for storytelling, the symbol will appear on most of the scenes. Therefore, we find groups of entities, which share similar lexical meanings and appear allover the movie. As a preliminary study, we have applied these approaches on terms in movie scripts with simple natural language processing techniques. Although we have not conducted evaluation yet, we anticipate that the proposed method will be helpful for analyzing artistic and topical intentions of movie directors.
... Through querying YouTube, we obtained a huge dataset of 13923 movie trailers [7,26,31] based on the titles available in the Movielens dataset [18]. Prior work showed a high similarity of visual features extracted from movie trailers and their respective fulllength movies [8]. ...
Conference Paper
Full-text available
Recommender Systems (RSs) have become essential tools in any video-sharing platforms (such as YouTube) by generating video suggestions for users. Although, RSs have been effective, however, they suffer from the so-called New Item problem. New item problem, as part of Cold Start problem, happens when a new item is added to the system catalogue and the RS has no or little data available for that new item. In such a case, the system may fail to meaningfully recommend the new item to any user. In this paper, we propose a novel recommendation technique based on visual tags, i.e., tags that are automatically annotated to videos based on visual description of videos. Such visual tags can be used in an extreme cold start situation, where neither any rating, nor any tag is available for the new video item. The visual tags could also be used in the moderate cold start situation when the new video item has been annotated with few tags. This type of content features can be extracted automatically without any human involvement and have been shown to be very effective in representing the video content. We have used a large dataset of videos and shown that automat- ically extracted visual tags can be incorporated into the cold start recommendation process and achieve superior results compared to the recommendation based on human-annotated tags.
Article
With the increasing popularity of social networks, many businesses have started implementing their branding or targeted advertising strategies to reach potential customers through social media platforms. It is desirable and essential to help businesses to reach mass audiences and assist users to find favorite business accounts on social media platforms. In the movie industry, movie companies often create business accounts (movie accounts) to promote their movies and capture the attention of followers on Instagram. Instagram contains rich information about movies and user feedback, while IMDb, one of the most popular online databases, contains well-organized information related to movies. The features extracted from the data collected from Instagram and IMDb can complement each other. Therefore, in this study, we propose a framework for recommending movie accounts to users on Instagram by using the data collected from Instagram and IMDb platforms. The experiment results show that our proposed framework outperforms the comparing methods in terms of precision, recall, F1-score, and Normalized Discounted Cumulative Gain (NDCG), and mitigates the effect of cold start problems. The proposed framework can help movie companies or businesses reach potential audiences and implement effective targeted advertising strategies.
Conference Paper
Full-text available
We present an exploratory study in the context of digital film analysis inspecting and comparing five canonical movies by applying methods of computer vision. We extract one frame per second of each movie which we regard as our sample. As computer vision methods we explore image-based object detection, emotion recognition, gender and age detection with state-of-the-art models. We were able to identify significant differences between the movies for all methods. We present our results and discuss the limitations and benefits of each method. We close by formulating future research questions we plan to answer by applying and optimizing the methods.
Article
Full-text available
Over the past years, the research of video recommender systems (RSs) has been mainly focussed on the development of novel algorithms. Although beneficial, still any algorithm may fail to recommend video items that the system has no form of data associated to them (New Item Cold Start). This problem occurs when a new item is added to the catalogue of the system and no data are available for that item. In content‐based RSs, the video items are typically represented by semantic attributes, when generating recommendations. These attributes require a group of experts or users for annotation, and still, the generated recommendations might not capture a complete picture of the users' preferences, for example, the visual tastes of users on video style. This article addresses this problem by proposing recommendation based on novel visual features that do not require human annotation and can represent visual aspects of video items. We have designed a novel evaluation methodology considering three realistic scenarios, that is, (a) extreme cold start, (b) moderate cold start and (c) warm‐start scenario. We have conducted a set of comprehensive experiments, and our results have shown the superior performance of recommendations based on visual features, in all of the evaluation scenarios.
Chapter
Full-text available
In Recommender Systems (RS), a users preferences are expressed in terms of rated items, where incorporating each rating may improve the RS’s predictive accuracy. In addition to a user rating items at-will (a passive process), RSs may also actively elicit the user to rate items, a process known as Active Learning (AL). However, the number of interactions between the RS and the user is still limited. One aim of AL is therefore the selection of items whose ratings are likely to provide the most information about the user’s preferences. In this chapter, we provide an overview of AL within RSs, discuss general objectives and considerations, and then summarize a variety of methods commonly employed. AL methods are categorized based on our interpretation of their primary motivation/goal, and then sub-classified into two commonly classified types, instance-based and model-based, for easier comprehension. We conclude the chapter by outlining ways in which AL methods could be evaluated, and provide a brief summary of methods performance.
Chapter
Full-text available
Content-based recommender systems (CBRSs) rely on item and user descriptions (content) to build item representations and user profiles that can be effectively exploited to suggest items similar to those a target user already liked in the past. Most content-based recommender systems use textual features to represent items and user profiles, hence they suffer from the classical problems of natural language ambiguity. This chapter presents a comprehensive survey of semantic representations of items and user profiles that attempt to overcome the main problems of the simpler approaches based on keywords. We propose a classification of semantic approaches into top-down and bottom-up. The former rely on the integration of external knowledge sources, such as ontologies, encyclopedic knowledge and data from the Linked Data cloud, while the latter rely on a lightweight semantic representation based on the hypothesis that the meaning of words depends on their use in large corpora of textual documents. The chapter shows how to make recommender systems aware of semantics to realize a new generation of content-based recommenders.
Article
Full-text available
This paper investigates the use of automatically extracted visual features of videos in the context of recommender systems and brings some novel contributions in the domain of video recommendations. We propose a new content-based recommender system that encompasses a technique to automatically analyze video contents and to extract a set of representative stylistic features (lighting, color, and motion) grounded on existing approaches of Applied Media Theory. The evaluation of the proposed recommendations, assessed w.r.t. relevance metrics (e.g., recall) and comThe evaluation of the proposed recommendations, assessed w.r.t. relevance metrics (e.g., recall) and compared with existing content-based recommender sys-tems that exploit explicit features such as movie genre, shows that our technique leads to more accurate rec-ommendations. Our proposed technique achieves better results not only when visual features are extracted from full-length videos, but also when the feature extraction technique operates on movie trailers, pinpointing that our approach is effective also when full-length videos are not available or when there are performance requirements. Our recommender can be used in combination with more traditional content-based recommendation tech-niques that exploit explicit content features associated to video files, in order to improve the accuracy of recom-mendations. Our recommender can also be used alone, to address the problem originated from video files that have no meta-data, a typical situation of popular movie-sharing websites (e.g., YouTube) where every day hun-dred millions of hours of videos are uploaded by users and may contain no associated information. As they lack explicit content, these items cannot be considered for recommendation purposes by conventional content-based techniques even when they could be relevant for the user.
Conference Paper
Full-text available
Recommender Systems (RSs) play an increasingly important role in video-on-demand web applications -- such as YouTube and Netflix -- characterized by a very large catalogs of videos and movies. Their goal is to filter information and to recommend to users only the videos that are likely of interest to them. Recommendations are traditionally generated on the basis of user's preferences on movies' attributes, such as genre, director, actors. Preferences on attributes are implicitly detected by analyzing the user's past opinions on movies. However, we believe that the opinion of users on movies is better described in terms of the mise-en-scène, i.e., the design aspects of a film production affecting aesthetic and style. Lighting, colors, background, and movements are all examples of mise-en-scène. Although viewers may not consciously notice film style, it still affects the viewer's experience of the film. The mise-en-scène highlights similarities in the narratives, as filmmakers typically relate the overall film style to reflect the story, and can be used to categorize movies at a finer level than with traditional film attributes, such as genre and cast. Two films may be from the same genre, but they can look different based on the film style. In this paper, we present an ongoing work for the development of a novel application that offers a personalized way to search for interesting multimedia content. Instead of using the traditional classifications of movies based on attributes such as genre and cast, we use aesthetic movie features derived from film styles as determined by filmmaker professionals. Stylistic features of movies are extracted with an automatic video analysis tool and are used by our application to generate personalized recommendations and to help users in searching for interesting content.
Article
Full-text available
The accuracy of collaborative-filtering recommender systems largely depends on three factors: the quality of the rating prediction algorithm, and the quantity and quality of available ratings. While research in the field of recommender systems often concentrates on improving prediction algorithms, even the best algorithms will fail if they are fed poor-quality data during training, that is, garbage in, garbage out. Active learning aims to remedy this problem by focusing on obtaining better-quality data that more aptly reflects a user's preferences. However, traditional evaluation of active learning strategies has two major flaws, which have significant negative ramifications on accurately evaluating the system's performance (prediction error, precision, and quantity of elicited ratings). (1) Performance has been evaluated for each user independently (ignoring system-wide improvements). (2) Active learning strategies have been evaluated in isolation from unsolicited user ratings (natural acquisition). In this article we show that an elicited rating has effects across the system, so a typical user-centric evaluation which ignores any changes of rating prediction of other users also ignores these cumulative effects, which may be more influential on the performance of the system as a whole (system centric). We propose a new evaluation methodology and use it to evaluate some novel and state-of-the-art rating elicitation strategies. We found that the system-wide effectiveness of a rating elicitation strategy depends on the stage of the rating elicitation process, and on the evaluation measures (MAE, NDCG, and Precision). In particular, we show that using some common user-centric strategies may actually degrade the overall performance of a system. Finally, we show that the performance of many common active learning strategies changes significantly when evaluated concurrently with the natural acquisition of ratings in recommender systems.
Chapter
Full-text available
Recommender Systems (RSs) are software tools and techniques providing suggestions for items to be of use to a user. In this introductory chapter we briefly discuss basic RS ideas and concepts. Our main goal is to delineate, in a coherent and structured way, the chapters included in this handbook and to help the reader navigate the extremely rich and detailed content that the handbook offers.
Chapter
This chapter discusses content-based recommendation systems, i.e., systems that recommend an item to a user based upon a description of the item and a profile of the user's interests. Content-based recommendation systems may be used in a variety of domains ranging from recommending web pages, news articles, restaurants, television programs, and items for sale. Although the details of various systems differ, content-based recommendation systems share in common a means for describing the items that may be recommended, a means for creating a profile of the user that describes the types of items the user likes, and a means of comparing items to the user profile to determine what to re commend. The profile is often created and updated automatically in response to feedback on the desirability of items that have been presented to the user.
Article
One of the unresolved issues when designing a recommender system is the number of ratings -- i.e., the profile length -- that should be collected from a new user before providing recommendations. A design tension exists, induced by two conflicting requirements. On the one hand, the system must collect "enough"ratings from the user in order to learn her/his preferences and improve the accuracy of recommendations. On the other hand, gathering more ratings adds a burden on the user, which may negatively affect the user experience. Our research investigates the effects of profile length from both a subjective (user-centric) point of view and an objective (accuracy-based) perspective. We carried on an offline simulation with three algorithms, and a set of online experiments involving overall 960 users and four recommender algorithms, to measure which of the two contrasting forces influenced by the number of collected ratings -- recommendations relevance and burden of the rating process -- has stronger effects on the perceived quality of the user experience. Moreover, our study identifies the potentially optimal profile length for an explicit, rating based, and human controlled elicitation strategy.
Article
Recommender Systems (RSs) help users search large amounts of digital contents and services by allowing them to identify the items that are likely to be more attractive or useful. RSs play an important persuasion role, as they can potentially augment the users’ trust towards in an application and orient their decisions or actions towards specific directions. This article explores the persuasiveness of RSs, presenting two vast empirical studies that address a number of research questions. First, we investigate if a design property of RSs, defined by the statistically measured quality of algorithms, is a reliable predictor of their potential for persuasion. This factor is measured in terms of perceived quality, defined by the overall satisfaction, as well as by how users judge the accuracy and novelty of recommendations. For our purposes, we designed an empirical study involving 210 subjects and implemented seven full-sized versions of a commercial RS, each one using the same interface and dataset (a subset of Netflix), but each with a different recommender algorithm. In each experimental configuration we computed the statistical quality (recall and F-measures) and collected data regarding the quality perceived by 30 users. The results show us that algorithmic attributes are less crucial than we might expect in determining the user’s perception of an RS’s quality, and suggest that the user’s judgment and attitude towards a recommender are likely to be more affected by factors related to the user experience. Second, we explore the persuasiveness of RSs in the context of large interactive TV services. We report a study aimed at assessing whether measurable persuasion effects (e.g., changes of shopping behavior) can be achieved through the introduction of a recommender. Our data, collected for more than one year, allow us to conclude that, (1) the adoption of an RS can affect both the lift factor and the conversion rate, determining an increased volume of sales and influencing the user’s decision to actually buy one of the recommended products, (2) the introduction of an RS tends to diversify purchases and orient users towards less obvious choices (the long tail), and (3) the perceived novelty of recommendations is likely to be more influential than their perceived accuracy. Overall, the results of these studies improve our understanding of the persuasion phenomena induced by RSs, and have implications that can be of interest to academic scholars, designers, and adopters of this class of systems.