Jing YangETH Zurich | ETH Zürich
Jing Yang
Doctor of Sciences
About
21
Publications
6,919
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
441
Citations
Introduction
I obtained a Doctor of Sciences degree from the Department of Computer Science, ETH Zurich in 2021. My research interests span multiple aspects of Audio Augmented Reality (AAR), Virtual Reality (VR), Mixed Reality (MR), Human-Computer Interaction (HCI), music signal processing and analysis, and audio style transfer.
Education
March 2017 - November 2021
ETH Zurich
Field of study
- Computer Science
September 2014 - January 2017
ETH Zurich
Field of study
- Electrical Engineering and Information Technology
September 2010 - July 2014
Publications
Publications (21)
Less intrusive information delivery has been a popular research topic for auditory displays. While most research has addressed this issue by creating new notification cues such as rendering ambient soundscapes or modifying background music, we present a novel method to gently deliver artificial notification sounds that have been commonly used in di...
Virtual Reality (VR) Head-Mounted Display (HMD) technology immerses a user in a computer generated virtual environment. However, a VR HMD also blocks the users’ view of their physical surroundings, and so prevents them from using their mobile phones in a natural manner. In this paper, we present a novel Augmented Virtuality (AV) interface that enab...
Audio augmented reality (AAR) applications need to render virtual sounds with acoustic effects that match the real environment of the user to create an experience with strong sense of presence. This audio rendering process can be formulated as the convolu-tion between the dry sound signal and the room impulse response (IR) that covers the audible f...
Collaborative Mixed Reality (MR) technologies enable remote people to work together by sharing communication cues intrinsic to face-to-face conversations, such as eye gaze and hand gestures. While the role of visual cues has been investigated in many collaborative MR systems, the use of spatial auditory cues remains underexplored. In this paper, we...
In this work, we enrich landscape and genre paintings by spatializ-ing sounds for the drawn objects and scenes, which expands visitors' perception of the paintings and immerses them in the depicted scenarios. Plus, we personalize such spatial audio perception based on visitors' viewing behavior by applying gaze tracking. Through a preliminary user...
The auditory sense is an intuitive and immersive channel to experience our surroundings, which motivates us to augment our perception of the real world with digital auditory content. We present a wearable audio augmented reality prototype that tracks the user with six degrees of freedom in a known environment, synthesizes 3D sounds, and plays spati...
In the past, augmented reality (AR) research focused mostly on visual augmentation, which requires a visual rendering device like head-mounted displays that are usually obtrusive, expensive, and socially unaccepted. In contrast, wearable audio headsets are already popularized and the auditory sense also plays an important role in everyday interacti...
The ubiquity of wearable audio devices and the importance of the auditory sense imply great potential for audio augmented reality. In this work, we propose a concept and a prototype of synthesizing spatial sounds from arbitrary real objects to users in everyday interactions, whereby all sounds are rendered directly by the user's own ear pods instea...
While spatial audio has been an essential component in Virtual Reality, it has been rarely applied to Augmented Reality. We propose a concept and a prototype to enhance human-object interactions in daily life with 3D audio. We augment real objects in a small workspace around the user with spatial audio notifications.
For next place prediction, machine learning methods which incorporate contextual data are frequently used. However, previous studies often do not allow deriving generalizable methodological recommendations, since they use different datasets, methods for discretizing space, scales of prediction, prediction algorithms, and context data, and therefore...
Many social network applications depend on robust representations of spatio-temporal data. In this work, we present an embedding model based on feed-forward neural networks which transforms social media check-ins into dense feature vectors encoding geographic, temporal, and functional aspects for modeling places, neighborhoods, and users. We employ...
The auditory output channel is rather under-utilized in smart object to human communication. One reason is that in a smart environment, multiple overlapping audio sources can be disturbing to people. We propose a wearable audio augmentation system which allows people to effortlessly select and switch between sound sources given their interest. Our...
Many social network applications depend on robust representations of spatio-temporal data. In this work, we present an embedding model based on feed-forward neural networks which transforms social media check-ins into dense feature vectors encoding geographic, temporal, and functional aspects for modelling places, neighborhoods, and users. On the b...
The preservation of geo-privacy is a critical consideration for location-based service (LBS) providers. Unfortunately, a trade-off typically exists between the quality of location-based services and revealing of private information (e.g. geo-coordinates) to obtain such services. In this work, we develop semantic obfuscation methods, which allow a t...
Regions in urban environments often afford a mixture of different utilities. Their identification allows urban planners to leverage important insights on the emerging functional dynamics of cities. With the increasing availability of human mobility data and other forms of online digital breadcrumbs, we can now characterize urban regions with multi-...