Eva Cheng

RMIT University, Melbourne, Victoria, Australia

Are you Eva Cheng?

Claim your profile

Publications (33)

  • Hieu Minh Bui · Margaret Lech · Eva Cheng · [...] · Ian S. Burnett
    Conference Paper · Jul 2016
  • Conference Paper · Mar 2016
  • Conference Paper · Nov 2015
  • Xiaoying Wang · Eva Cheng · Ian S. Burnett
    Conference Paper · Jul 2015
  • Source
    Sipei Zhao · Xiaojun Qiu · Eva Cheng · [...] · Mark Burry
    [Show abstract] [Hide abstract] ABSTRACT: This note is intended to understand relative importance of room shape and fine structures on the sound quality inside small meeting rooms in terms of the reverberation time, the sound field distribution and the speech transmission index with similar room volume, surface area and the absorption coefficients. First, different shaped rooms with smooth walls are modeled and simulated to investigate the effects of room shape on the sound quality, and then hyperboloid cells are made on the walls to examine the influence of fine structural surface on sound quality with both regular and random arrangements. It is found that the reverberation time is affected significantly by the room shape while is not sensitive to the hyperboloid cells. The sound field distribution is affected little by the room shape and the hyperboloid cells and the difference is smaller than the Just-Noticeable-Difference in most cases. The impact of the room shape and fine structural surface on the speech transmission index mainly lies in the transition area between the direct sound and the reverberant sound. The reliability of the simulation remarks is confirmed by the experiments carried out in two different meeting rooms. The main conclusion of the note is that when the room volume, the surface area and the absorption coefficients are kept constant, the room shape and fine structural surface have little impact on the sound field distribution and speech intelligibility inside small rooms with ordinary surface absorption, while the reverberation time is affected significantly by room shape but slightly by the fine structural surface.
    Full-text available · Article · Jun 2015 · Applied Acoustics
  • A. Albahri · M. Lech · E. Cheng
    Article · Jan 2015
  • L. Wu · X. Qiu · I.S. Burnett · [...] · Y. Guo
    [Show abstract] [Hide abstract] ABSTRACT: In real active noise control (ANC)applications,the following situations frequently occur, one isthat disturbances only present at the error sensor and havelowcorrelation with reference signal, the other is thatthere is no enough space or ideal position for locating the reference sensor to satisfy causality condition. Thusthe residual noise after feedforward control can be seen as uncorrelated narrowband disturbancesin these situationsand ahybrid adaptive feedforward and feedback structure is often utilized to cope with this problem.Many efforts have been paid to improve the performance of the hybrid ANC system, nevertheless, few interests are concerned about the combination method between the feedforward and feedback structure. After investigating the conventional combination method of hybrid feedforward and feedback system, this paper introduces analternate combination method for hybrid ANC systemwhich featuresthat itavoidsthe coupling between the feedforward and feedback structures and both structures are concatenated to attenuate the ambient noise. Simulations are carried out to validatethe effectiveness of the introduced methodfor ANCwith uncorrelated narrowband disturbances.
    Article · Jan 2014
  • Li Ling · Eva Cheng · Ian S. Burnett
    [Show abstract] [Hide abstract] ABSTRACT: This paper proposes the use of the Iterated Extended Kalman Filter (IEKF) in a real-time 3D mapping framework applied to Microsoft Kinect RGB-D data. Standard EKF techniques typically used for 3D mapping are susceptible to errors introduced during the state prediction linearization and measurement prediction. When models are highly nonlinear due to measurement errors e.g., outliers, occlusions and feature initialization errors, the errors propagate and directly result in divergence and estimation inconsistencies. To prevent linearized error propagation, this paper proposes repetitive linearization of the nonlinear measurement model to provide a running estimate of camera motion. The effects of iterated-EKF are experimentally simulated with synthetic map and landmark data on a range and bearing camera model. It was shown that the IEKF measurement update outperforms the EKF update when the state causes nonlinearities in the measurement function. In the real indoor environment 3D mapping experiment, more robust convergence behavior for the IEKF was demonstrated, whilst the EKF updates failed to converge.
    Conference Paper · May 2013
  • E. Cheng · P. Burton · J. Burton · [...] · I. Burnett
    [Show abstract] [Hide abstract] ABSTRACT: There has been much recent interest, both from industry and research communities, in 3D video technologies and processing techniques. However, with the standardisation of 3D video coding well underway and researchers studying 3D multimedia delivery and users' quality of multimedia experience in 3D video environments, there exist few publicly available databases of 3D video content. Further, there are even fewer sources of uncompressed 3D video content for flexible use in a number of research studies and applications. This paper thus presents a preliminary version of RMIT3DV: an uncompressed HD 3D video database currently composed of 31 video sequences that encompass a range of environments, lighting conditions, textures, motion, etc. The database was natively filmed on a professional HD 3D camera, and this paper describes the 3D film production workflow in addition to the database distribution and potential future applications of the content. The database is freely available online via the creative commons license, and researchers are encouraged to contribute 3D content to grow the resource for the (HD) 3D video research community.
    Conference Paper · Jul 2012
  • Source
    Benjamin Rainer · Markus Waltl · Eva Cheng · [...] · Hermann Hellwagner
    [Show abstract] [Hide abstract] ABSTRACT: Multimedia is ubiquitously available online with large amounts of video increasingly consumed through Web sites such as YouTube or Google Video. However, online multimedia typically limits users to visual/auditory stimulus, with onscreen visual media accompanied by audio. The recent introduction of MPEG-V proposed multi-sensory user experiences in multimedia environments, such as enriching video content with so-called sensory effects like wind, vibration, light, etc. In MPEG-V, these sensory effects are represented as Sensory Effect Metadata (SEM), which is additionally associated to the multimedia content. This paper presents three user studies that utilize the sensory effects framework of MPEG-V, investigating the emotional response of users and enhancement of Quality of Experience (QoE) of Web video sequences from a range of genres with and without sensory effects. In particular, the user studies were conducted in Austria and Australia to investigate whether geography and cultural differences affect users' elicited emotional responses and QoE.
    Full-text available · Conference Paper · Jul 2012
  • [Show abstract] [Hide abstract] ABSTRACT: This paper investigates how minimal user interaction paradigms and markerless image recognition technologies can be applied to matching print media content to online digital proofs. By linking print material to online content, users can enhance their experience with traditional forms of print media with updated online content, videos, interactive online features etc. The proposed approach is based on extracting features from images/text from mobile device camera images to form 'fingerprints' that are used to find matching images/text within a limited test set. An important criterion for these applications is to ensure that the user Quality of Experience (QoE), particularly in terms of matching accuracy and time, is robust to a variety of conditions typically encountered in practical scenarios. In this paper, the performance of a number of computer vision techniques that extract the image features and form the fingerprints are analysed and compared. Both computer simulation tests and mobile device experiments in realistic user conditions are conducted to study the effectiveness of the techniques when considering scale, rotation, blur and lighting variations typically encountered by a user.
    Conference Paper · Jul 2012
  • Li Ling · Ian S. Burrent · Eva Cheng
    [Show abstract] [Hide abstract] ABSTRACT: Current approaches for 3D reconstruction from feature points of images are classed as sparse and dense techniques. However, the sparse approaches are insufficient for surface reconstruction since only sparsely distributed feature points are presented. Further, existing dense reconstruction approaches require pre-calibrated camera orientation, which limits the applicability and flexibility. This paper proposes a one-stop 3D reconstruction solution that reconstructs a highly dense surface from an uncalibrated video sequence, the camera orientations and surface reconstruction are simultaneously computed from new dense point features using an approach motivated by Structure from Motion (SfM) techniques. Further, this paper presents a flexible automatic method with the simple interface of 'videos to 3D model'. These improvements are essential to practical applications in 3D modeling and visualization. The reliability of the proposed algorithm has been tested on various data sets and the accuracy and performance are compared with both sparse and dense reconstruction benchmark algorithms.
    Conference Paper · Jul 2012
  • Li Ling · Ian S. Burnett · Eva Cheng
    [Show abstract] [Hide abstract] ABSTRACT: This paper proposes a flexible, markerless registration method that addresses the problem of realistic virtual object placement at any position in a video sequence. The registration consists of two steps: four points are specified by the user to build the world coordinate system, where the virtual object is rendered. A self-calibration camera tracking algorithm is then proposed to recover the camera viewpoint frame-by-frame, such that the virtual object can be dynamically and correctly rendered according to camera movement. The proposed registration method needs no reference fiducials, knowledge of camera parameters or the user environment, where the virtual object can be placed in any environment even without any distinct features. Experimental evaluations demonstrate low errors for several camera motion rotations around the X and Y axes for the self-calibration algorithm. Finally, virtual object rendering applications in different user environments are evaluated.
    Conference Paper · Oct 2011
  • [Show abstract] [Hide abstract] ABSTRACT: Efficient content-based access to large multimedia collections requires annotations that are human-meaningful, and user tagging of media is one means to obtain such semantic metadata. Tags can also act as user feedback essential for quality of multimedia experience assessment; however, tags can lack user context and become ambiguous between different users. Further, user tagging is a deliberate and discrete event where a user's response to the media can significantly vary in-between tagging events. This paper extends upon the authors' social multimedia adaptation framework to explore the use of EEG biosignals obtained from consumer EEG headsets to form context around explicit tagging activities and as user emotional feedback in-between user tagging events. Preliminary user studies investigating grouped participant responses indicate the most indicative emotional states to be short-term excitement, engagement and frustration in addition to gyroscope information.
    Conference Paper · Sep 2011
  • Li Ling · Eva Cheng · Ian S. Burnett
    [Show abstract] [Hide abstract] ABSTRACT: This paper considers a self-calibration approach to the estimation of motion parameters for an unknown camera used for video-based augmented reality. Whilst existing systems derive four SVD solutions of the essential matrix, which encodes the epipolar geometry between two camera views, this paper presents eight possible solutions derived from mathematical computation and geometrical analysis. The eight solutions not only reflect the position and orientation of the camera in static displacement but also the dynamic, relative orientation between the camera and an object in continuous motion. This paper details a novel algorithm that introduces three geometric constraints to determine the rotation and translation matrix from the eight possible essential matrix solutions. An OpenGL camera motion simulator is used to demonstrate and evaluate the reliability of the proposed algorithms; this directly visualizes the abstract computer vision parameters into real 3D.
    Conference Paper · Jul 2011
  • Conference Paper · Jun 2011
  • Eva Cheng · Ian S. Burnett
    [Show abstract] [Hide abstract] ABSTRACT: The recent ubiquity of mobile telephony has posed the challenge of forensic speech analysis on compressed speech content. Whilst existing research studies have investigated the effect of mobile speech compression on speaker and speech parameters, this paper addresses the effect of speech compression on parameters when an interfering background speaker is present in clean and noisy conditions. Preliminary evaluations presented in this paper study the effect of the Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) speech coders on the Linear Prediction (LP) speech spectrum, Line Spectral Frequencies (LSFs), and Mel Frequency Cepstral Coefficients (MFCCs). Results indicate that due caution should be employed for the forensic analysis of mobile telephony speech: speech coder parameters are significantly degraded when an interfering speaker or noise is present, compared to parameters obtained from the main speaker alone. Moreover, at high SNR the speech parameters exhibit values that gradually transition from those ideally and independently obtained from the main speaker to those of the background speaker as the amplitude of the background interfering speaker increases.
    Conference Paper · May 2011
  • M. Hamilton · F. Salim · E. Cheng · S. L. Choy
    Conference Paper · May 2011
  • Source
    Full-text available · Chapter · Apr 2011
  • Source
    F. Salim · E. Cheng · S. L. Choy
    [Show abstract] [Hide abstract] ABSTRACT: An earlier version of this paper was presented at the 2011 IEEE International Symposium on Technology and Society (ISTAS) at Saint Xavier University in Chicago, Illinois (and printed in the 2011 ISTAS proceedings). This paper describes a proposed mobile platform, Transafe, that captures and analyses public perceptions of safety to deliver 'crowdsourced' collective intelligence about places in the City of Melbourne, Australia, and their affective states at various times of the day. Public perceptions of crime on public transport in Melbourne are often mismatched with actual crime statistics and such perceptions thus can act as social barriers to visitors and locals traversing within and through the city. Using interactive mobile applications and social media, the visualization of this crowdsourced safety perception information will increase the commuter's awareness of various situations in the City of Melbourne. In addition, through social behavioral analysis and ethnographic research, the collective public intelligence will also help inform the stakeholders of the city for future policy-making and policing strategies for safety perception management. At the centre of the proposed platform is the design and development of a mobile phone application that can contribute to people feeling safer by supporting users to report crimes and misdemeanors that they witness, and provide information about transportation and emergency services around where the users are located. The proposed application can also act as a crime deterrent with one feature that enables user tracking by up to three nominated friends if the user opts to activate tracking when feeling unsafe while roaming the city.
    Full-text available · Article · Jan 2011 · ACM SIGCAS Computers and Society

Publication Stats

85 Citations


  • 2012
    • RMIT University
      • School of Electrical and Computer Engineering
      Melbourne, Victoria, Australia
    • Melbourne Institute of Technology
      Melbourne, Victoria, Australia
    • University of Vic
      Vic, Catalonia, Spain
  • 2005
    • University of Wollongong
      • School of Electrical, Computer and Telecommunications Engineering (SECTE)
      City of Greater Wollongong, New South Wales, Australia