Dejan Arsic

Dejan Arsic
  • Dr.-Ing.
  • Account Manager at Müller BBM VibroAkustik Systeme GmbH

About

87
Publications
13,551
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
773
Citations
Current institution
Müller BBM VibroAkustik Systeme GmbH
Current position
  • Account Manager
Additional affiliations
November 2004 - December 2010
Technical University of Munich
January 2008 - December 2010
Technical University of Munich
Position
  • Prometheus
November 2004 - February 2008
Technical University of Munich
Position
  • SAFEE

Publications

Publications (87)
Conference Paper
Full-text available
Die Elektrifizierung von Fahrzeugen stellt die Automobilindustrie aktuell vor viele neue Herausforderungen. Auch die Fahrzeugakustik bleibt hiervon nicht verschont. Das akustische Verhalten eines Fahrzeugs verändert sich sowohl außen als auch innen. Der Verbrennungsmotor ist in vielen Betriebszuständen meist die dominante Quelle im Fahrzeug. Dessen...
Conference Paper
Full-text available
Mit der steigenden Anzahl an Derivaten und Ausstattungsvarianten steigt die Anzahl an Messungen, die durchgeführt und ausgewertet werden müssen. Um den Entwicklungsprozess zu unterstützen, sollen unterschiedlichste Aussagen automatisch aus den Messungen getroffen werden. Dies könnten beispielsweise Trendanalysen, Zielwerte oder die Prädiktion beim...
Conference Paper
Full-text available
The Transfer Path Analysis (TPA) has been thoroughly investigated in the past, and a wide range of different approaches has already been implemented [1]. Each approach has its own properties and possible application cases. These range from a physically complete description of the entire assembly, including contributions and forces, a description of...
Conference Paper
Full-text available
Zur akustischen Untersuchung von passiven Komponenten und der Beurteilung der Dämmung bzw. Durchlässigkeit werden traditionell Intensitätsmessungen an Fensterprüfständen durchgeführt. Hierbei wird das Objekt in einem Fenster zwischen Senderaum, in dem eine Anregung stattfindet, und Empfangsraum eingespannt. Die beschallte Fläche wird nun entweder m...
Conference Paper
Full-text available
In general automobile manufacturers need to reduce development time and improve cost-efficiency to stay competitive. This demand rises even more due to increasing numbers of models and vehicle derivatives. Therefore the state-of-the-art development process of new vehicles rely on baselines with a wide range of ready-to-use components, such as power...
Conference Paper
Full-text available
Due to new regulations regarding exterior noise of vehicles, requiring a more complex measurement procedure, and the increasing number of models and vehicle derivatives, the demand of pass-by measurements is increasing. Under these intensified conditions and while keeping organizational efforts low, the simulated pass-by has been widely accepted as...
Conference Paper
Full-text available
The exterior noise is one of the key components in the automotive NVH development process. While strict regulations have to be met, the acoustic character is of major importance for the OEMs. The simulated pass-by is a widely accepted alternative to the real test track, as it is more convenient and independent from environmental conditions. Modific...
Conference Paper
Full-text available
Simulated pass-by is an approved method to increase testing productivity by being independent of weather conditions and by lowering organizational effort. Instead of using just 2 microphones at the AA line, up to 72 microphones are aligned in a line array along the virtual track. The desired signal is synthesized by an interpolation of the individu...
Conference Paper
Full-text available
Powertrains usually consist of various rotating parts, e.g. clutch, gearbox, and crankshaft. Rotational and torsional vibration propagates across the entire powertrain. It is transmitted to connected components, creating unwanted noises and vibrations. Using a precise Tacho signal allows to determine the torsional vibration of components and their...
Conference Paper
Full-text available
Die Transferpfadanalyse ist heute ein fester Bestandteil der versuchsgestützten Untersuchung von Übertragungspfaden sowohl von Luft-als auch Körperschall in Fahrzeugen. Seit der Einführung der Netzwerkanalogie in Maschinenwesen, wurden mannigfaltige Ansätze zur TPA entwickelt und haben den Sprung von die Forschung in industrielle Anwendungen gefund...
Conference Paper
Full-text available
The acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
Conference Paper
Full-text available
The Acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
Conference Paper
Full-text available
The acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
Conference Paper
Full-text available
Sowohl am Prüfstand als auch bei realen Fahrversuchen treten oft unerwünschte Phä-nomene auf, sei es hörbar oder im Spektrum sichtbar, die sich nicht mit einfachen Mitteln oder " by Inspection " lokalisieren lassen. Um den Anwender bei deren Ermitt-lung zu unterstützen, gibt es verschiedene Werkzeuge, die eine Visualisierung der Schallquelle ermögl...
Conference Paper
Full-text available
Die Windenergie gilt als einer der Schlüsselfaktoren für eine erfolgreiche Energiewende. In dicht besiedelten Gebieten muss neben einem hohen Wirkungsgrad auch eine möglichst niedrige akustische Emission gewährleistet werden. Beeinflussende Größen sind dabei insbesondere die Tonhaltigkeit, die Schalleistung und die Amplitudenmodulation zu nennen. Z...
Conference Paper
Full-text available
Since inland locations for wind turbines are mostly near populated areas, noise emissions have to be reduced to a minimum. These usually originate from rotating or vibrating parts. By applying accelerometers, microphones, and rotation sensors, it is possible to correlate relevant quantities, determine transfer functions and locate possible sound so...
Conference Paper
Full-text available
Wind turbines and residential areas are continuously coming closer due to expanding cities, rising demands on regenerative energy and limited suitable turbine locations. Wind turbines create noise during operation, such as every rotating system, which obviously influences or disturbs the environment and people living nearby. Standards like the IEC6...
Article
Full-text available
The multi-modal multi-sensor PROMETHEUS database was created in support of research and development activities [PROMETHEUS (FP7-ICT-214901): http://www.prometheus-FP7.eu] aiming at the creation of a framework for monitoring and interpretation of human behaviors in unrestricted indoor and outdoor environments. The distinctiveness of the PROMETHEUS d...
Conference Paper
Full-text available
In this contribution a novel method to compute dense point-to-point correspondences between 3D faces is presented. The correspondences can be employed for various face processing applications, for example for building up a 3D Morphable Model (3DMM). Paths connecting landmarks are traced on the 3D facial surface and the resulting patches are mapped...
Article
The 'Restaurant of the Future' is a futuristic approach for consumer research in food-related scenarios. Customers are closely monitored by cameras. Currently the video footage is evaluated and annotated manually. For a gradual automation of this process a video database with eating scenarios is published. The database is quite challenging with lig...
Conference Paper
Full-text available
In everyday live head gestures such as head shaking or nodding and hand gestures like pointing gestures form important aspects of human-human interaction. Therefore, recent research considers integrating these intuitive communication cues into technical systems for improving and easing human-computer interaction. In this paper we present a vision-b...
Conference Paper
Full-text available
While monocular gesture recognition slowly reaches maturity, the inclusion of 3D gestures remains a challenge. In order to enable robust and versatile depth-enabled gestures, a depth-image based tracking approach is developed. Using a model-based annealing particle filter approach, the pose of a single subject is retrieved and tracked over longer i...
Conference Paper
Full-text available
This paper introduces a new visual tracking technique combining particle filtering and Dynamic Bayesian Networks. The particle filter is utilized to robustly track an object in a video sequence and gain sets of descriptive object features. Dynamic Bayesian Networks use feature sequences to determine different motion patterns. A Graphical Model is i...
Conference Paper
Full-text available
Exact 3D tracking of facial feature points is appealing for many applications in human-machine interaction. In this work a 3D Active Shape Model (ASM) that can be shifted, scaled, and rotated is used to track the points. The efficient Gauss-Newton method is applied to estimate the 3D ASM, rotation, translation, and scale parameters. If the head tur...
Conference Paper
Full-text available
Current experiments with HCIs have shown a high demand for more natural interaction paradigms. Gestures are thereby considered the most important cue besides speech. In order to recognize gestures it is necessary to extract meaningful motion features from the body. Up to now mostly marker based tracking systems are used in virtual reality environme...
Conference Paper
Full-text available
This paper introduces our research platform for enabling a multimodal Human-Robot Interaction scenario as well as our research vision: approaching problems in a holistic way to realize this scenario. However, in this paper the main focus is laid on the image processing domain, where our vision has been realized by combining particle tracking and Dy...
Book
Full-text available
CCTV systems are omnipresent in daily life and are considered as a widely accepted tool to provide a high level of security in public and private places. The past has unfortunately shown that such systems are not capable to prevent crimes, as the video streams have to be analyzed on the y, in order to react in time instead of utilizing the recorde...
Conference Paper
Full-text available
Reliable tracking of objects is an inevitable prerequisite for automated video surveillance systems. As most object detection methods, which are based on machine learning, require adequate data for the application scenario, foreground segmentation is a popular method to find possible regions of interest. These usually require a specific learning ph...
Conference Paper
Full-text available
Video surveillance systems have been introduced in various fields of our daily life to enhance security and protect individuals and sensitive infrastructure. Up to now it has been usually utilized as a forensic tool for after the fact investigations and are commonly monitored by human operators. In order to assist these and to be able to react in t...
Conference Paper
Full-text available
Non-rigid registration of 3D facial surfaces is a crucial step in a variety of applications. Outliers, i.e., features in a facial surface that are not present in the reference face, often perturb the registration process. In this paper, we present a novel method which registers facial surfaces reliably also in the presence of huge outlier regions....
Chapter
Full-text available
Video surveillance systems have been introduced in various fields of our daily life to enhance security and protect individuals and sensitive infrastructure. Up to now they have been usually utilized as a forensic tool for after the fact investigations and are commonly monitored by human operators. A further gain in safety can only be achieved by t...
Conference Paper
Full-text available
Accurate 3D tracking of facial feature points from one monocular video sequence is appealing for many applications in human-machine interaction. In this work facial feature points are tracked with a Kanade-Lucas-Tomasi (KLT) feature tracker and the tracking results are linked with a 3D Active Shape Model (ASM). Thus, the efficient Gauss-Newton meth...
Conference Paper
Full-text available
In this work we present a multi-modal video editing system for meetings, which uses graphical models for the segmentation and classification of the video modes. The task of video editing is about selecting the camera, that represents the meeting in the best way out of various available cameras. Therefore a new training structure for graphical model...
Conference Paper
Full-text available
The present paper describes the construction of a multimodal database, referred to as the PROMETHEUS database, which contains recordings from heterogeneous sensors. The main purpose of this database is the development of a framework for monitoring and interpretation of human behavior in unrestricted environments of both indoor and outdoor type. It...
Conference Paper
Full-text available
Recently great interest has been shown in the visual surveillance of public transportation systems. The challenge is the automated analysis of passenger's behaviors with a set of visual low-level features, which can be extracted robustly. On a set of global motion features computed in different parts of the image, here the complete image, the face...
Conference Paper
Full-text available
Automatic labeling of chords in original audio recordings is challenging due to heavy acoustic overlay by melody and percussion sections, detuning and arpeggios that demand for a measure-grid to assign notes to chords. Further chord labeling benefits from contextual information. In this respect we suggest applying an HMM framework incorporating a m...
Conference Paper
Full-text available
In this work semantic features are used to improve the results of the camera selection. These semantic features are group action, person action and person speaking. For this purpose low level acoustic and visual features are combined with high level semantic ones. After the feature fusion, a segmentation and classification are performed by hidden M...
Conference Paper
Full-text available
This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modelling. To overcome low recognition performance due to high noise levels during operations, the vocabulary was chosen to be highly limited and multiple noise reduction method...
Article
Full-text available
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems desirable to automate video surveillance systems, in order to be able to detect potentially dangerous situations as soon as possible. Multi camera sys...
Conference Paper
Full-text available
Surveillance of drivers, pilots or passengers possesses significant potential for increased security within passenger transport. In an automotive setting the interaction can e.g. be improved by social awareness of an MMI. As further example security marshals can be efficiently positioned guided by according systems. Within this scope the detection...
Conference Paper
Full-text available
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems desirable to automate video surveillance systems, in order to be able to detect potentially dangerous situations as soon as possible. Multi camera sys...
Conference Paper
Full-text available
Bimodal emotion recognition through audiovisual feature fusion has been shown superior over each individual modality in the past. Still, synchronization of the two streams is a challenge, as many vision approaches work on a frame basis opposing audio turn- or chunk-basis. Therefore, late fusion schemes such as simple logic or voting strategies are...
Conference Paper
Full-text available
Recognition of emotion in speech usually uses acoustic models that ignore the spoken content. Likewise one general model per emotion is trained independent of the phonetic structure. Given sufficient data, this approach seemingly works well enough. Yet, this paper tries to answer the question whether acoustic emotion recognition strongly depends on...
Conference Paper
Full-text available
While the " 'quasi-state-of-the-art'" towards acoustic emotion recognition relies on multivariate time-series analysis of e.g. pitch, energy, or MFCC by statistical functionals as moments or extrema, only few respect statistical noise by outliers due to too long segments as turns. Such noise can be overcome by hierarchical functionals as means of e...
Conference Paper
Full-text available
Bimodal emotion recognition through audiovisual feature fusion has been shown superior over each individual modality in the past. Still, synchronization of the two streams is a challenge, as many vision approaches work on a frame basis opposing audio turn- or chunk-basis. Therefore, late fusion schemes such as simple logic or voting strategies are...
Conference Paper
Full-text available
Great interest has been shown in the visual surveillance of public transportation systems. The challenge is the automated analysis of passengers' behaviors with a set of visual low-level features, which can be extracted robustly. On a set of global motion features computed in different parts of the image, here the complete image, the face and skin...
Conference Paper
Full-text available
Great interest is recently shown in behavior modeling, especially in public surveillance tasks. In general it is agreed upon the benefits of use of several input cues as audio and video. Yet, synchronization and fusion of these information sources remains the main challenge. We therefore show results for a feature space combination, which allows fo...
Conference Paper
Full-text available
Video based analysis of a persons' mood or behavior is in general performed by interpreting various features observed on the body. Facial actions, such as speaking, yawning or laughing are considered as key features. Dynamic changes within the face can be modeled with the well known hidden Markov models (HMM). Unfortunately even within one class ex...
Conference Paper
Full-text available
Automatic discrimination of musical signal types as speech, singing, music, genres or drumbeats within audio streams is of great importance, e.g. for radio broadcast stream segmentation. Yet, feature sets are largely discussed. We therefore suggest a large open feature set approach starting with systematical generation of 7k hi-level features based...
Article
Full-text available
In the present treatise, we propose an approach for a highly configurable image based online person behaviour monitoring system. The particular application scenario is a crew supporting multi-stream on-board threat detection system, which is getting more desirable for the use in public trans port. For such frameworks, to work robust in mostly uncon...
Conference Paper
Full-text available
Affective computing has grown an important field in today's man-machine-interaction, and the acoustic speech signal is very popular as basis for an automatic classification at the moment. However, recognition performances reported today are mostly not sufficient for a real usage within working systems. Therefore we want to improve on this challenge...
Conference Paper
Full-text available
Face recognition is employed in several systems and computer based applications besides security applications. A broad variety of appearance based approaches has been presented and evaluated even on large datasets. However, most of the systems mainly concentrate on images and models that have been taken from the frontal view of the face. Thus they...
Conference Paper
Full-text available
Video surveillance is an omnipresent topic when it comes to enhancing security in public places and transportation systems. Fully automated behavior detection systems are desirable when it comes to cutting costs for analysing video and audio streams online. These will initiate an alarm signal autonomously if a possibly dangerous situation is detect...
Conference Paper
Full-text available
In the present treatise, we propose an approach for a highly configurable image based online person behaviour monitoring system. The particular application scenario is a crew supporting multi-stream on-board threat detection system, which is getting more desirable for the use in public transport. For such frameworks, to work robustly in mostly unco...
Conference Paper
Full-text available
In this work we strive to find an optimal set of acoustic features for the discrimination of speech, monophonic singing, and polyphonic music to robustly segment acoustic media streams for annotation and interaction purposes. Furthermore we introduce ensemble-based classification approaches within this task. From a basis of 276 attributes we select...

Network

Cited By