
Dejan Arsic- Dr.-Ing.
- Account Manager at Müller BBM VibroAkustik Systeme GmbH
Dejan Arsic
- Dr.-Ing.
- Account Manager at Müller BBM VibroAkustik Systeme GmbH
About
87
Publications
13,551
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
773
Citations
Introduction
Current institution
Müller BBM VibroAkustik Systeme GmbH
Current position
- Account Manager
Additional affiliations
November 2004 - December 2010
January 2008 - December 2010
November 2004 - February 2008
Publications
Publications (87)
Die Elektrifizierung von Fahrzeugen stellt die Automobilindustrie aktuell vor viele neue Herausforderungen. Auch die Fahrzeugakustik bleibt hiervon nicht verschont. Das akustische Verhalten eines Fahrzeugs verändert sich sowohl außen als auch innen. Der Verbrennungsmotor ist in vielen Betriebszuständen meist die dominante Quelle im Fahrzeug. Dessen...
Mit der steigenden Anzahl an Derivaten und Ausstattungsvarianten steigt die Anzahl an Messungen, die durchgeführt und ausgewertet werden müssen. Um den Entwicklungsprozess zu unterstützen, sollen unterschiedlichste Aussagen automatisch aus den Messungen getroffen werden. Dies könnten beispielsweise Trendanalysen, Zielwerte oder die Prädiktion beim...
The Transfer Path Analysis (TPA) has been thoroughly investigated in the past, and a wide range of different approaches has already been implemented [1]. Each approach has its own properties and possible application cases. These range from a physically complete description of the entire assembly, including contributions and forces, a description of...
Zur akustischen Untersuchung von passiven Komponenten und der Beurteilung der Dämmung bzw. Durchlässigkeit werden traditionell Intensitätsmessungen an Fensterprüfständen durchgeführt. Hierbei wird das Objekt in einem Fenster zwischen Senderaum, in dem eine Anregung stattfindet, und Empfangsraum eingespannt. Die beschallte Fläche wird nun entweder m...
In general automobile manufacturers need to reduce development time and improve cost-efficiency to stay competitive. This demand rises even more due to increasing numbers of models and vehicle derivatives. Therefore the state-of-the-art development process of new vehicles rely on baselines with a wide range of ready-to-use components, such as power...
Due to new regulations regarding exterior noise of vehicles, requiring a more complex measurement procedure, and the increasing number of models and vehicle derivatives, the demand of pass-by measurements is increasing. Under these intensified conditions and while keeping organizational efforts low, the simulated pass-by has been widely accepted as...
The exterior noise is one of the key components in the automotive NVH development process. While strict regulations have to be met, the acoustic character is of major importance for the OEMs. The simulated pass-by is a widely accepted alternative to the real test track, as it is more convenient and independent from environmental conditions. Modific...
Simulated pass-by is an approved method to increase testing productivity by being independent of weather conditions and by lowering organizational effort. Instead of using just 2 microphones at the AA line, up to 72 microphones are aligned in a line array along the virtual track. The desired signal is synthesized by an interpolation of the individu...
Powertrains usually consist of various rotating parts, e.g. clutch, gearbox, and crankshaft. Rotational and torsional vibration propagates across the entire powertrain. It is transmitted to connected components, creating unwanted noises and vibrations. Using a precise Tacho signal allows to determine the torsional vibration of components and their...
Die Transferpfadanalyse ist heute ein fester Bestandteil der versuchsgestützten Untersuchung von Übertragungspfaden sowohl von Luft-als auch Körperschall in Fahrzeugen. Seit der Einführung der Netzwerkanalogie in Maschinenwesen, wurden mannigfaltige Ansätze zur TPA entwickelt und haben den Sprung von die Forschung in industrielle Anwendungen gefund...
The acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
The Acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
The acoustic damping of a vehicle influences the comfort of its passengers. In order to evaluate this it can be helpful to auralize the sound affecting the vehicle. When normalized transfer functions are measured, various operating conditions can be auralized. The presented theories will show an approach not only capable of auralizing constant oper...
Sowohl am Prüfstand als auch bei realen Fahrversuchen treten oft unerwünschte Phä-nomene auf, sei es hörbar oder im Spektrum sichtbar, die sich nicht mit einfachen Mitteln oder " by Inspection " lokalisieren lassen. Um den Anwender bei deren Ermitt-lung zu unterstützen, gibt es verschiedene Werkzeuge, die eine Visualisierung der Schallquelle ermögl...
Die Windenergie gilt als einer der Schlüsselfaktoren für eine erfolgreiche Energiewende. In dicht besiedelten Gebieten muss neben einem hohen Wirkungsgrad auch eine möglichst niedrige akustische Emission gewährleistet werden. Beeinflussende Größen sind dabei insbesondere die Tonhaltigkeit, die Schalleistung und die Amplitudenmodulation zu nennen. Z...
Since inland locations for wind turbines are mostly near populated areas, noise emissions have to be reduced to a minimum. These usually originate from rotating or vibrating parts. By applying accelerometers, microphones, and rotation sensors, it is possible to correlate relevant quantities, determine transfer functions and locate possible sound so...
Wind turbines and residential areas are continuously coming closer due to expanding cities, rising demands on regenerative energy and limited suitable turbine locations. Wind turbines create noise during operation, such as every rotating system, which obviously influences or disturbs the environment and people living nearby. Standards like the IEC6...
The multi-modal multi-sensor PROMETHEUS database was created in support of research and development activities [PROMETHEUS (FP7-ICT-214901): http://www.prometheus-FP7.eu] aiming at the creation of a framework for monitoring and interpretation of human behaviors in unrestricted indoor and outdoor environments. The distinctiveness of the PROMETHEUS d...
In this contribution a novel method to compute dense point-to-point correspondences between 3D faces is presented. The correspondences can be employed for various face processing applications, for example for building up a 3D Morphable Model (3DMM). Paths connecting landmarks are traced on the 3D facial surface and the resulting patches are mapped...
The 'Restaurant of the Future' is a futuristic approach for consumer research in food-related scenarios. Customers are closely monitored by cameras. Currently the video footage is evaluated and annotated manually. For a gradual automation of this process a video database with eating scenarios is published. The database is quite challenging with lig...
In everyday live head gestures such as head shaking or nodding and hand gestures like pointing gestures form important aspects of human-human interaction. Therefore, recent research considers integrating these intuitive communication cues into technical systems for improving and easing human-computer interaction. In this paper we present a vision-b...
While monocular gesture recognition slowly reaches maturity, the inclusion of 3D gestures remains a challenge. In order to enable robust and versatile depth-enabled gestures, a depth-image based tracking approach is developed. Using a model-based annealing particle filter approach, the pose of a single subject is retrieved and tracked over longer i...
This paper introduces a new visual tracking technique combining particle filtering and Dynamic Bayesian Networks. The particle filter is utilized to robustly track an object in a video sequence and gain sets of descriptive object features. Dynamic Bayesian Networks use feature sequences to determine different motion patterns. A Graphical Model is i...
Exact 3D tracking of facial feature points is appealing for many applications in human-machine interaction. In this work a 3D Active Shape Model (ASM) that can be shifted, scaled, and rotated is used to track the points. The efficient Gauss-Newton method is applied to estimate the 3D ASM, rotation, translation, and scale parameters. If the head tur...
Current experiments with HCIs have shown a high demand for more natural interaction paradigms. Gestures are thereby considered the most important cue besides speech. In order to recognize gestures it is necessary to extract meaningful motion features from the body. Up to now mostly marker based tracking systems are used in virtual reality environme...
This paper introduces our research platform for enabling a multimodal Human-Robot Interaction scenario as well as our research vision: approaching problems in a holistic way to realize this scenario. However, in this paper the main focus is laid on the image processing domain, where our vision has been realized by combining particle tracking and Dy...
CCTV systems are omnipresent in daily life and are considered as a widely accepted tool
to provide a high level of security in public and private places. The past has unfortunately
shown that such systems are not capable to prevent crimes, as the video streams have to
be analyzed on the
y, in order to react in time instead of utilizing the recorde...
Reliable tracking of objects is an inevitable prerequisite for automated video surveillance systems. As most object detection methods, which are based on machine learning, require adequate data for the application scenario, foreground segmentation is a popular method to find possible regions of interest. These usually require a specific learning ph...
Video surveillance systems have been introduced in various fields of our daily life to enhance security and protect individuals and sensitive infrastructure. Up to now it has been usually utilized as a forensic tool for after the fact investigations and are commonly monitored by human operators. In order to assist these and to be able to react in t...
Non-rigid registration of 3D facial surfaces is a crucial step in a variety of applications. Outliers, i.e., features in a facial surface that are not present in the reference face, often perturb the registration process. In this paper, we present a novel method which registers facial surfaces reliably also in the presence of huge outlier regions....
Video surveillance systems have been introduced in various fields of our daily life to enhance security and protect individuals and sensitive infrastructure. Up to now they have been usually utilized as a forensic tool for after the fact investigations and are commonly monitored by human operators. A further gain in safety can only be achieved by t...
Accurate 3D tracking of facial feature points from one monocular video sequence is appealing for many applications in human-machine interaction. In this work facial feature points are tracked with a Kanade-Lucas-Tomasi (KLT) feature tracker and the tracking results are linked with a 3D Active Shape Model (ASM). Thus, the efficient Gauss-Newton meth...
In this work we present a multi-modal video editing system for meetings, which uses graphical models for the segmentation and classification of the video modes. The task of video editing is about selecting the camera, that represents the meeting in the best way out of various available cameras. Therefore a new training structure for graphical model...
The present paper describes the construction of a multimodal database, referred to as the PROMETHEUS database, which contains recordings from heterogeneous sensors. The main purpose of this database is the development of a framework for monitoring and interpretation of human behavior in unrestricted environments of both indoor and outdoor type. It...
Recently great interest has been shown in the visual surveillance of public transportation systems. The challenge is the automated analysis of passenger's behaviors with a set of visual low-level features, which can be extracted robustly. On a set of global motion features computed in different parts of the image, here the complete image, the face...
Automatic labeling of chords in original audio recordings is challenging due to heavy acoustic overlay by melody and percussion sections, detuning and arpeggios that demand for a measure-grid to assign notes to chords. Further chord labeling benefits from contextual information. In this respect we suggest applying an HMM framework incorporating a m...
In this work semantic features are used to improve the results of the camera selection. These semantic features are group action, person action and person speaking. For this purpose low level acoustic and visual features are combined with high level semantic ones. After the feature fusion, a segmentation and classification are performed by hidden M...
This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modelling. To overcome low recognition performance due to high noise levels during operations, the vocabulary was chosen to be highly limited and multiple noise reduction method...
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems desirable to automate video surveillance systems, in order to be able to detect potentially dangerous situations as soon as possible. Multi camera sys...
Surveillance of drivers, pilots or passengers possesses significant potential for increased security within passenger transport. In an automotive setting the interaction can e.g. be improved by social awareness of an MMI. As further example security marshals can be efficiently positioned guided by according systems. Within this scope the detection...
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems desirable to automate video surveillance systems, in order to be able to detect potentially dangerous situations as soon as possible. Multi camera sys...
Bimodal emotion recognition through audiovisual feature fusion has been shown superior over each individual modality in the past. Still, synchronization of the two streams is a challenge, as many vision approaches work on a frame basis opposing audio turn- or chunk-basis. Therefore, late fusion schemes such as simple logic or voting strategies are...
Recognition of emotion in speech usually uses acoustic models that ignore the spoken content. Likewise one general model per emotion is trained independent of the phonetic structure. Given sufficient data, this approach seemingly works well enough. Yet, this paper tries to answer the question whether acoustic emotion recognition strongly depends on...
While the " 'quasi-state-of-the-art'" towards acoustic emotion recognition relies on multivariate time-series analysis of e.g. pitch, energy, or MFCC by statistical functionals as moments or extrema, only few respect statistical noise by outliers due to too long segments as turns. Such noise can be overcome by hierarchical functionals as means of e...
Bimodal emotion recognition through audiovisual feature fusion has been shown superior over each individual modality in the past. Still, synchronization of the two streams is a challenge, as many vision approaches work on a frame basis opposing audio turn- or chunk-basis. Therefore, late fusion schemes such as simple logic or voting strategies are...
Great interest has been shown in the visual surveillance of public transportation systems. The challenge is the automated analysis of passengers' behaviors with a set of visual low-level features, which can be extracted robustly. On a set of global motion features computed in different parts of the image, here the complete image, the face and skin...
Great interest is recently shown in behavior modeling, especially in public surveillance tasks. In general it is agreed upon the benefits of use of several input cues as audio and video. Yet, synchronization and fusion of these information sources remains the main challenge. We therefore show results for a feature space combination, which allows fo...
Video based analysis of a persons' mood or behavior is in general performed by interpreting various features observed on the body. Facial actions, such as speaking, yawning or laughing are considered as key features. Dynamic changes within the face can be modeled with the well known hidden Markov models (HMM). Unfortunately even within one class ex...
Automatic discrimination of musical signal types as speech, singing, music, genres or drumbeats within audio streams is of great importance, e.g. for radio broadcast stream segmentation. Yet, feature sets are largely discussed. We therefore suggest a large open feature set approach starting with systematical generation of 7k hi-level features based...
In the present treatise, we propose an approach for a highly configurable image based online person behaviour monitoring system. The particular application scenario is a crew supporting multi-stream on-board threat detection system, which is getting more desirable for the use in public trans port. For such frameworks, to work robust in mostly uncon...
Affective computing has grown an important field in today's man-machine-interaction, and the acoustic speech signal is very popular as basis for an automatic classification at the moment. However, recognition performances reported today are mostly not sufficient for a real usage within working systems. Therefore we want to improve on this challenge...
Face recognition is employed in several systems and computer based applications besides security applications. A broad variety of appearance based approaches has been presented and evaluated even on large datasets. However, most of the systems mainly concentrate on images and models that have been taken from the frontal view of the face. Thus they...
Video surveillance is an omnipresent topic when it comes to enhancing security in public places and transportation systems. Fully automated behavior detection systems are desirable when it comes to cutting costs for analysing video and audio streams online. These will initiate an alarm signal autonomously if a possibly dangerous situation is detect...
In the present treatise, we propose an approach for a highly configurable image based online person behaviour monitoring system. The particular application scenario is a crew supporting multi-stream on-board threat detection system, which is getting more desirable for the use in public transport. For such frameworks, to work robustly in mostly unco...
In this work we strive to find an optimal set of acoustic features for the discrimination of speech, monophonic singing, and polyphonic music to robustly segment acoustic media streams for annotation and interaction purposes. Furthermore we introduce ensemble-based classification approaches within this task. From a basis of 276 attributes we select...