Conference Paper

Generative Adversial Network based Extended Target Detection for Automotive MIMO Radar

Conference Paper

Generative Adversial Network based Extended Target Detection for Automotive MIMO Radar

If you want to read the PDF, try requesting it from the authors.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

... Applying the OS-CFAR algorithm for the peak detection, requires a clustering to group all detections of the same individual target. Therefore, a density-based spatial clustering (DBSCAN) is used [66]. ...
... The design-space of the CNN architecture involves large number of parameters which makes it hard to find the optimum architecture for the given problem definition. Thus, the choice of the architecture design hyper-parameters is mainly inspired from [66], where the authors used a similar architecture for the target detection on sparse radar RD-maps. Both encoder and decoder have a 3-layered convolution layer VOLUME 9, 2021 with a rectified linear unit (ReLu) as the non-linearity function. ...
Article
Full-text available
With the recent advancements in radar systems, radar sensors offer a promising and effective perception of the surrounding. This includes target detection, classification and tracking. Compared to the state-of-the-art, where the state vector of classical tracker considers only localization parameters, this paper proposes an integrated Bayesian framework by augmenting state vector with feature embedding as appearance parameter together with localization parameter. In context of automotive vulnerable road users (VRUs) such as pedestrian and cyclist, the classical tracker poses multiple challenges to preserve the identity of the tracked target during partial or complete occlusion, due to low inter-class (pedestrian-cyclist) variations and strong similarity between intra-class (pedestrian-pedestrian). Subsequently, feature embedding corresponding to target’s micro-Doppler signature are learned using novel Bayesian based deep metric learning approaches. The tracker’s performance is optimized due to a better separability of the targets. At the same time, the classifiers’ performance is enhanced due to Bayesian formulation utilizing the temporal smoothing of the classifier’s embedding vector. In this work, we demonstrate the performance of the proposed Bayesian framework using several vulnerable user targets based on a 77 GHz automotive radar.
Article
Target localization and classification from radar point clouds is a challenging task due to the inherently sparse nature of the data with highly non-uniform target distribution. This work presents HARadNet, a novel attention based anchor free target detection and classification network architecture in a multi-task learning framework for radar point clouds data. A direction field vector is used as motion modality to achieve attention inside the network. The attention operates at different hierarchy of the feature abstraction layer with each point sampled according to a conditional direction field vector, allowing the network to exploit and learn a joint feature representation and correlation to its neighborhood. This leads to a significant improvement in the performance of the classification. Additionally, a parameter-free target localization is proposed using Bayesian sampling conditioned on a pre-trained direction field vector. The extensive evaluation on a public radar dataset shows an substantial increase in localization and classification performance.
Conference Paper
Automatic radar based classification of automotive targets, such as pedestrians and cyclist, poses several challenges due to low inter-class variations among different classes and large intra-class variations. Further, different targets required to track in typical automotive scenario can have completely varying dynamics which gets challenging for tracker using conventional state vectors. Compared to state-of-the-art using independent classification and tracking, in this paper, we propose an integrated tracker and classifier leading to a novel Bayesian framework. The tracker's state vector in the proposed framework not only includes the localization parameters of the targets but is also augmented with the targets's feature embedding vector. In consequence, the tracker's performance is optimized due to a better separability of the targets. Furthermore, the classifier's performance is enhanced due to Bayesian formulation utilizing the temporal smoothing of classifier's embedding vector.
Article
Full-text available
Road extraction from aerial images has been a hot research topic in the field of remote sensing image analysis. In this letter, a semantic segmentation neural network which combines the strengths of residual learning and U-Net is proposed for road area extraction. The network is built with residual units and has similar architecture to that of U-Net. The benefits of this model is two-fold: first, residual units ease training of deep networks. Second, the rich skip connections within the network could facilitate information propagation, allowing us to design networks with fewer parameters however better performance. We test our network on a public road dataset and compare it with U-Net and other two state of the art deep learning based road extraction methods. The proposed approach outperforms all the comparing methods, which demonstrates its superiority over recently developed state of the arts.
Article
Full-text available
Image semantic segmentation is more and more being of interest for computer vision and machine learning researchers. Many applications on the rise need accurate and efficient segmentation mechanisms: autonomous driving, indoor navigation, and even virtual or augmented reality systems to name a few. This demand coincides with the rise of deep learning approaches in almost every field or application target related to computer vision, including semantic segmentation or scene understanding. This paper provides a review on deep learning methods for semantic segmentation applied to various application areas. Firstly, we describe the terminology of this field as well as mandatory background concepts. Next, the main datasets and challenges are exposed to help researchers decide which are the ones that best suit their needs and their targets. Then, existing methods are reviewed, highlighting their contributions and their significance in the field. Finally, quantitative results are given for the described methods and the datasets in which they were evaluated, following up with a discussion of the results. At last, we point out a set of promising future works and draw our own conclusions about the state of the art of semantic segmentation using deep learning techniques.
Article
Full-text available
There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .
Article
Full-text available
Data analysis plays an indispensable role for understanding various phenomena. Cluster analysis, primitive exploration with little or no prior knowledge, consists of research developed across a wide variety of communities. The diversity, on one hand, equips us with many tools. On the other hand, the profusion of options causes confusion. We survey clustering algorithms for data sets appearing in statistics, computer science, and machine learning, and illustrate their applications in some benchmark data sets, the traveling salesman problem, and bioinformatics, a new field attracting intensive efforts. Several tightly related topics, proximity measure, and cluster validation, are also discussed.
Conference Paper
There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .
Conference Paper
In radar the reflected signal is received by the antenna which is amplified, down converted and then the required signal is extracted (video signal). The video signal is then passed through Moving Target Indicator (MTI) processor which suppresses clutter. The post-MTI data is passed through Constant False Alarm Rate (CFAR) processor which qualifies echoes as targets or otherwise. The role of CFAR processor is to determine a threshold, above which any return can be considered to be from target. If this threshold is too low, more targets will be detected at the expense of more false alarms. Conversely, if the threshold is set too high, then fewer targets will be detected but the number of false alarms will be less. The distribution of the clutter can be approximated by certain probability distribution functions, where each medium follows a different probability distribution. We shall investigate important CFAR processing techniques in Gaussian noise and Rayleigh clutter. The threshold is set adaptive, that is the threshold is raised or lowered, to maintain a required Probability of False Alarm (Pfa). This paper discusses various CFAR processing techniques, by applying them to raw video of real radar, analyzing advantages and disadvantages of each technique.
Article
Generative Adversarial Nets [8] were recently introduced as a novel way to train generative models. In this work we introduce the conditional version of generative adversarial nets, which can be constructed by simply feeding the data, y, we wish to condition on to both the generator and discriminator. We show that this model can generate MNIST digits conditioned on class labels. We also illustrate how this model could be used to learn a multi-modal model, and provide preliminary examples of an application to image tagging in which we demonstrate how this approach can generate descriptive tags which are not part of training labels.
Sensor fusion: A comparison of sensing capabilities of human drivers and highly automated vehicles
  • B Schoettle