Temporal Difference Learning Waveform Selection

JCP 09/2010; 5:1394-1401. DOI: 10.1109/CCCM.2009.5267516
Source: DBLP

ABSTRACT The largest difference between cognitive radar and other adaptive radar is the adaptivity of transmitter in cognitive radar. How to optimally decide or select the radar waveform for next transmission based on the observation of past radar returns is one of the important issues. In this paper, with the stochastic dynamic programming model of waveform selection, we use the method of temporal difference learning to solve this problem and realize the adaptivity of waveform selection. The simulation results show that the uncertainty of state estimation using temporal difference learning is less than that using fixed waveform.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In this paper, we present a new nonlinear filter for high-dimensional state estimation, which we have named the cubature Kalman filter (CKF). The heart of the CKF is a spherical-radial cubature rule, which makes it possible to numerically compute multivariate moment integrals encountered in the nonlinear Bayesian filter. Specifically, we derive a third-degree spherical-radial cubature rule that provides a set of cubature points scaling linearly with the state-vector dimension. The CKF may therefore provide a systematic solution for high-dimensional nonlinear filtering problems. The paper also includes the derivation of a square-root version of the CKF for improved numerical stability. The CKF is tested experimentally in two nonlinear state estimation problems. In the first problem, the proposed cubature rule is used to compute the second-order statistics of a nonlinearly transformed Gaussian random variable. The second problem addresses the use of the CKF for tracking a maneuvering aircraft. The results of both experiments demonstrate the improved performance of the CKF over conventional nonlinear filters.
    IEEE Transactions on Automatic Control 07/2009; 54(6-54):1254 - 1269. DOI:10.1109/TAC.2009.2019800 · 3.17 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A key component of a cognitive radar system is the method by which the transmitted waveform is adapted in response to information regarding the radar environment. The goal of such adaptation methods is to provide a flexible framework that can synthesize waveforms that provide different tradeoffs between a variety of performance objectives, and can do so efficiently. In this paper, we propose a waveform design method that efficiently synthesizes waveforms that provide a trade-off between estimation performance for a Gaussian ensemble of targets and detection performance for a specific target. In particular, the method synthesizes (finite length) waveforms that achieve an inherent trade-off between the (Gaussian) mutual information and the signal-to-noise ratio (SNR) for a particular target. In addition, the method can accommodate a variety of constraints on the transmitted spectrum. We show that the waveform design problem can be formulated as a convex optimization problem in the autocorrelation of the waveform, and we develop a customized interior point method for efficiently obtaining a globally optimal waveform.
    Signals, Systems and Computers, 2008 42nd Asilomar Conference on; 11/2008
  • [Show abstract] [Hide abstract]
    ABSTRACT: An adaptive, waveform selective probabilistic data association (WSPDA) algorithm for tracking a single target in clutter is presented. The assumption of an optimal receiver allows the inclusion of transmitted waveform specification parameters in the tracking subsystem equations, leading to a waveform selection scheme where the next transmitted waveform parameters are selected so as to minimize the average total mean-square tracking error at the next time step. Semiclosed form solutions are given to the local (one-step-ahead) adaptive waveform selection problem for the case of one-dimensional target motion. A simple simulation example is given to compare the performance of a tracking system using a WSFDA based tracking filter with that of a conventional system with a fixed waveform shape and probabilistic data association (PDA) tracking filter.
    IEEE Transactions on Aerospace and Electronic Systems 11/1997; 33(4-33):1180 - 1188. DOI:10.1109/7.625110 · 1.39 Impact Factor


Available from