A preview of this full-text is provided by Optica Publishing Group.
Content available from Optica
This content is subject to copyright. Terms and conditions apply.
Nonlinear Fourier transform for optical data
processing and transmission: advances and
perspectives
SERGEI K. TURITSYN,1,2,*JAROSLAW E. PRILEPSKY,1SON THAI LE,3SANDER WAHLS,4LEONID L. FRUMIN,2,5
MORTEZA KAMALIAN,1AND STANISLAV A. DEREVYANKO6
1Aston Institute of Photonic Technologies, Aston University, Birmingham B4 7ET, UK
2Novosibirsk State University, Novosibirsk 630090, Russia
3Nokia Bell Labs, Stuttgart, Germany
4Delft Center for Systems and Control, Delft University of Technology, 2628 CD, Delft, The Netherlands
5Institute of Automation and Electrometry, Siberian Branch, Russian Academy of Sciences, Novosibirsk 630090, Russia
6Department of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
*Corresponding author: s.k.turitsyn@aston.ac.uk
Received 14 November 2016; revised 13 January 2017; accepted 15 January 2017 (Doc. ID 280880); published 28 February 2017
Fiber-optic communication systems are nowadays facing serious challenges due to the fast growing demand on capacity
from various new applications and services. It is now well recognized that nonlinear effects limit the spectral efficiency
and transmission reach of modern fiber-optic communications. Nonlinearity compensation is therefore widely believed
to be of paramount importance for increasing the capacity of future optical networks. Recently, there has been steadily
growing interest in the application of a powerful mathematical tool—the nonlinear Fourier transform (NFT)—in the
development of fundamentally novel nonlinearity mitigation tools for fiber-optic channels. It has been recognized that,
within this paradigm, the nonlinear crosstalk due to the Kerr effect is effectively absent, and fiber nonlinearity due to the
Kerr effect can enter as a constructive element rather than a degrading factor. The novelty and the mathematical com-
plexity of the NFT, the versatility of the proposed system designs, and the lack of a unified vision of an optimal NFT-type
communication system, however, constitute significant difficulties for communication researchers. In this paper, we
therefore survey the existing approaches in a common framework and review the progress in this area with a focus on
practical implementation aspects. First, an overview of existing key algorithms for the efficacious computation of the
direct and inverse NFT is given, and the issues of accuracy and numerical complexity are elucidated. We then describe
different approaches for the utilization of the NFT in practical transmission schemes. After that we discuss the
differences, advantages, and challenges of various recently emerged system designs employing the NFT, as well as
the spectral efficiency estimates available up-to-date. With many practical implementation aspects still being open,
our mini-review is aimed at helping researchers assess the perspectives, understand the bottlenecks, and envision
the development paths in the upcoming NFT-based transmission technologies. © 2017 Optical Society of America
OCIS codes: (060.1660) Coherent communications; (060.2330) Fiber optics communications; (070.4340) Nonlinear optical signal
processing; (290.3200) Inverse scattering.
https://doi.org/10.1364/OPTICA.4.000307
1. INTRODUCTION
The exponential surge in global data traffic driven by the skyrock-
eting proliferation of different bandwidth-hungry online services,
such as cloud computing, on-demand HD video streams, and on-
line business analytics, brings about escalating pressure on the
speed (capacity) and quality (bit error rate) characteristics of in-
formation flows interconnecting individual network participants
[1–5]. Optical fiber systems are the backbone of the global tele-
communication networks. It is hard to overstate the impact that
fiber communications have made on the economy, public ser-
vices, society, and almost all aspects of our lives. It is also well
recognized [3–12] that rapidly increasing data rates in the core
fiber communication systems are quickly approaching the limits
of current transmission technologies, many of which were origi-
nally developed for linear (e.g., radio) communication [13,14].
Optical fiber channels are very different from wireless and
other traditional linear channels. The main order effect here is
the signal attenuation due to fiber loss that is compensated by
optical amplifiers, e.g., erbium-doped amplifiers (EDFAs) or dis-
tributed Raman amplification (DRA) [1]. Optical amplification
adds amplified spontaneous emission (ASE) noise that mixes with
the signal during the transmission. In general, optical noise
2334-2536/17/030307-16 Journal © 2017 Optical Society of America
Review Article Vol. 4, No. 3 / March 2017 / Optica 307
together with dispersion and nonlinearity are the three key physical
effects having a major impact on signal transmission in optical
fiber links. The successful implementation of the “fifth genera-
tion”of optical transmission systems, operating with coherent de-
tection, wavelength division multiplexing (WDM), advanced
multilevel modulation formats, and digital signal processing
(DSP), has led to the possibility of channel rates exceeding
100 Gb/s [2,5,7]. The key to this breakthrough is the digital mit-
igation of the most important linear transmission impairments,
such as chromatic and polarization mode dispersion [1,2,15].
After the equalization of linear effects, noise and nonlinearity be-
come the principal factors deteriorating the performance of opti-
cal networks. Indeed, the Kerr nonlinear effect at high signal
powers leads to power-dependent nonlinear transmission signal
distortions in the fiber channel. In this sense, fiber nonlinearity
has a detrimental effect on the transmission of information, and,
thereby, serious worldwide efforts are aimed at the suppression or
compensation of nonlinear impairments. It was stressed in [6]
that, in contrast to linear channels [13], the spectral efficiency
of optical fiber WDM networks cannot increase indefinitely
and starts to decay at high signal powers due to the spectral chan-
nel crosstalk imposed by fiber nonlinearity. The nonlinear fiber
effects are behind the infamous “nonlinear capacity limit”prob-
lem [5,7,9,10,16].
In spite of the immense recent progress in optical communi-
cation technologies, the next step in the future systems’design has
appeared to be not so straightforward [17]. Space-division multi-
plexing (SDM) is considered by many engineers as a promising
direction in the evolution of optical transmission systems [18].
However, the SDM technology requires a considerable upgrade
in the infrastructure. The compensation of nonlinearity-induced
effects is a principal research and engineering challenge, and it is
likely to remain so in the future. A plethora of nonlinearity com-
pensation methods have been proposed, including digital back-
propagation (DBP) [19], digital [20] and optical [21,22] phase
conjugation (spectral inversion), and phase-conjugated twin
waves [23], to mention just a few important advances (see reviews
[17,24]). Note that in most of the compensation techniques, the
fiber nonlinearity is treated as an undesirable effect, and the
purpose of all of those methods is just to mitigate or suppress
its impact.
There is, however, an alternative and not yet widely popular
viewpoint: since fiber channels are inherently nonlinear, rather
than treating nonlinearity as a completely destructive feature,
it can be considered as an essential element in the design of fiber
communication systems. There is growing evidence of the neces-
sity of a novel paradigm and radically new approaches to coding,
transmission, and processing of information, which would take
into account the nonlinear properties of the optical fiber. In this
work, we describe one such recently resurrected approach: the
nonlinear Fourier transform (NFT). The NFT-based transmis-
sion method belongs to a conceptually different bevy of tech-
niques compared to those mentioned above [25]: here the
nonlinearity enters as an undetachable element of the processing
and transmission, defining the features of the system architecture
and its characteristics. The application of such paradigm-shifting
nonlinear methods means that some common “linear”method-
ology may need to be reconsidered or appended with a new mean-
ing. For instance, in addition to the usual notions of frequency,
spectral power, and bandwidth, one has to work with their
nonlinear analogues that can be drastically non-conventional,
but can serve as new well-defined and adjustable characteristics
of the optical signal in nonlinear systems. It will be convenient
further to distinguish between signal characteristics in the stan-
dard frequency domain and those in the so-called nonlinear
spectral domain. Note also that for the sake of clarity, within this
review we address only the single-mode and single-polarization
fiber transmission model, leaving aside the polarization degree
of freedom and specific peculiarities of multimode systems [5].
We would like to stress that the beauty of the mathematical
theory presented here is inevitably spoiled by the limits of
applicability of the master model—the integrable nonlinear
Schrödinger equation (NLSE)—for the description of signal
transmission in fiber links. The application of the NFT methods
is limited by deviations of the optical signal dynamics from the
NLSE channel model. Apart from the deviations due to periodic
variations of signal power caused by alternation of loss and gain in
practical systems (in that case the NLSE emanates as a leading
approximation within the path-averaged model), various other ef-
fects contribute to perturbations that are not accounted for by the
pure NLSE; e.g., higher-order dispersion [26–28], polarization
effects [27,29,30,15], the Raman effect [27,28,31], and the
acoustic effects (electrostriction) [32] all limit the validity of this
channel model. Consideration of the impact of these effects is
beyond the scope of this survey, which is focused on the NFT
techniques for the NLSE-based channel.
To assist reading of the paper, Supplement 1 contains a list of
acronyms used in our review.
2. PRINCIPLES OF INTEGRABILITY AND NFT
In physics and, notably, in photonics, many important phenom-
ena and the evolution of underlying systems can be modelled by
the NLSE [1,27,28,33–36]. In particular, the NLSE is a principal
master model governing the evolution of the slow-varying optical
field envelope qz;t(zwill further play the role of the distance
along the fiber while tis the time variable) along a single-mode
fiber,
i∂q
∂z1
2
∂2q
∂t2jqj2q0:(1)
Note that this is the NLSE in its normalized form. Here and in
what follows, the upper sign in Eq. (1)(“+“) corresponds to
anomalous fiber dispersion, while the lower one (“−“) refers to
the normal dispersion case. Formally, the NLSE (1) describes
the evolution of light in a lossless optical fiber under the effects
of dispersion and Kerr nonlinearity. Albeit all real fibers, certainly,
have losses, this model appears as a result of averaging over peri-
odic gain and loss variation, leading to effectively conservative sig-
nal evolution [27,28,33,34]. Close to ideal compensation of losses
along optical fiber is possible in specific schemes of the so-called
ultra-long fiber lasers DRA [37,38]. Such a quasi-lossless
transmission was demonstrated experimentally in [38,39].
The possibility to approximate signal evolution in practical
fiber channels by the NLSE gives a remarkable opportunity to
apply advanced mathematical techniques, developed in the
1970s, to optical communications. It was first shown in the semi-
nal work by Zakharov and Shabat [40] that Eq. (1) belongs to the
class of the so-called integrable nonlinear systems. The mathemati-
cal method, widely known in the physical and mathematical com-
munities as the inverse scattering transform (IST), can be applied
Review Article Vol. 4, No. 3 / March 2017 / Optica 308
to find the solution of integrable nonlinear equations. In this con-
text, one can think of integrability as an elegant transform of the
original nonlinear system into the so-called action-angle variables
corresponding to a set of uncoupled trivial evolutionary equa-
tions. Mathematically, this can be treated as an effective lineari-
zation of the nonlinear evolution. There exists a vast amount of
literature where the integrability notion is elucidated in great
detail; see, e.g., [28,35,36,40–44]. The integrability itself implies
a lot of consequences in both mathematical and physical contexts.
For example, the NLSE (1) possesses an infinite number of con-
served quantities: while the conservation of power, momentum,
and Hamiltonian for Eq. (1) is relatively obvious, the rest of the
conserved quantities are nontrivial. As we will show below, a lot of
fruitful ideas based on the conserved quantities have been
successfully implemented.
In a nutshell, NFTs can be used to solve initial value problems
for a special class of nonlinear evolutionary equations. In fiber op-
tics, where the signal evolution occurs along the fiber, the initial
conditions correspond to the time-domain waveform at the trans-
mitter. Similar to conventional Fourier transform (FT), initial con-
ditions of the integrable nonlinear equations (such as the NLSE or
a Manakov system [45], corresponding to the integrable two-com-
ponent NLSE generalization) can be decomposed into (nonlinear)
spectral data. For the NLSE this is done through the solution of a
linear scattering problem known as the Zakharov–Shabat problem
(ZSP) [40,41]. Any solution of the NLSE can be presented as the
evolution of nonlinear spectral data that evolve effectively in a lin-
ear manner. The inverse transform, namely the recovery of the
space-time-domain field distribution from the known nonlinear
modes, is classically implemented through solution of the
Gelfand–Levitan–Marchenko equation (GLME). Altogether, this
means that such a spectral transform can be interpreted as the di-
rect nonlinear analog of the FT [35,40,41]: similarly to the FT
transforming the dispersion of a linear propagation to a phase ro-
tation in frequency space, the NFT recasts both the nonlinearity
and dispersion of the NLSE into a simple decoupled evolution of
nonlinear spectral data inside the NFT domain (see Fig. 1); the
latter plays the role of Fourier spectrum for nonlinear problems.
By performing the NFT of a given profile qz; t, we segregate
two distinct components, the dispersive nonlinear radiation and
the non-dispersive solitons, although either of these two can be
absent for some specific profiles. For normal dispersion, the in-
puts localized in time cannot nucleate solitons. For the dispersive
part of the nonlinear spectrum, the NLSE evolution produces ex-
actly the linear phase rotation of spectral components as we have
for linear systems. For the anomalous dispersion, the solitons, as-
sociated with the complex “nonlinear frequencies”(eigenvalues),
in addition to the rotation of soliton phases, can involve either the
motion as a whole or a more nontrivial beating dynamics of
bound states—the so-called multisoliton breathers [28,33],
although inside the NFT domain the solitonic degrees of freedom
remain decoupled. Note that NFT methods are much richer, more
flexible, and more versatile with respect to the system design and
performance compared to just soliton-based techniques, studied
previously in much detail [27,28,33]. In the NFT methods deal-
ing with the discrete part of the nonlinear spectrum (solitonic
eigenvalues), the information carriers are not the fundamental sol-
itons themselves but the NFT parameters (nonlinear spectral data)
attributed to a multisolution pulse. In this sense, the traditional
soliton-based transmission emerges as the simplest (and not nec-
essarily optimal) subclass of the NFT methods. The NFT com-
munications are, to some extent, the extension of not only the
soliton-based approach but also the coherent communication idea
itself: while for the latter both the signal’s amplitude and phase are
used for modulation, the NFT approach goes further and employs
the nonlinear characteristics of the signal.
By using the NFT all effects caused by the fiber Kerr nonlin-
earity can be described as a trivial change of the phase in the non-
linear spectrum. This paves the way to fundamentally novel
nonlinear techniques for compensation of the effects of chromatic
dispersion and fiber nonlinearity. In 1993 Hasegawa and Nyu
[42] (see also Chapter 4 of [28]) proposed the truly innovative
idea of eigenvalue communications based on the exploitation of
discrete eigenvalues (corresponding to solitons) emerging in
the NFT signal decomposition to encode and transmit informa-
tion [40–42]. This approach potentially solves the problem of
nonlinear crosstalk that is one of the major challenges in optical
WDM networks. The concept of Hasegawa and Nyu has recently
been resurrected with various modifications and further exten-
sions [25,44,46–79], including the new direction employing
the modulation of continuous nonlinear spectrum [51–
60,62,63,79], first experiments using transmission and processing
of discrete eigenvalues [64–67] and continuous spectrum [58,59],
NFT-based DBP [50,75], and, most recently, polarization divi-
sion multiplexing with the NFT [74]. The transition from the
space-time domain into the nonlinear spectral domain and back
is achieved by performing the NFT operations. Generally, there
exists the straightforward interrelation not only in the ideology of
FT and NFT methods but also between the linear and NFT spec-
tra [35]: in the low-power limit one can prove the asymptotic
equivalence of the linear FT and NFT [41]. However, in spite
of the similarity, the explicit form of the NFT operations is much
more mathematically involved as compared to the simple profile
convolution with exponentials for the usual FT. Thus, the com-
plexity of the NFT operations and the “change of notions”often
bring about difficulties for the communication engineers. The
purpose of our survey is to demonstrate without going deep into
mathematical details how the NFT method and various integra-
bility features can be employed for the sake of efficient optical
transmission, also summing up the existing numerical tools that
can be employed for the computation of the NFT.
This paper is organized as follows. First, in Section 3,we
describe the NLSE model for realistic optical fiber systems. In
Fig. 1. Exemplary NF spectrum (anomalous dispersion case), contain-
ing solitons (discrete eigenvalues) and continuous nonlinear spectrum
(depicted on the real axis ξ).
Review Article Vol. 4, No. 3 / March 2017 / Optica 309
Section 4we introduce minimally required notations for the NFT
operations, including the periodic NFT variant. Then, in
Section 5, we overview the existing numerical methods for the
calculation of the direct and inverse NFT operations.
Subsection 5.C is focused on fast NFT algorithms. Then, in
Section 6, we directly address different NFT-based transmission
methods, also presenting some new results and generalizations.
After that, in Section 7, we overview recent results with regard
to the efficiency of NFT-based optical transmission mehods.
The paper ends with Section 8, the conclusion, where we also
outline some NFT perspectives and further development
directions.
3. GENERALIZED NLSE MODEL OF OPTICAL
FIBER
The principal master model for the electrical field qz; tevolu-
tion inside a single-mode optical fiber with the account of am-
plification can be written as a generalized NLSE (GNLSE)
[1,27,28]:
i∂q
∂z
−
β2
2
∂2q
∂t2γjqj2qigzqηz; t;(2)
where zis the distance (in kilometers) along the fiber, and tis the
time (in picosconds) in the frame co-moving with the velocity of
the envelope. The parameter β2(in ps2∕km) is the characteristic
of chromatic dispersion that is negative for the anomalous
dispersion (the most important practical case) or β2>0for
the normal dispersion (jβ2jcan vary from 5ps
2∕km to
60 ps2∕km at the typical operating wavelength of 1550 nm);
further, for a standard single-mode fiber we assume β2
−22 ps2∕km.γis the nonlinear Kerr coefficient, typically γ
1.27 W−1km−1. The function gzcharacterizes thegain–loss
profile of a particular amplification scheme. For the quasi-lossless
DRA scheme, the function gz≡0[38,39], resulting in the
lossless NLSE perturbed by an additive white Gaussian noise
(AWGN) term ηz; t(having zero mean). The latter is
completely characterized by the ASE spectral power density D:
Eηz;t¯
ηz0;t0 2Dδt−t0δz−z0;(3)
where the overbar means the complex conjugate, E·is the ex-
pectation value, and δ·is the Dirac delta-function. In the case of
ideal DRA we have 2Dhν0KTα, where αis the fiber loss co-
efficient, typically α≈0.2 dB∕km at the carrying wavelength
λ01.55 μm;KTis the temperature-dependent factor (related
to the phonon-occupancy factor) that characterizes the Raman
pump providing the distributed gain; KTis typically in the range
from 1.1 to 1.2; and ν0is the carrying frequency of the signal
corresponding to λ0:ν0193.55 THz. Taking these typical
values of parameters, one estimates the order of characteristic
noise intensity per complex signal component (polarization),
per unit of propagation length and per unit of bandwidth, to
be D∼10−21 J∕km; for KT1.13 we have: D≈3.3 ·10−21 J∕
km. Such an idealized form of optical channel [the lossless
integrable NLSE (1) weakly perturbed by AWGN] suits NFT
applications [52] best as it is close to the integrable NLSE (1).
However, the NFT method can still be successfully applied to
the EDFA (lumped) [53] or non-ideal DRA [54,55] cases. For the
EDFA we have gz−α∕2in between point-action (lumped)
amplifiers, but the signal is boosted to the initial power level after
each span of length Za. For the DRA scheme we have a more
complicated non-flat profile of gz, e.g., that corresponding
to the open-cavity random distributed feedback laser-based am-
plification as it provides the best performance among various
other Raman amplification schemes [80], where the gain profile
recurs periodically after each span of the length Za. Now, by using
the path-averaged approach [27,28,34,53–55] one can introduce
the new field variable as ˜qz;tqz; tG1∕2z, where
Gzexp2RZa
0gzdz, and this substitution recasts Eq. (2)
into the lossless NLSE for ˜qz; twith the z-dependent
factor Gznear the nonlinear term. In the leading order with
respect to Za∕Zd, with Zdbeing the dispersion length
[ZdW2jβ2j−1, where Wis the signal’s bandwidth], the dis-
tance-dependent nonlinearity coefficient can be approximated
with the averaged value ˜γγZ−1
aRZa
0Gzdz, such that we ar-
rive at the lossless path-averaged (LPA) NLSE written for ˜qz;t
with constant coefficients with ˜γin place of the original γfrom
Eq. (2); for EDFA system ˜γγGa−1∕lnGawith Ga
exp−αZa. In general, the applicability limits of the LPA
NLSE model depend on the signal power, bandwidth, and trans-
mission distance. The accuracy of the LPA NLSE for optical links
with EDFA was investigated in [53] for a link distance of
2000 km, signal powers up to 8 dBm, and bandwidths up to
80 GHz. It was found that the LPA NLSE model can be applied
with a normalized mean square error below −20 dBm when the
signal power is below 3 dBm, almost independently of the signal
bandwidth. The LPA model was found to work under more re-
laxed requirements with non-ideal RDA [55] as this amplification
scheme provides a lower gain variation along the link, depending
on the specific RDA scheme. The applicability limits of this
model for the EDFA case with regards to NFT applications were
presented in [53], and for the RDA scheme in [55]. The noise
term is assumed to possess the same properties as we have for
the ideal RDA case; i.e., it is the circular AWGN with only a dif-
ferent expression for the intensity
˜
D∼10−21 J∕km. For the
EDFA system 2
˜
Dnsphν0G−1
a−1∕Za, where nsp ≈1is the
spontaneous emission factor [33].
Having recast our GNLSE (2) to the approximate LPA NLSE
form with the distance-independent coefficients or using the ideal
RDA model, we introduce the normalizations
t∕Ts→t; z∕Zs→z; q∕ffiffiffiffiffi
P0
p→q; (4)
with P0γZs−1(or the same with the ˜γfor EDFA or non-ideal
DRA and resulting LPA NLSE), ZsT2
s∕jβ2j, and we finally
have the standard NLSE model, Eq. (1), but with the AWGN
term in the r.h.s. In Eq. (4) any of three parameters, Ts,Zs,
or P0, can be taken for the normalization, but then the remaining
two have to be properly adjusted: Tscan be, e.g., the extent of our
symbol, or setting it to be the reciprocal bandwidths, TsW−1,
our normalized distance unit becomes the dispersive length men-
tioned above; in soliton-related problems Tsis often set as an
individual soliton full width at half maximum (FWHM). The
noise intensity has to be normalized in accordance with
Eq. (4): DZsP0Ts−1→D. We also omit tildes in ˜q,˜γ, and
˜
Dfurther, assuming that Eq. (1) refers to a simplified description
pertaining to a particular amplification scheme. Of course, the
results for the NFT application for non-ideal DRA or EDFA
schemes are expected to show slightly worse performance as com-
pared to the ideal DRA [53–55], though the higher-order correc-
tions with respect to Za∕Zdmay also be taken into account by
using, e.g., the guiding center approximation [28].
Review Article Vol. 4, No. 3 / March 2017 / Optica 310
4. EXPLICIT FORM OF NFT OPERATIONS
In this section, the direct (forward) and inverse NFT (INFT) are
introduced. The NFT considers the signal qz; tat a fixed loca-
tion zz0and returns the corresponding NFT spectrum. The
INFT reverses this process; i.e., given a NFT spectrum it returns
the corresponding signal qz0;t. Since only the main features can
be outlined here, the reader is referred to [28,35,40–47] for fur-
ther details. The section ends with some properties of the periodic
NFT (PNFT).
A. Direct NFT
The direct NFT is computed from specific (auxiliary) solutions
v1;2t;ζv1;2t;ζ;z0to the ZSP [40]
dv1
dt qz0;tv2−iζv1;dv2
dt ¯qz0;tv1iζv2(5)
for different values of the complex parameter ζξiη, which
will play the role of a nonlinear analog of frequency. The signal
qz;tacts as a potential. The upper and lower signs correspond
to the anomalous and normal dispersion according to Eq. (1).
Under the assumption that qz0;tdecays at least exponentially
for t→∞, specific solutions (the so-called Jost functions)
ϕ1;2t;ζand ψ1;2t;ζto the ZSP can be obtained from the
boundary conditions:
ϕ1t;ζe−iζto1;ϕ2t;ζo1for t→−∞;
(6)
ψ1t;ζo1;ψ2t;ζeiζto1for t→∞:
(7)
In practical realization of the transmission schemes, the pulse qt
is truncated and we operate in the so-called burst mode [52]; see
Fig. 2. The above pairs of functions solve the ZSP, and all these
different solutions are linearly dependent:
ϕ1ϕ2aζ ˜
ψ1
˜
ψ2bζ ψ1ψ2;(8)
˜
ϕ1
˜
ϕ2−˜aζ ψ1ψ2
˜
bζ ˜
ψ1
˜
ψ2:(9)
The functions aζand bζare known as the Jost scattering
coefficients. They serve as the basis on which the NFT spectrum
is defined. Due to the boundary conditions, we have
aζ lim
t→∞
ϕ1t;ζeiζt;bζ lim
t→∞
ϕ2t;ζe−iζt:(10)
Another important property of the Jost scattering coefficients is
that they satisfy jaξj2jbξj21for all real ξ, where the
upper and lower signs refer to those in Eqs. (1) and (5).
The NFT spectrum of the signal qz0;tconsists of two parts.
The first part is given by either the left or the right reflection
coefficient (RC), respectively:
lξ¯
bξ∕aξ;rξbξ∕aξ;ξ∈R:(11)
The second part of the NFT spectrum consists of the discrete
eigenvalues ζnξniηn, which are the eigenvalues of the
ZSP with a positive imaginary part η>0, and their associated
left or right norming constants (also often referred to as spectral
amplitudes), which are defined by the residue of lζ(or rζ)at
the point ζn:
lnbζna0ζn−1;r
nbζn∕a0ζn;(12)
where the prime designates the derivative with respect to ζ.We
therefore have four real parameters defining each solitary degree of
freedom. The complete (left or right) NFT spectrum of the signal
qz0;tis given by
Σlflξ;ζn;lnN
n1g;Σrfrξ;ζn;rnN
n1g;(13)
where Nis the total number of solitons in the signal; an exem-
plary NF spectrum is shown in Fig. 1. The NF spectrum char-
acterizes the signal qz0;tcompletely and can be used to recover
the corresponding time-domain signal given that it vanishes suf-
ficiently fast for jtj→∞. Note that in the normal dispersion
case, the signal cannot have solitonic components and either lξ
or rξis sufficient to uniquely recover the corresponding profile
qz0;t. The zdependence of the NF spectrum, Σl;rz, is given
by the following expressions. The eigenvalues ζnare independent
on z. For the remaining quantities, we have
lξ;zlξ;z0e−2iξ2z−z0;l
nξ;zlnξ;z0e−2iζ2
nz−z0;
rξ;zrξ;z0e2iξ2z−z0;r
nξ;zrnξ;z0e2iζ2
nz−z0:
(14)
Finally, we remark that the solitons disappear and the NFT re-
duces to conventional FT when the signal power becomes small.
Any rescaled signal qϵtϵqtsatisfies [35,41]
ϵ−1¯rϵξ;ϵ−1lϵξ→−qωjω−2ξwhen ϵ→0;(15)
where qωR∞
−∞qte−iωtdt. Also note that, in optics, the ZSP
(5) also appears widely in the field of Bragg grating synthesis
[1,81–83], where the functions v1;2play the role of slowly varying
coupled mode amplitudes: the anomalous dispersion [the upper
sign in Eqs. (5) and (16)] corresponds to the coupling of co-
propagating waves, while the normal dispersion (the lower sign)
refers to counter-propagating modes.
B. INFT Operation (Left Set of Scattering Data)
The INFT maps the scattering data Σl;r onto the field qt. This is
classically achieved via the GLME for the unknown functions
K1;2t;t0[28,35,40,41,44]. The GLME, written in terms of
the left scattering data, reads
¯
K1τ;τ0Zτ
−∞
dyLτ0yK2τ;y0;
¯
K2τ;τ0Lττ0Zτ
−∞
dyLτ0yK1τ;y0
(16)
for τ>τ0, where the upper and lower signs correspond to upper
and lower ones in Eqs. (1) and (5). In the realistic applications,
where the operations are performed on a finite interval of τ, say
0<τ<T, we have a finite region for the change of τ0,τ0<jτj.
For the anomalous dispersion [the sign “−“in Eqs. (16)] the
quantity Lτcan contain contributions from both solitonic
(discrete) and radiation (continuous) spectrum parts, Lτ
LsolτLrad τ, where
Lsolτ−iX
n
lne−iζnτ;L
radτ 1
2πZ∞
−∞
dξlξe−iξτ
(17)
and we have assumed that all discrete eigenvalues have a multi-
plicity one. The “nonlinear time”variable τis thus Fourier
Review Article Vol. 4, No. 3 / March 2017 / Optica 311
conjugated to the “nonlinear frequency”ξ, so that one can start
not from the ξdomain but immediately from the functions given
by Eq. (17) in the τdomain. In this paper, we have chosen to
work with the left reflection coefficient, l, corresponding to
the GLME inversion around −∞. The reason for such choice
is that (as we shall see below) this is a common convention in
the fiber Bragg grating reconstruction problems from which
we borrow most of our INFT numerical algorithms. Having
solved the GLME (16) for K1;2τ;τ0, the sought solution in
the space-time domain is recovered as qt−2¯
K2t;t. For
the soliton-free case we have Lsolτ0, and the only quantity
participating in Eq. (16) is the FT of RC lξ:Lτ≡Lradτ.
When one is interested in the solution qz0;tat some distance
zz0, the quantity lξin Eq. (17) is replaced with lz0;ξ.
The resulting solution of the GLME (16) becomes a function
of z0:K1;2z0;τ;τ0.
C. Periodic NFT
The usual NFT operations assume that the optical signal, qz0;t,
decays as t→∞. So the ordinary NFT assumes that we have a
burst-mode transmission, Fig. 2; i.e., at each zz0the signal
duration and the processing region coincide. However, in com-
munication applications it is often more convenient to work with
periodic signals for the processing of a data stream: the periodicity
assumption in our notations is expressed as qz0;tqz0;t
Tpfor the period Tp. Thus, the PNFT may be considered as a
natural choice for the replacement of linear (say, FFT-based)
processing elements. The PNFT was recently introduced within
the circle of available solutions for the nonlinear signal processing
in [84,85]. Basically, the PNFT offers the same possibilities for
the communication system design and concepts (with the use
of the periodically continued signals) as the NFT does for the
vanishing signals by adding a cyclic prefix extension instead of
zero-padded wings for ordinary NFT (Fig. 2). Together with this,
the periodicity assumption can bring about some other benefits:
(i) Only a finite part of a periodic signal (one period) represents
the whole signal, so we do not have to process the entire interval
accounting for the dispersion-induced memory, as it occurs for
the ordinary NFT; see Fig. 2. Because of this, one can have
a considerable processing speed-up when using PNFT.
(ii) When using an ordinary NFT, in particular, within nonlinear
synthesis [51–53], it is difficult to control the time duration of
the resulting wave-shapes. Using the PNFT, where signals have a
finite “meaningful”time duration (the PNFT period), we can
attain more control over the time-domain profiles. (iii) For the
PNFT, the encoding schemes can be, to some extent, based
on the encoders of currently used communication systems, as
the PNFT shares a cyclic-prefix profile extension idea.
(iv) Producing periodic solutions of the NLSE could be generally
done using Riemann theta functions that can be seen as the multi-
dimensional generalization of the FT, such that some properties
of linear modulation can still be kept within the PNFT paradigm.
(v) By using periodically extended signals we can have a continu-
ous stream of data without sudden droppings of power, thus
reducing the peak-to-average power ratio (PAPR), in contrast
to the burst mode with the ordinary NFT usage. For the sake
of completeness we briefly describe below some basic elements
of PNFT (see [85–89] for examples of such a communication
system).
1. Direct PNFT
Similarly to the case of ordinary NFT, in the periodic problem we
have two parts of nonlinear spectrum associated with a general
periodic time-domain profile: the constant main spectrum, which
serves as an analog of soliton eigenvalues, and the dynamical aux-
iliary spectrum. In contrast to the ordinary NFT, here both parts
of the spectrum consist of discrete points and there is no continu-
ous component. To define the scattering data we now have to deal
with the solutions (the so-called Bloch solutions) of ZSP (5) with
a periodic potential, qz0;tqz0;t Tp, subject to condi-
tions φt0;t0;ζ1;0Tand ˜
φt0;t0;ζ0;1T, where t0is
an arbitrary base point. The so-called 2×2fundamental matrix is
defined through the Bloch ZSP solutions as Φt; t0;ζ
φt;t0;ζ;˜
φt;t0;ζ. Evaluating this fundamental matrix at
tt0Tp, one gets the monodromy matrix, Mt0;ζ
Φt0Tp;t0;ζ. The monodromy matrix plays a crucial role
in the Floquet theory, which deals with differential systems with
periodic structure. At the endpoints of stable bands, the Bloch
solutions are (anti-)periodic and the values of parameter ζ
corresponding to these endpoints, i.e. the main spectrum, M, can
be defined through the Floquet discriminant Δζ
1∕2Tr Mt0;ζas [84]
Mζm
Δζm1;dΔ
dζ
ζζm
≠0:(18)
Ag-band (g-gap) periodic solution of NLSE is the solution in
which there are only 2gelements in M[90,91]. The important
property of the main spectrum is that it remains invariant during
the pulse evolution along the zdirection. The definition of the
auxiliary spectrum, μiz; t, is given in Supplement 1.
2. Inverse PNFT
The inverse PNFT is the procedure for getting the (periodic in
time) profile qz;tstarting from given main and (evolved) aux-
iliary nonlinear spectrum parts. There are several methods to
construct finite-gap periodic solutions of the NLSE; see [84].
One can use the theta-function representation [92,93]
qz;tq0;0ΘW−jτ
ΘWjτeik0z−iω0t;(19)
where k0and ω0are some constants obtained from the nonlinear
spectrum, and the Riemann theta function, ΘWjτ, is defined
as [93]
Fig. 2. Burst mode for the window in vanishing signal processing
(ordinary NFT) and the processing window for the periodic signal with
cyclic extension (PNFT).
Review Article Vol. 4, No. 3 / March 2017 / Optica 312
ΘWjτ X
m∈Zg
exp2πimTWπimTτm:(20)
Here mis a g-dimensional vector with integer elements, and
Wπkzωtδ∕2is a vector calculated from the non-
linear spectrum. The set fk;ω;δ;τgis called the Riemann spec-
trum, and τis the Riemann (period) matrix [93]; their particular
values can be, again, obtained from the full set of nonlinear spec-
tral data. Thus, within the representation [Eq. (19)] the inverse
PNFT procedure can be reformulated as the problem of finding
the Riemann spectrum from the given nonlinear spectrum.
Although there is still currently a lack of a generic approach for
how to deal with the inverse PNFT, there are several software
packages allowing one to construct the profile in the time domain
using the periodic spectral data. For finding the Riemann spec-
trum, there are some packages and codes embedded in Maple,
Sage, and Mathematica [94–96]. For the second stage, which
is to construct the Riemann theta functions (20) using the
Riemann spectrum, in addition to the symbolic implementations
[97,98], some “hyper-fast”algorithms for the numerical
reconstruction of special classes of signals were proposed [93].
5. NUMERICAL ALGORITHMS FOR THE NFT
In this section, we overview existing numerical methods for the
forward (5) and inverse (16) NFT, paying particular attention for
the methods that have already been tested for the transmission
purposes. The goal of the forward NFT is to calculate the non-
linear spectrum Σz0(13) from the given space-time-domain
profile qz0;t. The INFT method must provide the time-
domain waveform starting from given Σ.
The signal qz0;tis in practice only known at the specific
points in time due to sampling operations, which means that
for the forward NFT the nonlinear spectrum has to be approxi-
mated based on the samples
qmqz0;T1εm−1;m1;…;M;
where T1is close enough to −∞such that the boundary condi-
tion in Eq. (6) is approximately satisfied for tT1and T2>T1
is sufficiently close to ∞such that Eq. (10) is approximately
satisfied, respectively. The parameter εT2−T1∕M−1
denotes the sampling interval.
The methods are classified according to how their numerical
complexity (in terms of floating point operations, flops) and the
accuracy of the result change as the number of sample points M
increases.
A. Algorithms for Direct NFT
Numerous algorithms for computing the NFT have been de-
scribed in the literature. The two most well-known are probably
the methods of Ablowitz–Ladik (AL) and Boffetta–Osborne
(BO). We will first describe these two methods and then briefly
list other approaches. More details can be found in [46,84,99].
We, however, note that our review does not, of course, cover all
existing possibilities for the NFT operations implementation
(e.g., in recent work [100] a bi-direction algorithm for the calcu-
lation of soliton norming constants was described), and we rather
concentrate on the methods that have already found their way
into optical transmission studies, although there are some new
methods that have yet to be tested; see, e.g., [101].
1. Boffetta–Osborne Transfer Method
The general idea is to approximate the Jost scattering functions
aζand bζusing Eq. (10). Therefore, Boffetta and Osborne
[102] assumed that the signal qz0;tis piecewise constant,
i.e., qz0;tqmconst:for t∈T1m−0.5ε;T1
m0.5ε, and solved the ZSP [Eq. (5)] in closed form under
that assumption. For each interval T1m−0.5ε;T1
m0.5ε, one has
ϕ1T1m0.5ε;ζ
ϕ2T1m0.5ε;ζTmϕ1T1m−0.5ε;ζ
ϕ2T1m−0.5ε;ζ;
(21)
where Tmζexpmε−iζqm
¯qmiζ:(22)
Here, expm·denotes the matrix exponential. Taking the boun-
dary conditions (6) and (10) into account, one finds that
aζ
bζ≈diageiζT20.5ε;e
−iζT20.5εTMζ×…
×T1ζe−iζT1−0.5ε:(23)
This approximation can be used straight-away to evaluate the RC
Eq. (11). In order to locate the discrete eigenvalues ζn, Boffetta
and Osborne proposed to apply Newton’s method to aζ. A non-
linear version of Parseval’s relation can be used to check whether
all discrete eigenvalues have been found [102]. The complexity
for evaluating Eq. (23) in a straightforward way is OM. The
total complexity of a search method to find the discrete eigenval-
ues is therefore OkiterNguessesM, where kiter is the average num-
ber of iterations per initial guess and Nguesses is the number of
initial guesses used. The complexity of evaluating the RC
Eq. (11) on a grid of Mnonlinear frequencies is OM2.
The BO method has a second-order approximation accuracy;
i.e., for any fixed ζζ0, the distance between the numerical
approximations of aζ0and bζ0and their true values is of
the order OM−2[102,103]. Note that the hidden constant
in the big-O notation depends on ζ. For the BO method, the
hidden constant was found to be ∼jζj−1for large ζin [103].
The BO method was used in the works [49,51–53] for the cal-
culation of continuous nonlinear spectrum for the nonlinear in-
verse synthesis scheme (see Subsection 5.B below). It also
demonstrated good results in the calculation of the perturbed dy-
namics of solitonic eigenvalues [103–105]. The calculation of
norming constants, requiring a0ζn, is described in [102,103].
In [103], the BO method was compared to the direct fourth-order
Runge–Kutta integration of the ZSP (5), where for the latter
method the hidden constant in the big-Onotation was found
to be ∼jζj4. It was concluded that, generally, the BO method
is more convenient especially when the wide range of ζvalues
is addressed.
2. Ablowitz–Ladik Discretization Method (Normalized)
The AL discretization [106,107] is another method widely used
for the NFT-based transmission [62–67]. It corresponds to the
approximation of the NLSE by a discrete integrable problem.
The method also takes the form in Eq. (23), but with
Tmζ 1
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1ε2jqmj2
pZεqm
ε¯qmZ−1;Zeiζε:(24)
Review Article Vol. 4, No. 3 / March 2017 / Optica 313
For the location of zeros of aζ, one can again apply a search
routine. It was shown [46] that the AL algorithm can produce
some small spurious solitonic eigenvalues, which, however, can
be readily sorted out. In Ref. [56] the AL method was compared
with the BO method for realistic NFT-based transmission param-
eters, and it turned out that the AL method demonstrates slightly
better performance when applied to the calculation of continuous
spectrum (RC). The accuracy of the AL algorithm is of the second
order as for the BO scheme. This was, e.g., shown in [108] for a
variant of the AL algorithm that is commonly used in fiber Bragg
grating design. The relation of the scheme in [108] to the AL
algorithm below is elaborated in [109]. Apparently in contrast
to this, Boffetta and Osborne had observed in [102,110] that
the AL discretization achieves only first-order accuracy, when
the discrete eigenvalues computed by the AL and BO methods
were compared to exact analytical values. The AL discretization
that was investigated in [110], however, was an early version
[106], in which the coordinate transform Z1−iζε was used
instead of the now common transform Zeiζε that was given
later in [107]. In various numerical experiments that were reported
in [46], the errors of the AL and BO schemes decrease at similar
rates w.r.t. M. The same inference was confirmed in the study [56],
related to the true NFT-based transmission profiles.
3. Fourier Collocation Method
The Fourier collocation method has been used by the Osaka
group and co-authors [68–74]. Within this method the ZSP sol-
ution components v1;2are expanded in the Fourier series and the
ZSP itself is reformulated as an eigenvalue problem in the Fourier
space [43,46]. However, this method is inconvenient for the com-
putation of the continuous nonlinear spectrum and soliton norm-
ing constants, and it has been used only for eigenvalue
communication where the solitonic discrete eigenvalues them-
selves are adopted as information carriers. Another drawback
of this method is its numerical complexity: the method requires
the diagonalization of the non-Hermitial (for anomalous
dispersion) matrix, where the number of required flops is OM3.
4. Direct Toeplitz Inner Bordering Method
A new efficacious algorithm for the computation of continuous
nonlinear spectrum using the Toeplitz matrix transformations was
proposed in [111]. In the numerical example considered in [111],
this method outperforms the BO method in terms of speed and
accuracy; it has an error level of OM−2and the number of flops
OM2. This method is based on the reversion of the Toeplitz
matrix-based INFT algorithm; we provide the corresponding
INFT in the next subsection and description of the method in
Supplement 1. However, when dealing with the direct Toeplitz
inner bordering (TIB) method, one has to keep in mind that it
recovers the kernel of the GLME Lτthat in general includes
both discrete (solitonic) and continuous spectral components,
Eq. (17), simultaneously.
B. Numerical Methods for the INFT
The methods for numerical INFT computation were largely stud-
ied within the Bragg gratings’synthesis and characterization. Here,
using the traditional “matrix-inversion”terminology, we name the
INFT methods requiring OM2operation as “fast”and those
with lower complexity as “superfast.”Almost all INFT approaches
are based on the numerical solution of the GLME (16). After the
discretization, one aims at determining functions K1;2τ;τ0on the
grid of M×Mpoints. Note that the straightforward path there
based on the solution of Mnested linear matrix equations takes
OM4flops and is therefore unproductive.
The earlier approaches utilize iterative methods of matrix inver-
sion with the computational complexity in the order of OkiM3,
where ki<Mis a number of iterations. As an example, we men-
tion the group of methods with the GLME kernel parametrization
[112,113]. A similar method was also employed recently for optical
transmission tasks [60]. The main drawbacks of these algorithms
are the problem of choosing an initial approximation and high
computational complexity. A more advanced family of algorithms
is based on the layer peeling (LP) method. This class of methods is
built on the representation of the RC attributed to a particular pro-
file qtthrough the sequence of individual actions of Mpoint
reflectors [81,82,114]. The LP algorithms are comparatively fast
and require about OM2flops. Some of them provide an error
that is globally proportional to M−2. Conventional algorithms
based on LP show numerical instabilities with exponential ampli-
fication of noise when the reflection coefficients (participating in
the LP step) contain noise [115]. This means that there exists no
signal such that forward scattering with Msamples can result in the
desired reflection coefficient [108,116], or, in other words, when
one has independent noisy additions to the scattering data them-
selves. Physically, when some profile corrupted by noise in the
space-time domain was converted into the NF spectrum, and then
this spectrum was used for the superfast LP algorithm considered
further, we observed that the instability in all numerical examples
was absent; see Supplement 1. For the properties of space-time
noise conversion into the NF domain, see Refs. [117–120]. In
some transmission systems this limitation has recently been cir-
cumvented in some first algorithms where the reflection coefficient
is ensured to be realizable by construction [75,109,121]. On the
other hand, this instability can reveal itself when one synthesizes a
profile starting from some randomly encoded spectral data in the
NFD. This question requires further analysis.
The LP with improved accuracy [83], known as an integral LP,
has some issues with the overall efficiency, as it requires many more
arithmetic operations. The drawback of these algorithms is the ac-
cumulation of computational errors during calculation and the re-
sulting decrease in their accuracy when enlarging the qtextent.
Another interesting group of algorithms is based on recasting
the GLME as the system of partial differential equations
[122,123] (see also [124] for the comparison of such algorithms),
including also the “leap-frog”algorithm [125]. The numerical
complexity is OM2flops; however, the error there is only of
the first order, OM−1{[123], Fig. 2(b)}.
In [126] another algorithm was proposed, based on a different
computational approach, whose error was proportional to M−2.
This algorithm is “slow,”requiring OM3flops, and addresses
only the case of normal dispersion. However, the important fea-
ture there is that it introduced the very idea of the bordering pro-
cedure itself. Later, a more efficient algorithm that has a M−2error
and at the same time uses OM2flops was described in
[111,127]. The algorithm exploits the Toeplitz symmetry of dis-
cretized GLME using TIB, similar to technique for common
Toeplitz matrices [128,129]. As the TIB was successfully used
in a number of transmission-related works [51–55], we provide
here more details on the TIB method. First, we change the
variables in Eq. (16)as
Review Article Vol. 4, No. 3 / March 2017 / Optica 314
uτ;τ0K1τ;τ−τ0;wτ;τ0¯
K2τ;τ0−τ:(25)
In new notations, explicitly assuming the finite extent of qt,
0≤t≤T, after the complex conjugation of the first of
GLME, we get
uτ;yZ2τ
y
¯
Lτ0−ywτ;τ0dτ00;
wτ;τ0Zτ0
0
Lτ0−yuτ;ydyLτ00;(26)
0≤y; τ0<2τ,0≤τ≤T. The sought solution in the time do-
main now reads as qt2wt;2t−0. The GLME form (26)
allows one to obtain the Toeplitz-type problem after the discre-
tization and to use the fast Toeplitz matrix-inversion algorithms
[128,129] for the recovery of qt. Further details of the
TIB-based INFT are given in Supplement 1.
At the end we mention a recent work on the INFT methods by
Civelli et al. [130]: the authors introduced yet another INFT first-
order solution algorithm based on iterated convolutions with the
GLME kernel using the FFT, which demonstrated the better per-
formance in terms of accuracy and time consumption than the
first-order TIB [52] and the Nyström conjugate gradient method
[131]. However, the last approach has not been tested so far on
transmission-related problems. Note that NFT can be formulated
in terms of the so-called Riemann–Hilbert problem (see, e.g., [43]
and references therein), and numerical solution of the NFT can
be implemented using this approach [132].
C. Superfast NFT Algorithms
It has recently been observed that the AL method (and others) for
computing the NFT can be significantly sped up, leading to a
superfast NFT analogous to the celebrated FFT [84,99]. We
illustrate how to deal with fast NFTs using AL discretization.
The matrix Tmζin Eq. (24) can be written as
TmζSmZ∕dmZ;Zeiλε;(27)
where SmZand dmZare polynomials with respect to Z:
SmZ 1
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1ε2jqmj2
pZ2εqmZ
−ε¯qmZ1;d
mZZ:
(28)
Consequently, with SZSMZ×…×S1Zand
dZdMZ×…×d1Z, Eq. (23) can be written as
aζ
bζ≈diageiζT20.5ε;e
−iζT20.5εSZ
dZe−iζT1−0.5ε:
(29)
Since the SmZand dmZare polynomials with degrees at most
two, both SZand dZare again polynomials whose degrees are
upper bounded by 2M. The superfast NFT exploits this observa-
tion and proceeds in two steps. First, the monomial expansions of
the polynomials SZand dZhave to be computed. That is, the
unique matrices Skand scalars dkneed to be found such that
SZ X
2M−1
k0
SkZk;dZ X
2M−1
k0
dkZk:(30)
One first needs a fast method to compute the monomial expan-
sions. A naive implementation, e.g., to compute the expansion
of SZ, would proceed as follows:
SZSMZSM−1ZSM−2Z S1Z…:(31)
However, this leads to a OM2or even OM3runtime, depend-
ing on how the product of polynomials is found. In order to get a
superfast NFT algorithm, a divide-and-conquer strategy is used
instead. One starts with the elementary polynomials SmZ,
m0;…;M −1, partitions them into pairs, and computes the
products of these pairs. The products are again partitioned into
pairs, and then multiplied. This process is iterated until only
one product is left, which will be SZ. It turns out that this
algorithm finds the monomial expansion Sk,k0;…;M −1
using only OMlog2Mflops given that polynomial products
are computed with the FFT. The pseudocode for this algorithm
is provided in Supplement 1.
The second step of the superfast NFT now applies algorithms
for fast polynomial arithmetic in order