Cihan Akcay’s research while affiliated with General Atomics and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (16)


A cross section of the DIII-D tokamak with all of the external diagnostics and PF coils that enter magnetic EFIT as least squares constraints. (a) The red line segments represent the 76 magnetic (b) probes, the blue circles the 44 flux ( ψ) loops, and the orange blocks the 18 poloidal field (PF) coils and 6 Ohmic coils. The gray corresponds to the vacuum vessel wall.
Mean (arrows) normalized magnetic diagnostics and coil currents and their 3σ standard deviations (vertical bar) for DIII-D discharge 180087. All 145 input features are normalized by various combinations of the time-varying (toroidal) vacuum magnetic field B0 and the major and (average) minor radii of DIII-D, R0=1.67 m and a=0.6 m (and vacuum permeability μ0 for the coil currents) to bring the scale of the inputs approximately within the [−1,1] range.
Explained variance showing the amount of compressed information in the first 30 PCA components for a 2D image of embedded magnetic inputs.
The first four PCA components of the toroidal current density Jϕ for the dataset used in training EFIT-Prime.
The mean prediction of the poloidal flux ψ by the EFIT-Prime model, given the principal components of the external magnetic measurements and coil currents embedded in a 2D map: Shown are (a) the R2 (blue) and SSIM (orange) distributions of the predicted flux ψ for nearly 1.8×104 test samples, (b)–(d) the overlay of the true flux surfaces (black) against the NN-predicted flux surfaces (red dashed) for three samples with the worst, median, and best R2, (e)–(g) aleatory, and (h)–(j) epistemic uncertainties in the flux prediction for the same three samples.

+8

EFIT-Prime: Probabilistic and physics-constrained reduced-order neural network model for equilibrium reconstruction in DIII-D
  • Article
  • Full-text available

September 2024

·

45 Reads

·

3 Citations

S. Madireddy

·

C. Akçay

·

S. E. Kruger

·

[...]

·

L. L. Lao

We introduce EFIT-Prime, a novel machine learning surrogate model for EFIT (Equilibrium FIT) that integrates probabilistic and physics-informed methodologies to overcome typical limitations associated with deterministic and ad hoc neural network architectures. EFIT-Prime utilizes a neural architecture search-based deep ensemble for robust uncertainty quantification, providing scalable and efficient neural architectures that comprehensively quantify both data and model uncertainties. Physically informed by the Grad–Shafranov equation, EFIT-Prime applies a constraint on the current density Jtor and a smoothness constraint on the first derivative of the poloidal flux, ensuring physically plausible solutions. Furthermore, the spatial location of the diagnostics is explicitly incorporated in the inputs to account for their spatial correlation. Extensive evaluations demonstrate EFIT-Prime's accuracy and robustness across diverse scenarios, most notably showing good generalization on negative-triangularity discharges that were excluded from training. Timing studies indicate an ensemble inference time of 15 ms for predicting a new equilibrium, offering the possibility of plasma control in real-time, if the model is optimized for speed.

Download

Augmenting machine learning of Grad–Shafranov equilibrium reconstruction with Green's functions

August 2024

·

20 Reads

·

2 Citations

This work presents a method for predicting plasma equilibria in tokamak fusion experiments and reactors. The approach involves representing the plasma current as a linear combination of basis functions using principal component analysis of plasma toroidal current densities ( Jt) from the EFIT-AI equilibrium database. Then utilizing EFIT's Green's function tables, basis functions are created for the poloidal flux ( ψ) and diagnostics generated from the toroidal current ( Jt). Similar to the idea of a physics-informed neural network (NN), this physically enforces consistency between ψ, Jt, and the synthetic diagnostics. First, the predictive capability of a least squares technique to minimize the error on the synthetic diagnostics is employed. The results show that the method achieves high accuracy in predicting ψ and moderate accuracy in predicting Jt with median R2 = 0.9993 and R2 = 0.978, respectively. A comprehensive NN using a network architecture search is also employed to predict the coefficients of the basis functions. The NN demonstrates significantly better performance compared to the least squares method with median R2 = 0.9997 and 0.9916 for Jt and ψ, respectively. The robustness of the method is evaluated by handling missing or incorrect data through the least squares filling of missing data, which shows that the NN prediction remains strong even with a reduced number of diagnostics. Additionally, the method is tested on plasmas outside of the training range showing reasonable results.


Impact of various DIII-D diagnostics on the accuracy of neural network surrogates for kinetic EFIT reconstructions

July 2024

·

16 Reads

·

2 Citations

Kinetic equilibrium reconstructions make use of profile information such as particle density and temperature measurements in addition to magnetics data to compute a self-consistent equilibrium. They are used in a multitude of physics-based modeling. This work develops a multi-layer perceptron (MLP) neural network (NN) model as a surrogate for kinetic Equilibrium Fitting (EFITs) and trains on the 2019 DIII-D discharge campaign database of kinetic equilibrium reconstructions. We investigate the impact of including various diagnostic data and machine actuator controls as input into the NN. When giving various categories of data as input into NN models that have been trained using those same categories of data, the predictions on multiple equilibrium reconstruction solutions (poloidal magnetic flux, global scalars, pressure profile, current profile) are highly accurate. When comparing different models with different diagnostics as input, the magnetics-only model outputs accurate kinetic profiles and the inclusion of additional data does not significantly impact the accuracy. When the NN is tasked with inferring only a single target such as the EFIT pressure profile or EFIT current profile, we see a large increase in the accuracy of the prediction of the kinetic profiles as more data is included. These results indicate that certain MLP NN configurations can be reasonably robust to different burning-plasma-relevant diagnostics depending on the accuracy requirements for equilibrium reconstruction tasks.


Thinking Bayesian for plasma physicists

May 2024

·

14 Reads

·

1 Citation

Bayesian statistics offers a powerful technique for plasma physicists to infer knowledge from the heterogeneous data types encountered. To explain this power, a simple example, Gaussian Process Regression, and the application of Bayesian statistics to inverse problems are explained. The likelihood is the key distribution because it contains the data model, or theoretic predictions, of the desired quantities. By using prior knowledge, the distribution of the inferred quantities of interest based on the data given can be inferred. Because it is a distribution of inferred quantities given the data and not a single prediction, uncertainty quantification is a natural consequence of Bayesian statistics. The benefits of machine learning in developing surrogate models for solving inverse problems are discussed, as well as progress in quantitatively understanding the errors that such a model introduces.


Probabilistic locked mode predictor in the presence of a resistive wall and finite island saturation in tokamaks

March 2024

·

18 Reads

We present a framework for estimating the probability of locking to an error field in a rotating tokamak plasma. This leverages machine learning methods trained on data from a mode-locking model, including an error field, resistive magnetohydrodynamics modeling of the plasma, a resistive wall, and an external vacuum region, leading to a fifth-order ordinary differential equation (ODE) system. It is an extension of the model without a resistive wall introduced by Akçay et al. [Phys. Plasmas 28, 082106 (2021)]. Tearing mode saturation by a finite island width is also modeled. We vary three pairs of control parameters in our studies: the momentum source plus either the error field, the tearing stability index, or the island saturation term. The order parameters are the time-asymptotic values of the five ODE variables. Normalization of them reduces the system to 2D and facilitates the classification into locked (L) or unlocked (U) states, as illustrated by Akçay et al., [Phys. Plasmas 28, 082106 (2021)]. This classification splits the control space into three regions: L ̂, with only L states; U ̂, with only U states; and a hysteresis (hysteretic) region H ̂, with both L and U states. In regions L ̂ and U ̂, the cubic equation of torque balance yields one real root. Region H ̂ has three roots, allowing bifurcations between the L and U states. The classification of the ODE solutions into L/U is used to estimate the locking probability, conditional on the pair of the control parameters, using a neural network. We also explore estimating the locking probability for a sparse dataset, using a transfer learning method based on a dense model dataset.


MHD modeling of shattered pellet injection in JET

April 2023

·

50 Reads

·

10 Citations

Nonlinear 3D MHD simulations of shattered-pellet injection (SPI) in JET show prototypical SPI- driven disruptions using the M3D-C1 and NIMROD extended-MHD codes. Initially, radiation-driven thermal quenches are accelerated by MHD activity as the pellet crosses rational surfaces, leading to a radiation spike, global stochasticization of the magnetic field, and a complete thermal quench. Eventually, current quenches, preceded by a current spike are seen as the Ohmic heating becomes equal to the radiative cooling. The results are qualitatively similar for both a single monolithic pellet, pencil-beam model, and a realistic shatter to represent the SPI plume. A scan in viscosity from 500-2000 m2/s for MHD simulations finds that reducing viscosity increases MHD activity and decreases thermal quench time 1slightly. A realistic cloud of fragments modeling shows that mixed-D-Ne pellet travels deeper into the plasma core before the thermal quench. At the slow pellet speeds, the pellet is found to be moving slowly enough inward that even the 5% neon in the mixed pellet is enough to effectively radiate the thermal energy available. Radiation toroidal peaking is predicted to be at levels consistent with experimental observations and reduced as the pellet travels deeper into the plasma. These simulations lay the ground work for more-sophisticated validative and predictive modeling of SPI in JET using both M3D-C1 and NIMROD


Surrogate models for plasma displacement and current in 3-D perturbed magnetohydrodynamic equilibria in tokamaks

October 2022

·

29 Reads

·

8 Citations

A numerical database of over one thousand perturbed three-dimensional (3D) equilibria has been generated, constructed based on the MARS-F (Liu et al 2000 Phys. Plasmas 7 3681) computed plasma response to the externally applied 3D field sources in multiple tokamak devices. Perturbed 3D equilibria with the n = 1–4 ( n is the toroidal mode number) toroidal periodicity are computed. Surrogate models are created for the computed perturbed 3D equilibrium utilizing model order reduction (MOR) techniques. In particular, retaining the first few eigenstates from the singular value decomposition (SVD) of the data is found to produce reasonably accurate MOR-representations for the key perturbed quantities, such as the perturbed parallel plasma current density and the plasma radial displacement. SVD also helps to reveal the core versus edge plasma response to the applied 3D field. For the database covering the conventional aspect ratio devices, about 95% of data can be represented by the truncated SVD-series with inclusion of only the first five eigenstates, achieving a relative error (RE) below 20%. The MOR-data is further utilized to train neural networks (NNs) to enable fast reconstruction of perturbed 3D equilibria, based on the two-dimensional equilibrium input and the 3D source field. The best NN-training is achieved for the MOR-data obtained with a global SVD approach, where the full set of samples used for NN training and testing are stretched and form a large matrix which is then subject to SVD. The fully connected multi-layer perceptron, with one or two hidden layers, can be trained to predict the MOR-data with less than 10% RE. As a key insight, a better strategy is to train separate NNs for the plasma response fields with different toroidal mode numbers. It is also better to apply MOR and to subsequently train NNs separately for conventional and low aspect ratio devices, due to enhanced toroidal coupling of Fourier spectra in the plasma response in the latter case.


Figure 4. The database-wide pair of mean energy-based features. The left panel shows the toroidal mean feature x where x is the toroidal term ftor of (1/2)J · A. The right panel shows the mean feature when instead x is the poloidal term f pol of (1/2)J · A. Around 2.5 million equilibrium reconstructions were used in the analysis, accrued from 17 thousand different DIII-D plasma discharges.
Figure 6. The database-wide PCA basis vectors. The top row shows the first 8 orthonormal vectors for the toroidal feature. The bottom row shows the first 8 orthonormal vectors for the poloidal feature. The first vector in both rows is largely (anti-)parallel to the respective mean feature (cosine similarities −0.99 and −0.98). The color-scale is adapted to each thumbnail independently. The basis vectors are the columns of the matrix (3.2) for the respective feature (toroidal and poloidal).
Figure 8. Panel (a) visualizes three random prior draws of the PCLLR β parameter using the smoothness enforcing covariance (4.10). Panel (b) shows the visualization of the estimated PCLLR model in physical space, using the prior shown in panel (a). The inner product of the weight fields in panel (b) with the pair of features for a plasma time-slice (4.8), plus an intercept term, yields the PCLLR model log-odds output, as defined by equation (4.1).
Database-wide hazard modelling of the onset of DIII-D tearing modes with field features

October 2022

·

24 Reads

·

1 Citation

Journal of Plasma Physics

The rate of onset (hazard) of tearing modes is modelled probabilistically using statistical learning algorithms. Axisymmetric energy-density equilibrium fields are taken as raw high-dimensional input features which are reduced with principal component analysis. Signal processing of non-axisymmetric magnetics fluctuation array data provides the target information from which to learn. Model selection, visualization and calibration assessment procedures are detailed. The analysis is deployed at large scale across the DIII-D tokamak database. Standard model selection criteria suggest that the energy-density post-processed feature is a better choice for modelling the onset rate compared to the non-processed equilibrium reconstruction solution. Two example applications of the learned rate function are demonstrated: (i) proximity-to-onset discharge monitoring and (ii) database analysis showing an (expected) observational global trend that the general hazard increases as a plasma performance metric increases. An important connection between the hazard function and its use as a conditional probability generator is reviewed in the Appendix.


Application of machine learning and artificial intelligence to extend EFIT equilibrium reconstruction

May 2022

·

80 Reads

·

36 Citations

Recent progress in the application of machine learning (ML) / artificial intelligence (AI) algorithms to improve EFIT equilibrium reconstruction for fusion data analysis applications is presented. A device-independent portable core equilibrium solver capable of computing or reconstructing equilibrium for different tokamaks has been created to facilitate adaptation of ML/AI algorithms. A large EFIT database comprising of DIII-D magnetic, Motional-Stark Effect (MSE), and kinetic reconstruction data has been generated for developments of EFIT Model-Order-Reduction (MOR) surrogate models to reconstruct approximate equilibrium solutions. A neural-network (NN) MOR surrogate model has been successfully trained and tested using the magnetically reconstructed datasets with encouraging results. Other progress includes developments of a Gaussian-Process (GP) Bayesian framework that can adapt its many hyperparameters to improve processing of experimental input data and a 3D perturbed equilibrium database from toroidal full magnetohydrodynamic linear response modeling using the MARS-F code for developments of 3D-MOR surrogate models.


Machine learning methods for probabilistic locked-mode predictors in tokamak plasmas

August 2021

·

18 Reads

·

7 Citations

A rotating tokamak plasma can interact resonantly with the external helical magnetic perturbations, also known as error fields. This can lead to locking and then to disruptions. We leverage machine learning (ML) methods to predict the locking events. We use a coupled third-order nonlinear ordinary differential equation model to represent the interaction of the magnetic perturbation and the plasma rotation with the error field. This model is sufficient to describe qualitatively the locking and unlocking bifurcations. We explore using ML algorithms with the simulation data and experimental data, focusing on the methods that can be used with sparse datasets. These methods lead to the possibility of the avoidance of locking in real-time operations. We describe the operational space in terms of two control parameters: the magnitude of the error field and the rotation frequency associated with the momentum source that maintains the plasma rotation. The outcomes are quantified by order parameters that completely characterize the state, whether locked or unlocked. We use unsupervised ML methods to classify locked/unlocked states and note the usefulness of a certain normalization of the order parameters. Three supervised ML classifiers are used in suite to estimate the probability of locking in the region of control parameter space with hysteresis, i.e., the set of control parameters for which both locked and unlocked states can exist. The results show that a neural network gives the best estimate of the locking probability. An analogy of the present locking model with the van der Waals equation of state is also provided.


Citations (10)


... 10,11 Neural networks have also been trained for the DIII-D tokamak, showcasing promising predictive capabilities. 12 Notably, there have been substantial refinements and improvements made to the equilibrium surrogates 13 using a comprehensive neural architecture search (NAS). 14 However, thus far, these studies have not leveraged the separation of the known external contributions and unknown internal contributions to the Grad-Shafranov equation, with the exception of recent work examining equilibrium in the NSTX tokamak. ...

Reference:

Augmenting machine learning of Grad–Shafranov equilibrium reconstruction with Green's functions
EFIT-Prime: Probabilistic and physics-constrained reduced-order neural network model for equilibrium reconstruction in DIII-D

... By using only DIII-D data from within one years' run campaign, we ensure that the same diagnostics are available and, with proper imputation schemes, avoid these issues that are not currently handled by the NNs. As a parallel study, the performance and impact of explicitly encoding the physical position of the diagnostic into the NN is being studied [28,29]. Furthermore, synthetic diagnostics can be used via physicsbased models or data-based imputation schemes, which would allow extrapolation to regimes where the diagnostic data is unavailable (for example, early discharge times). ...

Augmenting machine learning of Grad–Shafranov equilibrium reconstruction with Green's functions

... In nuclear fusion research, neural networks have been utilized for various applications including plasma tomography, where they have facilitated full, pixel-by-pixel reconstructions of plasma profiles using the bolometer system at JET [59], and neutron emissivity tomography [60]. Recent studies have also applied neural networks to MHD equilibrium reconstruction [61,62], kinetic profile reconstruction [63], and disruption prediction [64][65][66]. ...

Impact of various DIII-D diagnostics on the accuracy of neural network surrogates for kinetic EFIT reconstructions

... On the numerical front, continuous developments aim to achieve predictive capabilities regarding the performance of SPI. This includes work from nonlinear 3D MHD codes such as M3D-C1 [17][18][19], NIMROD [20,21], and JOREK [22][23][24][25][26][27], as well as work in reduced dimensions, such as INDEX [28] and DREAM [29]. Although it is desirable to validate these numerical models against SPI experiments conducted in existing devices, the highly nonlinear and violent phase of plasma resulting from SPI poses challenges on both experimental and numerical sides. ...

MHD modeling of shattered pellet injection in JET

... Each layer's number of nodes was set to the geometric mean of the nodes in the neighboring layers. This was a principled architecture decision following the results of the numerical studies in [45]. The remaining hyper-parameters chosen are as follows: ...

Surrogate models for plasma displacement and current in 3-D perturbed magnetohydrodynamic equilibria in tokamaks

... Recent work more directly relevant to our investigation is by Murari et al. 23 on the probabilistic locked-mode predictor, which uses support-vector machine classifiers 24 and adaptive training on the JET data. Another example by Olofsson et al. [25][26][27] focuses on hazard/survival analysis for calculating the event onset intensity with statistical models. ...

Database-wide hazard modelling of the onset of DIII-D tearing modes with field features

Journal of Plasma Physics

... Benefiting from the development of machine learning, these approaches have been applied in plasma research in various fields. These include the regression of energy confinement scaling nonlinearly [19], database construction [20], equilibrium reconstruction [21], reconstruction of temperature profile [22], analysis of charge exchange spectrum on the JET device [23], turbulence [24,25] and turbulent transport [26,27], automatic identification [28], disruption prediction [29][30][31][32], properties of micro-tearing modes [33,34], tokamak control [35,36] and development of surrogate models [37][38][39][40][41]. ...

Application of machine learning and artificial intelligence to extend EFIT equilibrium reconstruction
  • Citing Article
  • May 2022

... Also, a set of pseudo-potentials that are utilized in computation form part of the given data. In addition to being useful to other researchers for a variety of purposes, these data could serve as material in the interdisciplinary fields of machine learning, astrophysics, and fusion physics [16][17][18][19]. Moreover, the data could be useful in technological facilities, power-generating facilities, attosecond laser experiments like in extreme light infrastructure (ELI) beamline facilities [20][21][22], etc. ...

Machine learning methods for probabilistic locked-mode predictors in tokamak plasmas
  • Citing Article
  • August 2021

... Recent nonlinear resistive single-fluid MHD simulations of plasma response have been carried out in a cylindrical configuration using the NIMROD code, where only the poloidal rotation evolution is considered. 30 Many other early and recent nonlinear simulations on plasma response using various MHD models and codes have been reported, where the attention is directed toward other key aspects of the response process instead of the flow evolution (e.g., Refs. 34 and 35). ...

Nonlinear error field response in the presence of plasma rotation and real frequencies due to favorable curvature
  • Citing Article
  • March 2020

... If we are not interested in the precise effects of the injector geometry, we can gain considerable numerical efficiency by solving the evolution equations with algorithms optimized for axisymmetric geometries. NIMROD is a versatile extended MHD code used for simulating spheromaks 5,7,18,28,29 as well as many other axisymmetric or quasi-axisymmetric plasma systems 30,31 . Instead of implementing the boundary condition J ·n, a thin layer of high resistivity with η wall /η plasma ≈ 10 5 is used to impede current flow into the wall. ...

Formation of closed flux surfaces in spheromaks sustained by Steady Inductive Helicity Injection
  • Citing Article
  • April 2019