Michael A. Osborne’s research while affiliated with University of Oxford and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (115)


Device schematics. Si FinFET (a), GeSi nanowire (b) and Ge/SiGe heterostructure (c) device architectures and their corresponding current pinch-off hypersurfaces for hole transport calculated using a Gaussian process model for one of the tuning algorithm runs (d–f). Three gates are plotted for illustrative purposes with the remaining gates on each device set to a constant value. The bias was kept constant throughout the experiment. CATSAI was given control over the gate electrodes V1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{1}$$\end{document}–V4\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{4}$$\end{document}, V1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{1}$$\end{document}–V5\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{5}$$\end{document}, and V1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{1}$$\end{document}–V7\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{7}$$\end{document} on the FinFET, nanowire and heterostructure, respectively. The dashed white circles show the approximate locations of the quantum dots formed in the devices.
Outline of CATSAI’s workflow. The initialisation stage consists of setting Vbias\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{\textrm{bias}}$$\end{document} then measuring the maximum and minimum (offset) current flowing through the device. The sampling stage detects pinch-off locations in gate voltage space. The algorithm selects a vector in gate voltage space u\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{u}$$\end{document} based on the model it generates of the hypersurface and of the probability of finding Coulomb peaks in a given location in gate voltage space. In the investigation stage the algorithm uses the plunger gates to sequentially acquire current traces and maps which are sent to the relevant classifiers. The Coulomb peak detector is a random forest classifier which determines whether Coulomb peaks are present (positive) or not (negative) within a current trace. In each iteration, the algorithm outputs a high-resolution current map if the double dot check score function is passed. After the investigation stage, the algorithm returns to the sampling stage.
Gate-voltage space exploration. Different charge carriers (gate operation modes) are represented in different columns (rows). Each panel illustrates the initial placement of the origin (white circle), search boundary (red cross), and search direction (black arrow). The gate voltage space is divided into regions of near-zero (blue) and non-zero (pink) current. Regions of voltage space which cannot be explored due to the gate voltage bounds set to avoid device damage are greyed out.
Device tuning. Examples of current map outputs on the different devices in which CATSAI was run. High resolution maps are generated during the investigation stage by sweeping the plunger gates of each device Vp1,p2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{p1,p2}$$\end{document}; for the FinFET V3,2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{3,2}$$\end{document} (a–c), the nanowire V4,2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{4,2}$$\end{document} (d–f) and the heterostructure V3,5\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{3,5}$$\end{document} (g–i). These current maps are labelled a posteriori by humans to verify whether they correspond to the double quantum dot regime. C indicates the number of humans out of four who labelled the current map as corresponding to a double quantum dot regime. Red (blue) indicates regions of high (low) current in each map.
Benchmarking the algorithm’s performance. Cumulative sum of the average number of double quantum dot regimes verified by humans C¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\bar{C}$$\end{document} (first and second columns) and probability of finding Coulomb peaks P(peaks)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm {P(peaks)}$$\end{document} (third and fourth columns), as a function of laboratory time for each run of CATSAI and Random Search algorithms. Each coloured line corresponds to a different run. Rows correspond to the different devices. Only the first 4 h of each tuning run are shown for ease of visualisation. CATSAI outperforms Random Search in the number of double quantum dot regimes located for all devices. The value of C¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\bar{C}$$\end{document} remains at 0 in many of the Random Search runs, and thus are not visible in the plots of C¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\bar{C}$$\end{document} as a function of time. The increase in P(peaks)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm {P(peaks)}$$\end{document} as a function of laboratory time observed for the CATSAI runs after the first 12 iterations can be explained by the algorithm ‘learning’ a better model of the hypersurface as the Gaussian process regression acquires more observations.

+1

Cross-architecture tuning of silicon and SiGe-based quantum devices using machine learning
  • Article
  • Full-text available

July 2024

·

46 Reads

·

9 Citations

B. Severin

·

·

L. C. Camenzind

·

[...]

·

N. Ares

The potential of Si and SiGe-based devices for the scaling of quantum circuits is tainted by device variability. Each device needs to be tuned to operation conditions and each device realisation requires a different tuning protocol. We demonstrate that it is possible to automate the tuning of a 4-gate Si FinFET, a 5-gate GeSi nanowire and a 7-gate Ge/SiGe heterostructure double quantum dot device from scratch with the same algorithm. We achieve tuning times of 30, 10, and 92 min, respectively. The algorithm also provides insight into the parameter space landscape for each of these devices, allowing for the characterization of the regions where double quantum dot regimes are found. These results show that overarching solutions for the tuning of quantum devices are enabled by machine learning.

Download

Bridging the Reality Gap in Quantum Devices with Physics-Aware Machine Learning

January 2024

·

43 Reads

·

9 Citations

Physical Review X

The discrepancies between reality and simulation impede the optimization and scalability of solid-state quantum devices. Disorder induced by the unpredictable distribution of material defects is one of the major contributions to the reality gap. We bridge this gap using physics-aware machine learning, in particular, using an approach combining a physical model, deep learning, Gaussian random field, and Bayesian inference. This approach enables us to infer the disorder potential of a nanoscale electronic device from electron-transport data. This inference is validated by verifying the algorithm’s predictions about the gate-voltage values required for a laterally defined quantum-dot device in AlGaAs/GaAs to produce current features corresponding to a double-quantum-dot regime. Published by the American Physical Society 2024


Figure 1: A binary tree kernel with four data points. In this example, k(x 1 , x 1 ) = 1, k(x 1 , x 2 ) = 0, k(x 1 , x 3 ) = 0.8, and k(x 1 , x 4 ) = 0.3.
Figure 3: Function from [0, 1] 2 → B 8 .
Log-Linear-Time Gaussian Processes Using Binary Tree Kernels

October 2022

·

46 Reads

Gaussian processes (GPs) produce good probabilistic models of functions, but most GP kernels require O((n+m)n2)O((n+m)n^2) time, where n is the number of data points and m the number of predictive locations. We present a new kernel that allows for Gaussian process regression in O((n+m)log(n+m))O((n+m)\log(n+m)) time. Our "binary tree" kernel places all data points on the leaves of a binary tree, with the kernel depending only on the depth of the deepest common ancestor. We can store the resulting kernel matrix in O(n) space in O(nlogn)O(n \log n) time, as a sum of sparse rank-one matrices, and approximately invert the kernel matrix in O(n) time. Sparse GP methods also offer linear run time, but they predict less well than higher dimensional kernels. On a classic suite of regression tasks, we compare our kernel against Mat\'ern, sparse, and sparse variational kernels. The binary tree GP assigns the highest likelihood to the test data on a plurality of datasets, usually achieves lower mean squared error than the sparse methods, and often ties or beats the Mat\'ern GP. On large datasets, the binary tree GP is fastest, and much faster than a Mat\'ern GP.


μdist and μprox model the world, perhaps coarsely, outside of the computer implementing the agent itself. μdist outputs reward equal to the box display, while μprox outputs reward according to an optical character recognition function applied to part of the visual field of a camera. (As a side note, some coarseness to this simulation is unavoidable, since a computable agent generally cannot perfectly model a world that includes itself (Leike, Taylor, and Fallenstein 2016); hence, the laptop is not in blue.)
Assistants in an assistance game model how actions and human actions produce observations and unobserved utility. These classes of models categorize (nonexhaustively) how the human action might affect the internals of the model.
Advanced artificial agents intervene in the provision of reward

August 2022

·

75 Reads

·

32 Citations

We analyze the expected behavior of an advanced artificial agent with a learned goal planning in an unknown environment. Given a few assumptions, we argue that it will encounter a fundamental ambiguity in the data about its goal. For example, if we provide a large reward to indicate that something about the world is satisfactory to us, it may hypothesize that what satisfied us was the sending of the reward itself; no observation can refute that. Then we argue that this ambiguity will lead it to intervene in whatever protocol we set up to provide data for the agent about its goal. We discuss an analogous failure mode of approximate solutions to assistance games. Finally, we briefly review some recent approaches that may avoid this problem.


Bayesian Generational Population-Based Training

July 2022

·

29 Reads

Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network architecture choice. Furthermore, non-stationarities such as evolving training data and increased agent complexity mean that different hyperparameters and architectures may be optimal at different points of training. This motivates AutoRL, a class of methods seeking to automate these design choices. One prominent class of AutoRL methods is Population-Based Training (PBT), which have led to impressive performance in several large scale settings. In this paper, we introduce two new innovations in PBT-style methods. First, we employ trust-region based Bayesian Optimization, enabling full coverage of the high-dimensional mixed hyperparameter search space. Second, we show that using a generational approach, we can also learn both architectures and hyperparameters jointly on-the-fly in a single training run. Leveraging the new highly parallelizable Brax physics engine, we show that these innovations lead to large performance gains, significantly outperforming the tuned baseline while learning entire configurations on the fly. Code is available at https://github.com/xingchenwan/bgpbt.


Probabilistic Numerics: Computation as Machine Learning

June 2022

·

17 Reads

·

29 Citations

Probabilistic numerical computation formalises the connection between machine learning and applied mathematics. Numerical algorithms approximate intractable quantities from computable ones. They estimate integrals from evaluations of the integrand, or the path of a dynamical system described by differential equations from evaluations of the vector field. In other words, they infer a latent quantity from data. This book shows that it is thus formally possible to think of computational routines as learning machines, and to use the notion of Bayesian inference to build more flexible, efficient, or customised algorithms for computation. The text caters for Masters' and PhD students, as well as postgraduate researchers in artificial intelligence, computer science, statistics, and applied mathematics. Extensive background material is provided along with a wealth of figures, worked examples, and exercises (with solutions) to develop intuition.


32 - Value Loss

June 2022

·

1 Read

Probabilistic numerical computation formalises the connection between machine learning and applied mathematics. Numerical algorithms approximate intractable quantities from computable ones. They estimate integrals from evaluations of the integrand, or the path of a dynamical system described by differential equations from evaluations of the vector field. In other words, they infer a latent quantity from data. This book shows that it is thus formally possible to think of computational routines as learning machines, and to use the notion of Bayesian inference to build more flexible, efficient, or customised algorithms for computation. The text caters for Masters' and PhD students, as well as postgraduate researchers in artificial intelligence, computer science, statistics, and applied mathematics. Extensive background material is provided along with a wealth of figures, worked examples, and exercises (with solutions) to develop intuition.


VI - Solving Ordinary Differential Equations

June 2022

·

1 Read

Probabilistic numerical computation formalises the connection between machine learning and applied mathematics. Numerical algorithms approximate intractable quantities from computable ones. They estimate integrals from evaluations of the integrand, or the path of a dynamical system described by differential equations from evaluations of the vector field. In other words, they infer a latent quantity from data. This book shows that it is thus formally possible to think of computational routines as learning machines, and to use the notion of Bayesian inference to build more flexible, efficient, or customised algorithms for computation. The text caters for Masters' and PhD students, as well as postgraduate researchers in artificial intelligence, computer science, statistics, and applied mathematics. Extensive background material is provided along with a wealth of figures, worked examples, and exercises (with solutions) to develop intuition.


V - Global Optimisation

June 2022

·

1 Read

Probabilistic numerical computation formalises the connection between machine learning and applied mathematics. Numerical algorithms approximate intractable quantities from computable ones. They estimate integrals from evaluations of the integrand, or the path of a dynamical system described by differential equations from evaluations of the vector field. In other words, they infer a latent quantity from data. This book shows that it is thus formally possible to think of computational routines as learning machines, and to use the notion of Bayesian inference to build more flexible, efficient, or customised algorithms for computation. The text caters for Masters' and PhD students, as well as postgraduate researchers in artificial intelligence, computer science, statistics, and applied mathematics. Extensive background material is provided along with a wealth of figures, worked examples, and exercises (with solutions) to develop intuition.


2 - Probabilistic Inference

June 2022

·

2 Reads

Probabilistic numerical computation formalises the connection between machine learning and applied mathematics. Numerical algorithms approximate intractable quantities from computable ones. They estimate integrals from evaluations of the integrand, or the path of a dynamical system described by differential equations from evaluations of the vector field. In other words, they infer a latent quantity from data. This book shows that it is thus formally possible to think of computational routines as learning machines, and to use the notion of Bayesian inference to build more flexible, efficient, or customised algorithms for computation. The text caters for Masters' and PhD students, as well as postgraduate researchers in artificial intelligence, computer science, statistics, and applied mathematics. Extensive background material is provided along with a wealth of figures, worked examples, and exercises (with solutions) to develop intuition.


Citations (29)


... If a charge sensor is available, undoped SiMOS devices have been autotuned to achieve a single-electron QD [18]. For nanowires, which by design have a single, known current channel, automation into fully formed QDs has been demonstrated, provided device-specific details are placed into a configuration file [19]. ...

Reference:

BATIS: Bootstrapping, Autonomous Testing, and Initialization System for Quantum Dot Devices
Cross-architecture tuning of silicon and SiGe-based quantum devices using machine learning

... Graybox models also have utility in more applied tasks, such as optimizing control voltages for a photonic quantum circuits (Fig. 3) [43]. More generally, ML methods have been used to learn quantum device characteristics otherwise inaccessible to experiments -such as disorder potentials [44,45] and the nuclear environment of a qubit [46]. ML-based algorithms can also be used to tune, characterize and optimize qubit operation (see section 4), take data efficiently [47] and learn the Hamiltonian parameters of a variety of quantum systems [40]. ...

Bridging the Reality Gap in Quantum Devices with Physics-Aware Machine Learning

Physical Review X

... These are current geopolitical issues that have to date proven unresolvable; it is unknown whether the risks inherent with AI change the calculus on the resolvability of these issues. 30 Through our experiences facilitating Intelligence Rising, we ourselves have come to better internalise the extreme uncertainty inherent in such a complex and wicked problem at the intersection of science, technology and international relations (Gruetzemacher 2018). This has taught us to respect the need for policymakers and decision-makers in AI firms to be very adaptable in the AI strategy space and to try to come up with governance proposals that are versatile to a wide range of possible events, trajectories and discontinuities. ...

Advanced artificial agents intervene in the provision of reward

... A related idea has appeared recently consisting in a proof of principle where the disorder is determined in Majorana nanowire systems [14]. Another application which consists in using machine learning to adjust device parameters to compensate for uncontrolled disorder effects has been recently implemented in the case of a double quantum dot nanostructure [15]. It has also been suggested that properties of the disorder between the fingers of a QPC can be extracted from SGM data using cellular neural networks [16] or a swarming algorithm [17]. ...

Bridging the reality gap in quantum devices with physics-aware machine learning

... Working memory resources are distributed between the choice of a task solving strategy, its application, memorization of the task conditions and intermediate variations of the answer [14,15]. The successful task solving results in more effective and ranked involvement of cognitive processes, first of all volitional attention, during the complex mental task fulfillment [16,17,18]. ...

Personalized brain stimulation for effective neurointervention across participants

... One such application is the facilitation of quantum experiments by partly or fully automating the experimental workflow in cases where tuning of several parameters is necessary [4,5]. Significant work has already been done towards characterization of multi-quantum-dot devices [6][7][8][9][10][11][12][13] and automated tuning of such systems to a desired operating configuration [14][15][16]. One common challenge in tuning up quantum devices is posed by material imperfections and other types of unpredictable disorder, which often hinder straightforward navigation through a large parameter space. ...

Deep reinforcement learning for efficient measurement of quantum devices

npj Quantum Information

... One such application is the facilitation of quantum experiments by partly or fully automating the experimental workflow in cases where tuning of several parameters is necessary [4,5]. Significant work has already been done towards characterization of multi-quantum-dot devices [6][7][8][9][10][11][12][13] and automated tuning of such systems to a desired operating configuration [14][15][16]. One common challenge in tuning up quantum devices is posed by material imperfections and other types of unpredictable disorder, which often hinder straightforward navigation through a large parameter space. ...

Machine learning enables completely automatic tuning of a quantum device faster than human experts

... Proficiency with data analysis tools can help them give more meaningful insights and reports. Routine administrative tasks are becoming automated through automation solutions (Willis et al., 2020). Office managers may use software to handle scheduling, reminders, and document processing in today's modern workplaces, which will boost output and reduce error risk. ...

Qualitative and quantitative approach to assess of the potential for automating administrative tasks in general practice

BMJ Open

... It is estimated that around one million qubits will be required for practical quantum computers 9 . As the number of qubits increases [10][11][12] , the parameter tuning process becomes complicated and infeasible, making automation critical for large-scale quantum computers [13][14][15] . ...

Efficiently measuring a quantum device using machine learning

npj Quantum Information