Alán Aspuru-Guzik

Alán Aspuru-Guzik
Harvard University | Harvard · Department of Chemistry and Chemical Biology

PhD

About

674
Publications
172,838
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
52,632
Citations
Citations since 2017
367 Research Items
44176 Citations
201720182019202020212022202302,0004,0006,0008,00010,000
201720182019202020212022202302,0004,0006,0008,00010,000
201720182019202020212022202302,0004,0006,0008,00010,000
201720182019202020212022202302,0004,0006,0008,00010,000
Introduction
I do research in theoretical chemistry at the interface with quantum physics, renewable energy, computer science and applied mathematics.
Additional affiliations
January 2006 - December 2012
Harvard University
January 1999 - December 2005
University of California, Berkeley
January 1994 - present
Universidad Nacional Autónoma de México
Education
July 1999 - December 2004
University of California, Berkeley
Field of study
  • Theoretical Physical Chemistry

Publications

Publications (674)
Preprint
Current quantum computing hardware is restricted by the availability of only few, noisy qubits which limits the investigation of larger, more complex molecules in quantum chemistry calculations on quantum computers in the near-term. In this work, we investigate the limits of their classical and near-classical treatment while staying within the fram...
Preprint
Full-text available
Hamiltonian simulation is known to be one of the fundamental building blocks of a variety of quantum algorithms such as its most immediate application, that of simulating many-body systems to extract their physical properties. In this work, we present qSWIFT, a high-order randomized algorithm for Hamiltonian simulation. In qSWIFT, the required numb...
Preprint
Transparent object perception is a crucial skill for applications such as robot manipulation in household and laboratory settings. Existing methods utilize RGB-D or stereo inputs to handle a subset of perception tasks including depth and pose estimation. However, transparent object perception remains to be an open problem. In this paper, we forgo t...
Article
Semi-empirical quantum chemistry has recently seen a renaissance with applications in high-throughput virtual screening and machine learning. The simplest semi-empirical model still in widespread use in chemistry is Hückel's $\pi$-electron molecular orbital theory. In this work, we implemented a Hückel program using differentiable programming with...
Preprint
Full-text available
String-based molecular representations play a crucial role in cheminformatics applications, and with the growing success of deep learning in chemistry, have been readily adopted into machine learning pipelines. However, traditional string-based representations such as SMILES are often prone to syntactic and semantic errors when produced by generati...
Article
Accelerated Materials Discovery In article number 2207070, Tony C Wu, Alán Aspuru‐Guzik, and co‐workers report a materials acceleration platform to automatically search for high‐performing organic laser molecules, which includes synthesis, product identification, and property measurements. This platform has discovered two state‐of‐the‐art organic l...
Article
Full-text available
Long-acting injectables are considered one of the most promising therapeutic strategies for the treatment of chronic diseases as they can afford improved therapeutic efficacy, safety, and patient compliance. The use of polymer materials in such a drug formulation strategy can offer unparalleled diversity owing to the ability to synthesize materials...
Article
Full-text available
The application of artificial intelligence (AI) has been considered as a revolutionary change in drug discovery and development. In 2020, AlphaFold computer program predicted protein structures for the whole human...
Preprint
Full-text available
Quantum computers promise to impact industrial applications, for which quantum chemical calculations are required, by virtue of their high accuracy. This perspective explores the challenges and opportunities of applying quantum computers to drug design, discusses where they could transform industrial research and elaborates on what is needed to rea...
Article
Full-text available
Digital Discovery celebrates its first anniversary.
Preprint
Full-text available
In the process of materials discovery, chemists currently need to perform many laborious, time-consuming, and often dangerous lab experiments. To accelerate this process, we propose a framework for robots to assist chemists by performing lab experiments autonomously. The solution allows a general-purpose robot to perform diverse chemistry experimen...
Article
Full-text available
Conventional materials discovery is a laborious and time‐consuming process that can take decades from initial conception of the material to commercialization. Recent developments in materials acceleration platforms promise to accelerate materials discovery using automation of experiments coupled with machine learning. However, most of the automatio...
Preprint
Deep learning models that leverage large datasets are often the state of the art for modelling molecular properties. When the datasets are smaller (< 2000 molecules), it is not clear that deep learning approaches are the right modelling tool. In this work we perform an extensive study of the calibration and generalizability of probabilistic machine...
Preprint
Full-text available
We present MatSim: a synthetic dataset, a benchmark, and a method for computer vision based recognition of similarities and transitions between materials and textures, focusing on identifying any material under any conditions using one or a few examples (one-shot learning). The visual recognition of materials is essential to everything from examini...
Preprint
Full-text available
One of the biggest obstacles to successful polymer property prediction is an effective representation that accurately captures the sequence of repeat units in a polymer. Motivated by the successes of data augmentation in computer vision and natural language processing, we explore augmenting polymer data by rearranging the molecular representation w...
Preprint
Full-text available
Semi-empirical quantum chemistry has recently seen a renaissance with applications in high-throughput virtual screening and machine learning. The simplest semi-empirical model still in widespread use in chemistry is H\"uckel's $\pi$-electron molecular orbital theory. In this work, we implemented a H\"uckel program using differentiable programming w...
Preprint
Full-text available
In this paper, we introduce four main novelties: First, we present a new way of handling the topology problem of normalizing flows. Second, we describe a technique to enforce certain classes of boundary conditions onto normalizing flows. Third, we introduce the I-Spline bijection, which, similar to previous work, leverages splines but, in contrast...
Preprint
Full-text available
We introduce Group SELFIES, a molecular string representation that leverages group tokens to represent functional groups or entire substructures while maintaining chemical robustness guarantees. Molecular string representations, such as SMILES and SELFIES, serve as the basis for molecular generation and optimization in chemical language models, dee...
Article
Nonpairwise multiqubit interactions present a useful resource for quantum information processors. Their implementation would facilitate more efficient quantum simulations of molecules and combinatorial optimization problems, and they could simplify error suppression and error correction schemes. Here, we present a superconducting circuit architectu...
Article
Full-text available
Computational power and quantum chemical methods have improved immensely since computers were first applied to the study of reactivity, but the de novo prediction of chemical reactions has remained challenging. We show that complex reaction pathways can be efficiently predicted in a guided manner using chemical activation imposed by geometrical con...
Article
General conditions for organic reactions are important but rare, and efforts to identify them usually consider only narrow regions of chemical space. Discovering more general reaction conditions requires considering vast regions of chemical space derived from a large matrix of substrates crossed with a high-dimensional matrix of reaction conditions...
Preprint
Full-text available
Nanomedicines have transformed promising therapeutic agents into clinically approved medicines with optimal safety and efficacy profiles. This is exemplified by the mRNA vaccines against COVID-19, which were made possible by lipid nanoparticle technology. Despite the success of nanomedicines to date, their design remains far from trivial in part du...
Preprint
Full-text available
Transitioning from fossil fuels to renewable energy sources is a critical global challenge; it demands advances at the levels of materials, devices, and systems for the efficient harvesting, storage, conversion, and management of renewable energy. Researchers globally have begun incorporating machine learning (ML) techniques with the aim of acceler...
Article
Transitioning from fossil fuels to renewable energy sources is a critical global challenge; it demands advances — at the materials, devices and systems levels — for the efficient harvesting, storage, conversion and management of renewable energy. Energy researchers have begun to incorporate machine learning (ML) techniques to accelerate these advan...
Article
Full-text available
Logic Artificial Intelligence (AI) is a subfield of AI where variables can take two defined arguments, True or False, and are arranged in clauses that follow the rules of formal logic. Several problems that span from physical systems to mathematical conjectures can be encoded into these clauses and solved by checking their satisfiability (SAT). In...
Article
An oracle that correctly predicts the outcome of every particle physics experiment, the products of every possible chemical reaction or the function of every protein would revolutionize science and technology. However, scientists would not be entirely satisfied because they would want to comprehend how the oracle made these predictions. This is sci...
Preprint
Full-text available
Modeling of solid-state material-molecule interfaces in heterogeneous catalysis requires the extensive evaluation of the energy of molecules on surfaces. Obtaining the binding energy of many configurations of large organic molecules requires a vast amount of computational time with density functional theory (DFT). Here, we use a graph neural networ...
Preprint
Full-text available
Long-acting injectables are considered one of the most promising therapeutic strategies for the treatment of chronic diseases as they can afford improved therapeutic efficacy, safety, and patient compliance. The use of polymer materials in such a drug formulation strategy can offer unparalleled diversity owing to the ability to synthesize materials...
Article
Full-text available
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language...
Preprint
Full-text available
In this work, we introduce a quantum-control-inspired method for the characterization of variational quantum circuits using the rank of the dynamical Lie algebra associated with the hermitian generator(s) of the individual layers. Layer-based architectures in variational algorithms for the calculation of ground-state energies of physical systems ar...
Preprint
Full-text available
The efficient exploration of chemical space to design molecules with intended properties enables the accelerated discovery of drugs, materials, and catalysts, and is one of the most important outstanding challenges in chemistry. Encouraged by the recent surge in computer power and artificial intelligence development, many algorithms have been devel...
Article
Full-text available
Near-term quantum computation holds potential across multiple application domains. However, imperfect preparation and evolution of states due to algorithmic and experimental shortcomings, characteristic in the near-term implementation, would typically result in measurement outcomes deviating from the ideal setting. It is thus crucial for any near-t...
Preprint
Full-text available
We introduce a family of variational quantum algorithms called quantum iterative power algorithms (QIPA) that outperform existing hybrid near-term quantum algorithms of the same kind. We demonstrate the capabilities of QIPA as applied to three different global-optimization numerical experiments: the ground-state optimization of the $H_2$ molecular...
Article
ConspectusWe must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take...
Preprint
A new boron-containing heterocyclic scaffold – borindolizine – is reported as a new heterocyclic scaffold with broadly tunable fluorescence and a high Stokes shift. Two classes of emitters were synthesized through rational scaffold changes resulting in blue-emitting carboxyborindolizines (λmax,em = 431 – 459 nm) and green emitting aryl borindolizin...
Article
Designing viable molecular candidates is pivotal to devising low-cost and sustainable storage systems. A reinforcement learning framework has been developed that can identify stable candidates for redox flow batteries in the large search space of organic radicals.
Preprint
Full-text available
In this work, we introduce a new way to quantify information flow in quantum systems, especially for parameterized quantum circuits. We use a graph representation of the circuits and propose a new distance metric using the mutual information between gate nodes. We then present an optimization procedure for variational algorithms using paths based o...
Preprint
Full-text available
As we continue to find applications where the currently available noisy devices exhibit an advantage over their classical counterparts, the efficient use of quantum resources is highly desirable. The notion of quantum autoencoders was proposed as a way for the compression of quantum information to reduce resource requirements. Here, we present a st...
Article
Full-text available
Computer aided design of molecules has the potential to disrupt the field of drug and material discovery. Machine learning and deep learning in particular, made big strides in recent years and promises to greatly benefit computer aided methods. Reinforcement learning is a particularly promising approach since it enables de novo molecule design, tha...
Article
Over the past decades, continual advancement of computational power has led to the prevalence of automation across science, industry, and society, whereby digital solutions were developed algorithmically to exploit technical knowledge of a problem. However, we are currently entering a critical period of computational innovation, particularly in mat...
Article
Full-text available
Deep generative models of molecules have grown immensely in popularity, trained on relevant datasets, these models are used to search through chemical space. The downstream utility of generative models for the inverse design of novel functional compounds, depends on their ability to learn a training distribution of molecules. The most simple exampl...
Article
Full-text available
Quantum physics experiments produce interesting phenomena such as interference or entanglement, which are the core properties of numerous future quantum technologies. The complex relationship between the setup structure of a quantum experiment and its entanglement properties is essential to fundamental research in quantum optics but is difficult to...
Article
Full-text available
We provide an integration of the universal, perturbative explicitly correlated [2]R12-correction in the context of the Variational Quantum Eigensolver (VQE). This approach is able to increase the accuracy of the underlying reference method significantly while requiring no additional quantum resources. The proposed approach only requires knowledge o...
Preprint
Full-text available
To enable the accelerated discovery of materials with desirable properties, it is critical to develop accurate and efficient search algorithms. Quantum annealers and similar quantum-inspired optimizers have the potential to provide accelerated computation for certain combinatorial optimization challenges. However, they have not been exploited for m...
Article
In this study, we explore the use of molecules and molecular electronics for quantum computing. We construct one-qubit gates using one-electron scattering in molecules and two-qubit controlled-phase gates using electron-electron scattering along metallic leads. Furthermore, we propose a class of circuit implementations, and show initial application...
Preprint
Nonpairwise multi-qubit interactions present a useful resource for quantum information processors. Their implementation would facilitate more efficient quantum simulations of molecules and combinatorial optimization problems, and they could simplify error suppression and error correction schemes. Here we present a superconducting circuit architectu...
Preprint
Full-text available
Long-acting injectables are considered one of the most promising therapeutic strategies for the treatment of chronic diseases as they can afford improved therapeutic efficacy, safety, and patient compliance. The use of polymer materials in such a drug formulation strategy can offer unparalleled diversity owing to the ability to synthesize materials...
Article
Full-text available
Inverse molecular design involves algorithms that sample molecules with specific target properties from a multitude of candidates and can be posed as an optimization problem. High-dimensional optimization tasks in the natural sciences are commonly tackled via population-based metaheuristic optimization algorithms such as evolutionary algorithms. Ho...
Preprint
Full-text available
Long-acting injectables are considered one of the most promising therapeutic strategies for the treatment of chronic diseases as they can afford improved therapeutic efficacy, safety, and patient compliance. The use of polymer materials in such a drug formulation strategy can offer unparalleled diversity owing to the ability to synthesize materials...
Article
The core objective of machine-assisted scientific discovery is to learn physical laws from experimental data without prior knowledge of the systems in question. In the area of quantum physics, making progress towards these goals is significantly more challenging due to the curse of dimensionality as well as the counterintuitive nature of quantum me...
Preprint
Full-text available
Imagine an oracle that correctly predicts the outcome of every particle physics experiment, the products of every chemical reaction, or the function of every protein. Such an oracle would revolutionize science and technology as we know them. However, as scientists, we would not be satisfied with the oracle itself. We want more. We want to comprehen...
Preprint
Full-text available
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language...
Article
We present a classically tractable model that leads to optimized low-depth quantum circuits leveraging separable-pair approximations. The obtained circuits are well suited as a baseline circuit for emerging quantum hardware and can, in the long term, provide significantly improved initial states for quantum algorithms. The associated wave functions...
Preprint
Full-text available
Optimization strategies driven by machine learning, such as Bayesian optimization, are being explored across experimental sciences as an efficient alternative to traditional design of experiment. When combined with automated laboratory hardware and high-performance computing, these strategies enable next-generation platforms for autonomous experime...
Article
Multipartite entanglement is one of the core concepts in quantum information science with broad applications that span from condensed matter physics to quantum physics foundation tests. Although its most studied and tested forms encompass two-dimensional systems, current quantum platforms technically allow the manipulation of additional quantum lev...
Article
Full-text available
Conventional materials discovery is a laborious and time-consuming process that can take decades from initial conception of the material to commercialization. Recent developments in materials acceleration platforms promise to accelerate materials discovery using automation of experiments coupled with machine learning. However, most of the automatio...
Article
We present a review of the Unitary Coupled Cluster (UCC) ansatz and related ansätze which are used to variationally solve the electronic structure problem on quantum computers. A brief history of coupled cluster (CC) methods is provided, followed by a broad discussion of the formulation of CC theory. This includes touching on the merits and difficu...
Article
A universal fault-tolerant quantum computer that can efficiently solve problems such as integer factorization and unstructured database search requires millions of qubits with low error rates and long coherence times. While the experimental advancement toward realizing such devices will potentially take decades of research, noisy intermediate-scale...
Preprint
In this study, we explore the use of molecules and molecular electronics for quantum computing. We construct one-qubit gates using one-electron scattering in molecules, and two-qubit controlled-phase gates using electron-electron scattering along metallic leads. Furthermore, we propose a class of circuit implementations, and show initial applicatio...
Preprint
Machine learning has the potential to automate molecular design and drastically accelerate the discovery of new functional compounds. Towards this goal, generative models and reinforcement learning (RL) using string and graph representations have been successfully used to search for novel molecules. However, these approaches are limited since their...
Article
Full-text available
Self-driving laboratories, in the form of automated experimentation platforms guided by machine learning algorithms, have emerged as a potential solution to the need for accelerated science. While new tools for automated analysis and characterization are being developed at a steady rate, automated synthesis remains the bottleneck in the chemical sp...
Article
Full-text available
We present TransProteus, a dataset, and methods for predicting the 3D structure and properties of materials inside transparent vessels from a single image. Manipulating materials in containers is essential in...