AI meets quantum: optimal gate compilation using reinforcement learning
The Science: Optimal short gate sequences for quantum state preparation and gate compilation of a noisy qubit using concepts from reinforcement learning and mapped it onto a Markov Decision Process.
The Impact: Compiling arbitrary gates into a fixed discrete gate set and preparing arbitrary initial states in the presence of noise are essential tasks in quantum programming. Tools from reinforcement learning generated solutions that compare favorably to established methods.
Summary: Reinforcement learning has been applied to a variety of tasks in quantum programming. The underlying assumption is that those tasks can be modeled as Markov Decision Processes (MDPs). The feasibility of this assumption is tested for two fundamental tasks in quantum programming: state preparation and gate compilation. By forming discrete MDPs and solving for the optimal policy, focusing exclusively on the single-qubit case with and without noise, one finds the shortest possible sequence of gates to either prepare a state, or compile a gate, up to a target accuracy. For example, the sequences of H and T gates with length as small as 11 produce ~ 99% fidelity for states of the form (HT) n||0> with values as large as n = 1010. It is shown that the optimal policy adapts to the effects of noisy gates to achieve a higher state fidelity. The main outcome of this work is that one can impose a discrete, stochastic and Markovian nature to a continuous, deterministic and non-Markovian quantum evolution, which provides theoretical insight and support into why reinforcement learning is expected to be successful in finding optimally short gate sequences in quantum programming.
Contact: Peter Orth – porth@iastate.edu
Focus Area: Algorithms
Institutions: Ames National Laboratory, NASA Ames, Rigetti Computing
Acknowledgment: Alam, M.S., Berthusen, N.F. & Orth, P.P. Quantum logic gate synthesis as a Markov decision process. npj Quantum Inf 9, 108 (2023). https://doi.org/10.1038/s41534-023-00766-w
This work was supported by the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, and Superconducting Quantum Materials and Systems Center (SQMS).