Quantum computers hold promise for simulating complex molecules, a task that is notoriously difficult for classical machines. However, a major hurdle has been designing the right quantum circuits, known as ansätze, which are crucial for accurate calculations. Typically, researchers must craft a new circuit for each specific molecular configuration, a time-consuming process that doesn't leverage similarities across related problems. Now, a team has developed a reinforcement learning (RL) framework that learns to generate these circuits adaptively for a range of molecular geometries, offering a more efficient and generalizable approach.
The researchers focused on computing potential energy curves (PECs), which describe a molecule's ground-state energy as it stretches or compresses along a bond distance. Instead of optimizing a circuit separately for each point on the curve, their RL agent learns a mapping that outputs a circuit for any bond distance within a continuous interval. This means the agent can produce circuits for unseen geometries without retraining, effectively solving many variational quantum eigensolver (VQE) problems at once. uses a soft actor-critic algorithm to select gates and their parameters from a hardware-efficient set, without relying on domain-specific knowledge, making it transferable to other problems.
In tests, the framework was applied to lithium hydride (LiH) molecules with four and six qubits, and an eight-qubit hydrogen chain (H4). For four-qubit LiH, the agent achieved chemical accuracy (within 0.001 Ha) in five out of twelve runs at a fixed bond distance of 2.2 Å, with a mean error of 0.0041 Ha. When generating the full PEC over a range of 1.0 to 4.0 Å, the average error was 0.0136 Ha, about 5.1 times more accurate than the Hartree-Fock approximation. The training cost per bond distance dropped dramatically to 1,380 episodes, a 22-fold improvement compared to single-distance training, which required 30,000 episodes.
Show that the learned circuits are not only accurate but also interpretable. For instance, in the four-qubit LiH PEC, the agent consistently used R_y and CNOT gates while avoiding R_x gates, aligning with chemical intuition for real-valued wavefunctions. The circuit depth ranged from 9 to 12 layers for six-qubit LiH, which is within an efficient window compared to optimal depths of 6-14. However, limitations include symmetry-breaking at larger bond distances, where the agent converged to energetically favorable but physically invalid states, and the current framework is restricted to classical simulation due to high measurement costs on quantum hardware.
This work highlights RL's potential for transferable quantum circuit design, offering non-greedy exploration that can escape local minima. Future directions include incorporating symmetry constraints, using chemically motivated gate sets for better performance, and scaling to larger systems through fragmentation s. The code and data are publicly available, paving the way for broader applications in quantum optimization and beyond.
Original Source
Read the complete research paper
About the Author
Guilherme A.
Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.
Connect on LinkedIn