Back to News
quantum-computing

Adaptive Shot Allocation for Recursive QAOA via Reinforcement Learning

arXiv Quantum Physics
Loading...
4 min read
0 likes
⚡ Quantum Brief
Researchers Euimin Lee and Shiho Kim introduced a reinforcement learning approach to optimize shot allocation in Recursive Quantum Approximate Optimization Algorithm (RQAOA), reducing measurement costs by up to 36% compared to uniform allocation. The study targets depth-1 RQAOA for weighted Max-Cut problems, framing shot allocation as a sequential decision problem to minimize noise exposure on near-term quantum devices. Two strategies were proposed: a heuristic method cutting shots by 23% using local difficulty indicators, and a Double Q-learning agent achieving 36% reduction under Lagrangian constraints. Experiments used a fixed-cap fairness protocol across diverse graph instances, isolating adaptive measurement’s impact while keeping elimination rules unchanged for consistency. The RL policy demonstrated generalization to unseen problem sizes, suggesting scalable efficiency gains for recursive quantum optimization under hardware noise constraints.
Adaptive Shot Allocation for Recursive QAOA via Reinforcement Learning

Summarize this article with:

Quantum Physics arXiv:2605.26544 (quant-ph) [Submitted on 26 May 2026] Title:Adaptive Shot Allocation for Recursive QAOA via Reinforcement Learning Authors:Euimin Lee, Shiho Kim View a PDF of the paper titled Adaptive Shot Allocation for Recursive QAOA via Reinforcement Learning, by Euimin Lee and Shiho Kim View PDF HTML (experimental) Abstract:Recursive QAOA (RQAOA) solves combinatorial optimization problems by using shallow quantum circuits to estimate pairwise correlations and recursively eliminate variables until a classical solver can handle the residual instance. Each elimination step requires measurement shots, and the total shot cost grows with the number of recursive stages. On near-term quantum devices, increasing shot counts can translate directly into greater exposure to hardware-level noise sources such as readout errors and decoherence, making shot-efficient execution not merely a cost-reduction measure but a factor with direct implications for solution reliability. While shot reduction has been studied broadly across NISQ algorithms, step-wise measurement control inside the recursive loop of RQAOA has received little attention. We formulate this step-wise allocation as a sequential decision problem and propose two strategies for depth-1 RQAOA on weighted Max-Cut instances. A hand-crafted heuristic assigns shots based on local indicators of step difficulty, and a tabular Double Q-learning agent learns a residual policy that adjusts this baseline under a Lagrangian-constrained objective. Both methods are evaluated under a fixed-cap fairness protocol that equalizes the per-step budget across all strategies, and the elimination rule itself is kept unchanged so that the contribution of adaptive measurement control can be isolated. On a diverse set of weighted graph instances spanning a range of sizes and structures, the heuristic reduces total shots by approximately 23% relative to uniform allocation, and the RL policy achieves a 36% reduction with a lower effective shots per success ratio than both baselines. The improvement persists on problem sizes not seen during training, suggesting that reinforcement learning can discover efficient, instance-adaptive measurement strategies in recursive quantum optimization. Subjects: Quantum Physics (quant-ph) Cite as: arXiv:2605.26544 [quant-ph] (or arXiv:2605.26544v1 [quant-ph] for this version) https://doi.org/10.48550/arXiv.2605.26544 Focus to learn more arXiv-issued DOI via DataCite (pending registration) Submission history From: Lee Euimin [view email] [v1] Tue, 26 May 2026 04:48:29 UTC (1,171 KB) Full-text links: Access Paper: View a PDF of the paper titled Adaptive Shot Allocation for Recursive QAOA via Reinforcement Learning, by Euimin Lee and Shiho KimView PDFHTML (experimental)TeX Source view license Current browse context: quant-ph new | recent | 2026-05 References & Citations INSPIRE HEP NASA ADSGoogle Scholar Semantic Scholar export BibTeX citation Loading... BibTeX formatted citation × loading... Data provided by: Bookmark Bibliographic Tools Bibliographic and Citation Tools Bibliographic Explorer Toggle Bibliographic Explorer (What is the Explorer?) Connected Papers Toggle Connected Papers (What is Connected Papers?) Litmaps Toggle Litmaps (What is Litmaps?) scite.ai Toggle scite Smart Citations (What are Smart Citations?) Code, Data, Media Code, Data and Media Associated with this Article alphaXiv Toggle alphaXiv (What is alphaXiv?) Links to Code Toggle CatalyzeX Code Finder for Papers (What is CatalyzeX?) DagsHub Toggle DagsHub (What is DagsHub?) GotitPub Toggle Gotit.pub (What is GotitPub?) Huggingface Toggle Hugging Face (What is Huggingface?) ScienceCast Toggle ScienceCast (What is ScienceCast?) Demos Demos Replicate Toggle Replicate (What is Replicate?) Spaces Toggle Hugging Face Spaces (What is Spaces?) Spaces Toggle TXYZ.AI (What is TXYZ.AI?) Related Papers Recommenders and Search Tools Link to Influence Flower Influence Flower (What are Influence Flowers?) Core recommender toggle CORE Recommender (What is CORE?) Author Venue Institution Topic About arXivLabs arXivLabs: experimental projects with community collaborators arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Read Original

Tags

quantum-optimization
quantum-algorithms

Source Information

Source: arXiv Quantum Physics