Rate-optimal Design for Anytime Best Arm Identification
By: Junpei Komiyama, Kyoungseok Jang, Junya Honda
Potential Business Impact:
Finds the best option faster with fewer tries.
We consider the best arm identification problem, where the goal is to identify the arm with the highest mean reward from a set of $K$ arms under a limited sampling budget. This problem models many practical scenarios such as A/B testing. We consider a class of algorithms for this problem, which is provably minimax optimal up to a constant factor. This idea is a generalization of existing works in fixed-budget best arm identification, which are limited to a particular choice of risk measures. Based on the framework, we propose Almost Tracking, a closed-form algorithm that has a provable guarantee on the popular risk measure $H_1$. Unlike existing algorithms, Almost Tracking does not require the total budget in advance nor does it need to discard a significant part of samples, which gives a practical advantage. Through experiments on synthetic and real-world datasets, we show that our algorithm outperforms existing anytime algorithms as well as fixed-budget algorithms.
Similar Papers
Constrained Best Arm Identification with Tests for Feasibility
Machine Learning (CS)
Finds the best option that also meets safety rules.
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
Machine Learning (CS)
Finds the best option quickly with limited money.
Optimal Best Arm Identification under Differential Privacy
Machine Learning (Stat)
Protects private data while finding the best option.