Score: 0

A burn-in(g) question: How long should an initial equal randomization stage be before Bayesian response-adaptive randomization?

Published: March 25, 2025 | arXiv ID: 2503.19795v1

By: Edwin Y. N. Tang , Stef Baas , Daniel Kaddaj and more

Potential Business Impact:

Helps doctors test new medicines fairly and safely.

Business Areas:
A/B Testing Data and Analytics

Response-adaptive (RA) trials offer the potential to enhance participant benefit but also complicate valid statistical analysis and potentially lead to a higher proportion of participants receiving an inferior treatment. A common approach to mitigate these disadvantages is to introduce a fixed non-adaptive randomization stage at the start of the RA design, known as the burn-in period. Currently, investigations and guidance on the effect of the burn-in length are scarce. To this end, this paper provides an exact evaluation approach to investigate how the burn-in length impacts the statistical properties of two-arm binary RA designs. We show that (1) for commonly used calibration and asymptotic tests an increase in the burn-in length reduces type I error rate inflation but does not lead to strict type I error rate control, necessitating exact tests; (2) the burn-in length substantially influences the power and participant benefit, and these measures are often not maximized at the maximum or minimum possible burn-in length; (3) the conditional exact test conditioning on total successes provides the highest average and minimum power for both small and moderate burn-in lengths compared to other tests. Using our exact analysis method, we re-design the ARREST trial to improve its statistical properties.

Page Count
25 pages

Category
Statistics:
Methodology