Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization
By: Constantinos Tsakonas, Konstantinos Chatzilygeroudis
Potential Business Impact:
Finds many good solutions without knowing how.
Quality-Diversity algorithms have transformed optimization by prioritizing the discovery of diverse, high-performing solutions over a single optimal result. However, traditional Quality-Diversity methods, such as MAP-Elites, rely heavily on predefined behavior descriptors and complete prior knowledge of the task to define the behavior space grid, limiting their flexibility and applicability. In this work, we introduce Vector Quantized-Elites (VQ-Elites), a novel Quality-Diversity algorithm that autonomously constructs a structured behavior space grid using unsupervised learning, eliminating the need for prior task-specific knowledge. At the core of VQ-Elites is the integration of Vector Quantized Variational Autoencoders, which enables the dynamic learning of behavior descriptors and the generation of a structured, rather than unstructured, behavior space grid -- a significant advancement over existing unsupervised Quality-Diversity approaches. This design establishes VQ-Elites as a flexible, robust, and task-agnostic optimization framework. To further enhance the performance of unsupervised Quality-Diversity algorithms, we introduce behavior space bounding and cooperation mechanisms, which significantly improve convergence and performance, as well as the Effective Diversity Ratio and Coverage Diversity Score, two novel metrics that quantify the actual diversity in the unsupervised setting. We validate VQ-Elites on robotic arm pose-reaching, mobile robot space-covering, and MiniGrid exploration tasks. The results demonstrate its ability to efficiently generate diverse, high-quality solutions, emphasizing its adaptability, scalability, robustness to hyperparameters, and potential to extend Quality-Diversity optimization to complex, previously inaccessible domains.
Similar Papers
AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization
Machine Learning (CS)
Finds new ways for robots to learn tasks.
Multi-Objective Quality-Diversity in Unstructured and Unbounded Spaces
Machine Learning (CS)
Finds many good solutions in tricky, unknown problems.
Overcoming Deceptiveness in Fitness Optimization with Unsupervised Quality-Diversity
Neural and Evolutionary Computing
Finds best robot moves even in tricky situations.