How hard is learning to cut? Trade-offs and sample complexity
By: Sammy Khalife, Andrea Lodi
Potential Business Impact:
Makes computer problem-solving faster and smarter.
In the recent years, branch-and-cut algorithms have been the target of data-driven approaches designed to enhance the decision making in different phases of the algorithm such as branching, or the choice of cutting planes (cuts). In particular, for cutting plane selection two score functions have been proposed in the literature to evaluate the quality of a cut: branch-and-cut tree size and gap closed. In this paper, we present new sample complexity lower bounds, valid for both scores. We show that for a wide family of classes $\mathcal{F}$ that maps an instance to a cut, learning over an unknown distribution of the instances to minimize those scores requires at least (up to multiplicative constants) as many samples as learning from the same class function $\mathcal{F}$ any generic target function (using square loss). Our results also extend to the case of learning from a restricted set of cuts, namely those from the Simplex tableau. To the best of our knowledge, these constitute the first lower bounds for the learning-to-cut framework. We compare our bounds to known upper bounds in the case of neural networks and show they are nearly tight. We illustrate our results with a graph neural network selection evaluated on set covering and facility location integer programming models and we empirically show that the gap closed score is an effective proxy to minimize the branch-and-cut tree size. Although the gap closed score has been extensively used in the integer programming literature, this is the first principled analysis discussing both scores at the same time both theoretically and computationally.
Similar Papers
Generalization Guarantees for Learning Branch-and-Cut Policies in Integer Programming
Machine Learning (CS)
Helps computers solve hard problems faster.
Beyond Local Selection: Global Cut Selection for Enhanced Mixed-Integer Programming
Artificial Intelligence
Solves hard math problems much faster.
Probabilistic Graph Cuts
Machine Learning (CS)
Helps computers group similar things together better.