Score: 0

Multilinear and Linear Programs for Partially Identifiable Queries in Quasi-Markovian Structural Causal Models

Published: September 2, 2025 | arXiv ID: 2509.03548v1

By: João P. Arroyo , João G. Rodrigues , Daniel Lawand and more

Potential Business Impact:

Finds hidden causes even with missing information.

Business Areas:
A/B Testing Data and Analytics

We investigate partially identifiable queries in a class of causal models. We focus on acyclic Structural Causal Models that are quasi-Markovian (that is, each endogenous variable is connected with at most one exogenous confounder). We look into scenarios where endogenous variables are observed (and a distribution over them is known), while exogenous variables are not fully specified. This leads to a representation that is in essence a Bayesian network where the distribution of root variables is not uniquely determined. In such circumstances, it may not be possible to precisely compute a probability value of interest. We thus study the computation of tight probability bounds, a problem that has been solved by multilinear programming in general, and by linear programming when a single confounded component is intervened upon. We present a new algorithm to simplify the construction of such programs by exploiting input probabilities over endogenous variables. For scenarios with a single intervention, we apply column generation to compute a probability bound through a sequence of auxiliary linear integer programs, thus showing that a representation with polynomial cardinality for exogenous variables is possible. Experiments show column generation techniques to be superior to existing methods.

Page Count
14 pages

Category
Computer Science:
Artificial Intelligence