AIResearch AIResearch
Back to articles
AI

AI Can Estimate Causal Effects Without Seeing the Cause

Researchers have developed a method that allows artificial intelligence to estimate the causal effect of a treatment, such as a drug or a policy change, even when key confounding factors are hidden. T…

AI Research
November 14, 2025
3 min read
AI Can Estimate Causal Effects Without Seeing the Cause

Researchers have developed a method that allows artificial intelligence to estimate the causal effect of a treatment, such as a drug or a policy change, even when key confounding factors are hidden. This breakthrough addresses a fundamental challenge in fields from medicine to economics, where unobserved variables can distort conclusions drawn from observational data. The approach relies on proxy variables—measurable stand-ins for hidden confounders—and data from multiple environments, enabling accurate predictions in new, unseen settings without direct access to the treatment's outcome there. This could transform how scientists analyze real-world data where randomized controlled trials are impractical or unethical.

The key finding is that the causal effect of a treatment on an outcome can be identified using only proxy variables and data from source domains, even when the confounder is unobserved and varies across environments. For example, in a study of hotel search rankings, the effect of a hotel's position on user clicks was estimated without knowing all hidden factors influencing user choices, using price as a proxy. The researchers proved mathematically that under specific conditions, the interventional distribution—what would happen if the treatment were applied—is identifiable from the available data.

Methodologically, the team proposed two estimation techniques based on a structural causal model. The first uses maximum likelihood to parametrize the underlying mechanisms, while the second employs a reduced parametrization that directly computes the effect using empirical frequencies from the data. Both methods assume the proxy variables are sufficiently informative about the hidden confounder, as formalized by a rank condition on probability matrices. In simulations, these estimators consistently outperformed naive baselines that ignore confounding or adjust improperly for proxies.

Results from simulation studies and a real-world application to the Expedia hotel search dataset demonstrate the method's effectiveness. In simulations with sample sizes up to 100,000, the estimators showed low absolute errors and converged to the true causal effect as data increased. For the hotel ranking example, the method produced confidence intervals that overlapped with oracle estimates derived from randomized data, with an average absolute error of 0.044 for the reduced estimator, compared to 0.051 for a no-adjustment baseline. This indicates reliable performance in practical scenarios where ground truth is otherwise inaccessible.

In context, this work matters because it enables more trustworthy causal inferences in areas like public health, marketing, and social sciences, where hidden confounders are common. For instance, it could help estimate the effect of a new medication using historical patient data without full knowledge of all health factors, or assess policy impacts using proxy indicators. By leveraging multi-domain data, the method enhances the portability of causal findings, reducing reliance on costly or infeasible experiments.

Limitations include the assumption that proxy variables are discrete and sufficiently diverse across domains, as noted in the paper's discussion. If these conditions are not met, identifiability may be lost. Future work could explore extensions to continuous proxies and relaxations of the latent shift assumption, potentially broadening applicability to more complex real-world datasets.

About the Author

Guilherme A.

Guilherme A.

Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.

Connect on LinkedIn