In many real-world scenarios, from medical diagnostics to network maintenance, determining the state of a system requires conducting a series of tests, each with its own cost and uncertainty. The Sequential Testing Problem (STP) addresses this by seeking the most cost-effective order to perform tests, balancing expenses against probabilistic outcomes. This problem has gained renewed attention over the past two decades, with researchers developing new theoretical insights, extensions, and practical applications that could streamline decision-making in diverse industries. A recent review paper consolidates these advancements, highlighting how algorithmic strategies can optimize testing sequences to reduce expected costs, offering potential savings and efficiency gains in areas like healthcare, engineering, and data management.
The core finding from the review is that for certain types of systems, optimal testing strategies can be determined efficiently using specific rules or algorithms. For example, in series systems (where all components must function for the system to work), testing variables in non-decreasing order of the ratio of cost to the probability of failure (ci/qi) is optimal. Similarly, for parallel systems (where any functioning component suffices), the optimal order is based on the ratio of cost to the probability of success (ci/pi). These results extend to more complex classes such as k-out-of-n systems, where the system works if at least k out of n components function, with polynomial-time algorithms available for finding optimal strategies. The review also notes that for broader classes, such as double-regular functions, optimal algorithms exist, leveraging permutations that align with strength pre-orders of variables.
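The series-system rule can be sketched in a few lines of Python. This is an illustrative sketch only: the function names and the 3-component instance below are made up, and the brute-force comparison simply confirms that the ci/qi ordering matches the best fixed order on this toy example.

```python
from itertools import permutations

def expected_cost_series(order, costs, p_success):
    """Expected cost of testing a series (AND) system in a fixed order.
    Test i is paid for only if every earlier component passed."""
    total, prob_reached = 0.0, 1.0
    for i in order:
        total += prob_reached * costs[i]
        prob_reached *= p_success[i]
    return total

def series_order(costs, p_success):
    """The ratio rule: sort components by ci / qi, non-decreasing,
    where qi = 1 - pi is the failure probability."""
    return sorted(range(len(costs)), key=lambda i: costs[i] / (1 - p_success[i]))

# Made-up 3-component instance.
costs, p = [4.0, 1.0, 3.0], [0.9, 0.5, 0.8]
greedy = series_order(costs, p)            # ratios 40, 2, 15 → order [1, 2, 0]
brute = min(permutations(range(3)),
            key=lambda o: expected_cost_series(o, costs, p))
assert expected_cost_series(greedy, costs, p) == expected_cost_series(brute, costs, p)
```

The same skeleton handles parallel systems by sorting on ci/pi instead and accumulating the probability that all earlier components failed.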
Methodologically, researchers have approached the STP by representing testing strategies as Binary Decision Trees (BDTs), where internal nodes correspond to the variables tested and leaves indicate the function's value (0 or 1). This representation allows expected costs to be computed by summing over all possible outcomes, weighted by their probabilities. Strategies can be adaptive, where the next test depends on previous outcomes, or nonadaptive, where tests follow a fixed sequence regardless of outcomes, with the adaptivity gap measuring the cost difference between these approaches. Various solution approaches have been employed, including dynamic programming for read-once functions (which model connectivity in series-parallel networks) and approximation algorithms using techniques like adaptive submodularity for harder cases, such as threshold functions or DNF/CNF representations.
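The expected-cost computation over a BDT is a direct recursion: pay for the test at the root, then recurse into each subtree weighted by the outcome probability. A minimal sketch, with the tuple encoding of trees and the two-variable instance invented here for illustration:

```python
def bdt_expected_cost(tree, costs, p):
    """Expected cost of a binary decision tree (BDT) strategy.
    A leaf is the function value 0 or 1 (no further cost); an internal
    node is a tuple (variable, subtree_if_fail, subtree_if_succeed)."""
    if tree in (0, 1):
        return 0.0
    var, on_fail, on_success = tree
    return (costs[var]
            + (1 - p[var]) * bdt_expected_cost(on_fail, costs, p)
            + p[var] * bdt_expected_cost(on_success, costs, p))

# Two-variable AND: test x0; if it fails the function is 0, else test x1.
tree = (0, 0, (1, 0, 1))
# cost = c0 + p0 * c1 = 1.0 + 0.5 * 2.0 = 2.0
print(bdt_expected_cost(tree, costs=[1.0, 2.0], p=[0.5, 0.5]))
```

A nonadaptive strategy is the special case in which, at every depth, both subtrees test the same next variable; adaptive trees may branch differently, which is what the adaptivity gap quantifies.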
Analysis from the review reveals a range of algorithmic performances and limitations. For instance, a 4-approximation algorithm exists for series functions with general probability distributions, meaning its cost is at most four times the optimal. For threshold functions, a 3-approximation algorithm has been developed, while for DNF and CNF representations, an O(log kd)-approximation is achievable, where k and d are the numbers of clauses and terms, respectively. In contrast, read-once functions pose greater challenges: an 8-approximation algorithm is available for nonadaptive strategies, but the adaptivity gap can be as high as Θ(log n) for unit costs and uniform distributions, indicating significant potential losses from using fixed sequences. The review also summarizes results in tables, showing constant-factor approximations for problems like the Stochastic Score Classification Problem (SSCP) and the Stochastic Half-Space Evaluation Problem (SHEP), with factors depending on parameters like the number of half-spaces.
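On toy instances, the benchmark these approximation factors are measured against can be computed exactly: a brute-force dynamic program over partial assignments gives the optimal adaptive expected cost for an arbitrary Boolean function. This is an illustrative sketch (exponential in n, not an algorithm from the review), with all names and the sanity-check instance invented here:

```python
from functools import lru_cache

def optimal_adaptive_cost(f, n, costs, p):
    """Optimal adaptive expected cost of evaluating f on n independent
    0/1 variables with P[x_i = 1] = p[i].  Exponential-time brute force."""
    def completions(assign):
        free = [i for i, v in enumerate(assign) if v is None]
        for bits in range(1 << len(free)):
            full = list(assign)
            for j, i in enumerate(free):
                full[i] = (bits >> j) & 1
            yield tuple(full)

    def determined(assign):
        # f's value is known once every completion of the partial
        # assignment gives the same answer.
        return len({f(c) for c in completions(assign)}) == 1

    @lru_cache(maxsize=None)
    def cost(assign):
        if determined(assign):          # value known: no further tests needed
            return 0.0
        best = float("inf")
        for i, v in enumerate(assign):  # try every untested variable next
            if v is None:
                lo = cost(assign[:i] + (0,) + assign[i + 1:])
                hi = cost(assign[:i] + (1,) + assign[i + 1:])
                best = min(best, costs[i] + (1 - p[i]) * lo + p[i] * hi)
        return best

    return cost((None,) * n)

# Sanity check on a series (AND) system, where the ci/qi ratio rule is
# optimal: testing x1, then x2, then x0 costs 1 + 0.5*3 + 0.5*0.8*4 = 4.1.
print(optimal_adaptive_cost(lambda x: int(all(x)), 3, [4.0, 1.0, 3.0], [0.9, 0.5, 0.8]))
```

Comparing this optimum against the best fixed sequence on small read-once instances is one way to observe the adaptivity gap empirically.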
Contextually, these advancements matter because they apply to numerous practical domains. In network connectivity, algorithms help determine if two nodes are connected in uncertain graphs, relevant for telecommunications or post-disaster assessments. In medical diagnosis, STP models can optimize test sequences to identify conditions quickly and cheaply, though the review notes a lack of integration with real medical data. Other applications include database querying, where efficient evaluation of Boolean expressions reduces response times, and cybersecurity, where testing edges in networks detects attack paths. The batch testing extension, which allows multiple tests to be conducted simultaneously, has led to approximation algorithms with factors close to 1 for series systems, promising near-optimal solutions in settings like quality control or resource-constrained inspections.
Limitations highlighted in the review point to areas needing further research. For read-once functions, optimal strategies remain elusive, with only approximation algorithms available, and the adaptivity gap suggests adaptive strategies may be significantly better but harder to compute. The problem becomes NP-complete for general networks or with precedence constraints, limiting exact solutions to special cases. Additionally, most studies assume independent variables and perfect tests, whereas real-world scenarios often involve dependencies and imperfect tests, though some work extends to these complexities. The review calls for more application-focused research using real data, particularly in fields like machine learning and medical diagnosis, to bridge theoretical algorithms with practical implementation and explore robust optimization against uncertain costs and probabilities.
About the Author
Guilherme A.
Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.