AIResearch AIResearch
Back to articles
Robotics

AI Keeps Planes Flying Safely Despite Hidden Faults

A new reinforcement learning method enables aircraft systems to adapt to gradual component degradation without needing prior fault detection, enhancing safety and reliability in real-time.

AI Research
November 14, 2025
3 min read
AI Keeps Planes Flying Safely Despite Hidden Faults

Imagine an aircraft flying with hidden wear and tear in its fuel system, yet it stays stable and safe without any alarms. This scenario is now possible thanks to a new AI-driven control method that adapts to gradual faults in real-time, eliminating the need for complex fault detection systems. For industries like aviation and transportation, this means improved reliability and reduced risks of catastrophic failures, making everyday travel safer for passengers and cargo.

The key finding from this research is that an adaptive reinforcement learning controller can maintain system performance even as components degrade over time. In tests on a simulated C-130 aircraft fuel system, the AI controller kept the plane's center of gravity stable and minimized fuel imbalance without requiring prior knowledge of faults or a separate diagnosis step. This contrasts with traditional methods that rely on detecting and isolating faults before adjusting control, which can delay responses and increase vulnerability.

To achieve this, the researchers used a mixed online and offline reinforcement learning approach based on the Proximal Policy Optimization (PPO) algorithm. In the online phase, the controller interacts with the real system, caching experiences like state observations and actions. Periodically, it switches to an offline phase where it relearns from these cached experiences using a data-driven model of the degrading system. This allows the AI to adapt to changing conditions efficiently without constant real-world interactions, balancing exploration and stability. The method was applied to a fuel transfer system with six tanks, where valves and pumps could degrade linearly over time, simulating real-world wear.

Results from experiments show that the AI controller consistently outperformed baseline methods. In one trial with a degrading leftmost engine, the controller maintained better fuel balance by selectively opening valves on the opposite side, as illustrated in Figure 5 of the paper. This kept the center of gravity close to the aircraft's longitudinal axis, earning higher rewards based on a function that prioritized stability, low variance in fuel distribution, and minimal valve usage. Aggregate performance over 20 trials, with random degradations in valves and pumps, demonstrated that rewards did not deteriorate as faults progressed, unlike in control schemes where all valves were kept closed or open. Figure 7 in the paper highlights this consistent performance, with the reinforcement learning approach showing robustness across multiple fault scenarios.

In practical terms, this innovation matters because it enhances fault tolerance in critical systems like aircraft, industrial machinery, and autonomous vehicles. By not relying on fault detection, it reduces hardware redundancies and computational demands, potentially lowering costs and increasing operational uptime. For regular readers, this means safer flights and more reliable transportation, as systems can gracefully handle aging components without sudden failures. The approach's ability to learn from experience without pre-defined models makes it adaptable to various applications where conditions change unpredictably.

However, the study has limitations. The method was tested in simulations with linear degradation models, and its performance in real-world abrupt fault scenarios remains unverified. Additionally, the choice of reward functions and neural network architectures may not generalize to all non-linear systems, and further research is needed to ensure convergence guarantees across different environments. The paper notes that future work could explore aperiodic re-learning schedules or applications to systems with memory constraints.

Original Source

Read the complete research paper

View on arXiv

About the Author

Guilherme A.

Guilherme A.

Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.

Connect on LinkedIn