In the high-stakes world of nuclear energy, where safety and efficiency hinge on precise data, a persistent has been the scarcity of experimental datasets. Critical heat flux (CHF), a key safety parameter that marks the limit of effective heat transfer in reactor cores, is particularly affected, with costly and limited measurements hindering advancements in design and predictive modeling. Now, a groundbreaking study from North Carolina State University leverages cutting-edge artificial intelligence to address this issue, employing diffusion models—a class of deep generative models—to generate high-fidelity synthetic CHF data. This research, detailed in a recent paper, demonstrates how these AI systems can learn from existing datasets to produce realistic, physics-consistent samples, potentially revolutionizing data augmentation in energy applications where traditional data collection falls short.
Ologically, the study developed both a vanilla diffusion model (DM) and a conditional diffusion model (CDM) using a public CHF dataset curated by the U.S. Nuclear Regulatory Commission, which includes 24,579 experimental samples covering thermal-hydraulic parameters like pressure, mass flux, and tube diameter. The DM operates by learning the underlying probability distribution through a process of adding and then reversing Gaussian noise over multiple steps, enabling it to generate arbitrary synthetic data that statistically resemble the training set. In contrast, the CDM enhances this by incorporating user-specified conditions, allowing for targeted data generation under specific thermal-hydraulic scenarios. Both models were trained with techniques like exponential moving averages to ensure stability, with the CDM specifically designed to output CHF values corresponding to given inputs, facilitating direct accuracy comparisons and uncertainty quantification through repeated sampling.
Reveal that the vanilla DM successfully captured the empirical distributions and pairwise correlations of the real CHF data, as evidenced by visual comparisons and quantitative metrics like Pearson and Spearman correlation coefficients, which showed strong alignment between generated and actual datasets. For instance, the Kolmogorov-Smirnov distance for joint empirical cumulative distribution functions was a low 0.1265, indicating high similarity. The CDM, however, excelled in targeted generation, achieving a mean absolute relative error of just 6.8% when compared to held-out test data, with most errors within ±25% and an R² value of 0.98 underscoring its predictive accuracy. Uncertainty analysis further demonstrated that the CDM produced stable outputs, with relative standard deviations averaging 4.40%, bolstering confidence in the reliability of the synthetic data for practical applications.
Of this research extend broadly across nuclear engineering and beyond, offering a scalable solution to data scarcity that could enhance the robustness of machine learning models used in safety assessments and reactor design. By enabling the generation of physics-consistent data, the CDM, in particular, supports more accurate predictions and uncertainty-aware analyses, which are critical for avoiding CHF-related failures in operational reactors. This approach not only augments existing datasets but also opens doors for applications in other energy sectors facing similar data limitations, such as renewable energy forecasting, where generative models could simulate rare or expensive-to-measure scenarios.
Despite these advancements, the study acknowledges limitations, such as the models' tendency to generate data primarily within the ranges of the training set, with limited extrapolation to entirely new domains. Future work aims to combine generative modeling with transfer learning to overcome this and further validate physical consistency across varied conditions. Nonetheless, this research marks a significant step forward in applying AI to nuclear energy, demonstrating that diffusion models can effectively bridge data gaps while maintaining scientific rigor and safety standards.
Reference: Alsafadi, F., Akins, A., & Wu, X. (2025). Towards Overcoming Data Scarcity in Nuclear Energy: A Study on Critical Heat Flux with Physics-consistent Conditional Diffusion Model. arXiv preprint arXiv:2511.16207.
Original Source
Read the complete research paper
About the Author
Guilherme A.
Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.
Connect on LinkedIn