JOURNAL ARTICLE

Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning

Yoriyuki YamagataShuang LiuTakumi AkazakiYihai DuanJianye Hao

Year: 2020 Journal:   IEEE Transactions on Software Engineering Vol: 47 (12)Pages: 2823-2840   Publisher: IEEE Computer Society

Abstract

A Cyber-Physical System (CPS) is a system which consists of software components and physical components. Traditional system verification techniques such as model checking or theorem proving are difficult to apply to CPS because the physical components have infinite number of states. To solve this problem, robustness guided falsification of CPS is introduced. Robustness measures how robustly the given specification is satisfied. Robustness guided falsification tries to minimize the robustness by changing inputs and parameters of the system. The input with a minimal robustness (counterexample) is a good candidate to violate the specification. Existing methods use several optimization techniques to minimize robustness. However, those methods do not use temporal structures in a system input and often require a large number of simulation runs to minimize the robustness. In this paper, we explore state-of-the-art Deep Reinforcement Learning (DRL) techniques, i.e., Asynchronous Advantage Actor-Critic (A3C) and Double Deep Q Network (DDQN), to reduce the number of simulation runs required to find such counterexamples. We theoretically show how robustness guided falsification of a safety property is formatted as a reinforcement learning problem. Then, we experimentally compare the effectiveness of our methods with three baseline methods, i.e., random sampling, cross entropy and simulated annealing, on three well known CPS systems. We thoroughly analyse the experiment results and identify two factors of CPS which make DRL based methods better than existing methods. The most important factor is the availability of the system internal dynamics to the reinforcement learning algorithm. The other factor is the existence of learnable structure in the counterexample.

Keywords:
Robustness (evolution) Computer science Cyber-physical system Reinforcement learning Counterexample Asynchronous communication Physical system Model checking Artificial intelligence Distributed computing Machine learning Theoretical computer science Mathematics

Metrics

49
Cited By
5.00
FWCI (Field Weighted Citation Impact)
57
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Formal Methods in Verification
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Safety Systems Engineering in Autonomy
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality

Related Documents

JOURNAL ARTICLE

Adaptive workload adjustment for cyber-physical systems using deep reinforcement learning

Shikang XuIsrael KorenC.M. Krishna

Journal:   Sustainable Computing Informatics and Systems Year: 2021 Vol: 30 Pages: 100525-100525
JOURNAL ARTICLE

Falsification of Cyber-Physical Systems Using PDDL+ Planning

Diego AinetoEnrico ScalaEva OnaindíaIvan Serina

Journal:   Proceedings of the International Conference on Automated Planning and Scheduling Year: 2023 Vol: 33 (1)Pages: 2-6
JOURNAL ARTICLE

Falsification of Cyber-physical Systems Using Bayesian Optimization

Zahra RamezaniKenan ŠehićLuigi NardiKnut Åkesson

Journal:   ACM Transactions on Embedded Computing Systems Year: 2025 Vol: 24 (3)Pages: 1-23
© 2026 ScienceGate Book Chapters — All rights reserved.