Design of a Highly Reliable Measurement and Control System with Fault Diagnosis Based on a Real-Time Operating System
2023, 31(2): 8-14
Abstract: Ground measurement and control systems must remain highly reliable while powered on for long periods. The dual-machine redundancy scheme currently in use does not take the software and hardware state of the system into account, so the redundancy itself may fail during long-term operation. To address this problem, a system fault diagnosis and fault-tolerance method is proposed. Abnormal conditions in the main fault sources, including system task status, CPU temperature, CPU utilization, remaining disk space, and I/O operation status, are studied and analyzed comprehensively. Real-time task monitoring, the least squares method, and a hash algorithm are the key techniques used to implement fault diagnosis and fault-tolerant handling. Verification in an actual model project shows that the method meets the system's high-reliability requirements.
Key words: high reliability; measurement and control system; fault diagnosis; fault-tolerant processing; real-time operating system (RTOS)
Received: 2022-10-20
Funding:
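
The abstract names the least squares method as one of the diagnostic techniques but does not say which monitored quantity it is applied to. As a minimal sketch, assuming it is used to fit a linear trend to periodic samples of remaining disk space and to estimate when that trend will cross an alarm threshold, the idea could look as follows. The function names `fit_line` and `hours_until_below`, the sample values, and the 80 GB threshold are all hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) samples of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def hours_until_below(
    hours: Sequence[float], free_gb: Sequence[float], threshold_gb: float
) -> Optional[float]:
    """Estimate hours after the last sample until the fitted trend
    reaches threshold_gb. Returns None if the trend is not decreasing."""
    slope, intercept = fit_line(hours, free_gb)
    if slope >= 0:
        return None  # free space is flat or growing: no predicted crossing
    t_cross = (threshold_gb - intercept) / slope
    return max(0.0, t_cross - hours[-1])


# Hypothetical samples: free disk space (GB) logged once per hour.
sample_hours = [0.0, 1.0, 2.0, 3.0, 4.0]
sample_free_gb = [100.0, 98.0, 96.0, 94.0, 92.0]
remaining = hours_until_below(sample_hours, sample_free_gb, threshold_gb=80.0)
```

Fitting a trend instead of comparing each sample against a fixed limit lets a monitor raise a warning before the limit is actually reached, which is one plausible reason to use least squares in a long-running system. Whether the paper uses it this way is an assumption.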
