A Fault Injection Platform for Availability Evaluation of IA64 System

Article Preview

Abstract:

Availability is one of the key features for transaction processing systems. Fault injection is one of the important techniques to hasten availability tests. A fault injection platform is designed in this paper for IA64 system which is being widely used in transaction processing business. The fault injection platform is implemented using client/server mode and a series of fault injection tools are accomplished which covers CPU faults, memory faults, disk faults, IO faults, file system faults and system call boundary errors. A group of experiments on a typical IA64 server are described and performed, and the experiment results validate the effectiveness of the fault injection platform.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

535-540

Citation:

Online since:

June 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] S. Dawson, F. Jahanian, T. L. Tung. Testing of Fault-Tolerant and Real-Time Distributed Systems via Protocol Fault Injection. Proceedings of the 26th IEEE International Symposium on Fault Tolerant Computing (FTCS-26), Sendai, Japan, 1996: 404-414.

DOI: 10.1109/ftcs.1996.534626

Google Scholar

[2] J. Karlsson, J. Arlat, G. Leber. Application of Three Physical Fault Injection Techniques to the Experimental Assessment of the MARS Architecture. Proceedings of the Fifth Annual IEEE Working Conference of Dependable Computing for Critical Applications. IEEE CS Press, Los, California, 1995: 150-161.

DOI: 10.1007/978-3-642-79789-7_18

Google Scholar

[3] J. Arlat, Y. Crouzet, J. C. Laprie. Fault Injection for Dependability Validation of Fault-Tolerant Computing Systems. Proceedings of the 19th IEEE International Symposium on Fault Tolerant Computing (FTCS-19), Chicago, IL, 1989: 348-355.

DOI: 10.1109/ftcs.1989.105591

Google Scholar

[4] J. Arlat, M. Aguera, Y. Crouzet, et al. Experimental Evaluation of the Fault Tolerance of an Atomic Multicast System. IEEE Transactions on Reliability. 1990, 39(4): 455-467.

DOI: 10.1109/24.58723

Google Scholar

[5] H. Madeira, J. G. Silva. Experimental Evaluation of the Fail-Silent Behavior in Computers Without Error Masking. Proceedings of the 24th IEEE International Symposium on Fault Tolerant Computing (FTCS-24), Austin, Texas, 1994: 350-359.

DOI: 10.1109/ftcs.1994.315625

Google Scholar

[6] H. Madeira, M. Z. Rela, F. Moreira, et al. RIFLE: A General Purpose Pin-level Fault Injector. Proc. EDCC-1, Berlin Germany, 1994: 199-216.

DOI: 10.1007/3-540-58426-9_132

Google Scholar

[7] J. H. Barton. Fault Injection Experiments Using FIAT. IEEE Transactions on Computers. 1990, 39(4): 575-582.

DOI: 10.1109/12.54853

Google Scholar

[8] G. A. Kanawati. FERRARI: A Flexible Software-Based Fault and Error Injection System. IEEE Transactions on computers. 1995, 44(2): 248-260.

DOI: 10.1109/12.364536

Google Scholar

[9] T. K. Tsai, R. K. Iyer. Measuring Fault Tolerance with the FTAPE Fault Injection Tool. In Proc. of Performance Tools, Heidelberg, Germany, 1995: 26-40.

DOI: 10.2514/6.1995-1041

Google Scholar

[10] S. Han. Experiment Evaluation of Failure-Detection Schemes in Real-Time Communication Networks. In: Proceedings of the 27th fault tolerant computing, WA, Seattle, 1997: 122-131.

DOI: 10.1109/ftcs.1997.614085

Google Scholar

[11] B. Miller, D. Koski, C. Lee, et al. Fuzz Revisited: A Re-examination of the Reliability of UNIX Utilities and Services. Computer Science Technical Report 1268. University of Wisconsin Madison, 1998: 2-4.

Google Scholar

[12] J. Carreira, H. Madeira, J. Silva. Xception: A Technique for the Experimental Evaluation of Dependability in Modern Computers. IEEE Transactions on Software Engineering. 1998, 24(2): 125-136.

DOI: 10.1109/32.666826

Google Scholar

[13] G. A. Kanawati. FERRARI: A Flexible software-Based fault and error injection system. IEEE Transactions on computers. 1995, 44(2): 248-260.

DOI: 10.1109/12.364536

Google Scholar

[14] J. Arlat, Y. Crouzet, J. Karlsson, P. Folkesson, E. Fuchs, G. H. Leber. Comparison of Physical and Software-Implemented Fault Injection Techniques. IEEE Transactions on Computers. 2003, 52(8): 1115-1133.

DOI: 10.1109/tc.2003.1228509

Google Scholar

[15] A. Albinet, J. Arlat and J. C. Fabre. Characterization of the Impact of Faulty Drivers on the Robustness of the Linux Kernel. Proc. Int. Conf. on Dependable Systems and Networks (DSN 2004), Florence, Italy, IEEE CS Press, (2004).

DOI: 10.1109/dsn.2004.1311957

Google Scholar