Abstract
Jgroup/ARM is a middleware framework for operating dependable distributed applications based on Java. Jgroup integrates the distributed object models of Java RMI and Jini with the object group communication paradigm, enabling the construction of groups of replicated server objects that provide dependable services to clients. ARM provides automated mechanisms for distributing replicas to host processors and recovering from replica failures.
This paper describes an approach based on stratified sampling combined with fault injections for estimating the dependability attributes of a service deployed using the Jgroup/ARM middleware framework. A first experimental evaluation is performed, focusing on a service provided by a triplicated server, and indicative predictions of various dependability attributes of the service are obtained. The evaluation indicates that very high availability and a long mean time between failures (MTBF) may be achieved for services based on Jgroup/ARM.
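The abstract does not give the strata or the measurements used in the evaluation, so the following is only a minimal sketch of a generic stratified estimator, not the authors' procedure. It assumes that fault scenarios are partitioned into classes with known probabilities and that each fault injection experiment yields one numeric outcome (for example, observed service downtime in seconds). The names `Stratum` and `stratified_estimate`, the weights, and the sample values are all hypothetical.

```python
from dataclasses import dataclass
from math import sqrt
from statistics import mean, variance
from typing import Sequence, Tuple


@dataclass
class Stratum:
    """One class of fault scenarios (hypothetical, for illustration only)."""
    weight: float              # assumed probability that a scenario falls in this class
    samples: Sequence[float]   # outcome of each injection experiment in this class


def stratified_estimate(strata: Sequence[Stratum]) -> Tuple[float, float]:
    """Return (estimate, standard_error) of the weighted mean outcome.

    Each stratum needs at least two samples so that its sample variance exists.
    """
    total_weight = sum(s.weight for s in strata)
    if abs(total_weight - 1.0) > 1e-9:
        raise ValueError("stratum weights must sum to 1")
    # Weighted sum of the per-stratum sample means.
    estimate = sum(s.weight * mean(s.samples) for s in strata)
    # Variance of the stratified estimator: sum over strata of w_k^2 * s_k^2 / n_k.
    var = sum(s.weight ** 2 * variance(s.samples) / len(s.samples) for s in strata)
    return estimate, sqrt(var)


# Made-up numbers: a common class of single failures and a rare class
# of near-coincident failures with much longer outages.
strata = [
    Stratum(weight=0.9, samples=[2.0, 4.0]),
    Stratum(weight=0.1, samples=[10.0, 30.0]),
]
estimate, std_error = stratified_estimate(strata)
```

Sampling each class separately lets rare but costly failure scenarios be exercised with a fixed number of injections, while the weights keep their contribution to the overall estimate in proportion to how often they are assumed to occur.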
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Helvik, B.E., Meling, H., Montresor, A. (2005). An Approach to Experimentally Obtain Service Dependability Characteristics of the Jgroup/ARM System. In: Dal Cin, M., Kaâniche, M., Pataricza, A. (eds) Dependable Computing - EDCC 5. EDCC 2005. Lecture Notes in Computer Science, vol 3463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408901_13
DOI: https://doi.org/10.1007/11408901_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25723-3
Online ISBN: 978-3-540-32019-7
eBook Packages: Computer Science, Computer Science (R0), Springer Nature Proceedings Computer Science
