hrcak mascot   Srce   HID

Izvorni znanstveni članak
https://doi.org/10.2498/cit.1001706

A Recovery Scheme for Cluster Federations Using Sender-based Message Logging

Bidyut Gupta ; Southern Illinois University, Carbondale, IL, USA
Ruslan Nikolaev ; Southern Illinois University, Carbondale, IL, USA
Raja Chirra ; Southern Illinois University, Carbondale, IL, USA

Puni tekst: engleski, pdf (257 KB) str. 127-139 preuzimanja: 295* citiraj
APA 6th Edition
Gupta, B., Nikolaev, R. i Chirra, R. (2011). A Recovery Scheme for Cluster Federations Using Sender-based Message Logging. Journal of computing and information technology, 19 (2), 127-139. https://doi.org/10.2498/cit.1001706
MLA 8th Edition
Gupta, Bidyut, et al. "A Recovery Scheme for Cluster Federations Using Sender-based Message Logging." Journal of computing and information technology, vol. 19, br. 2, 2011, str. 127-139. https://doi.org/10.2498/cit.1001706. Citirano 25.02.2020.
Chicago 17th Edition
Gupta, Bidyut, Ruslan Nikolaev i Raja Chirra. "A Recovery Scheme for Cluster Federations Using Sender-based Message Logging." Journal of computing and information technology 19, br. 2 (2011): 127-139. https://doi.org/10.2498/cit.1001706
Harvard
Gupta, B., Nikolaev, R., i Chirra, R. (2011). 'A Recovery Scheme for Cluster Federations Using Sender-based Message Logging', Journal of computing and information technology, 19(2), str. 127-139. https://doi.org/10.2498/cit.1001706
Vancouver
Gupta B, Nikolaev R, Chirra R. A Recovery Scheme for Cluster Federations Using Sender-based Message Logging. Journal of computing and information technology [Internet]. 2011 [pristupljeno 25.02.2020.];19(2):127-139. https://doi.org/10.2498/cit.1001706
IEEE
B. Gupta, R. Nikolaev i R. Chirra, "A Recovery Scheme for Cluster Federations Using Sender-based Message Logging", Journal of computing and information technology, vol.19, br. 2, str. 127-139, 2011. [Online]. https://doi.org/10.2498/cit.1001706

Sažetak
A cluster federation is a union of clusters and is heterogeneous. Each cluster contains a certain number of processes. An application running in such a computing environment is divided into communicating modules so that these modules can run on different clusters. To achieve fault-tolerance different clusters may employ different check pointing schemes. For example, some may use coordinated schemes, while some other may use communication-induced schemes. It may complicate the recovery process. In this paper, we have addressed the complex problem of recovery for cluster computing environment. The proposed approach handles both inter cluster orphan and lost messages unlike the existing works in this area. We first propose an algorithm to determine a recovery line so that there does not exist any inter cluster orphan message between any pair of the cluster level check points belonging to the recovery line. The main feature of the proposed algorithm is that it can be executed simultaneously by all clusters in the cluster federation. Next we apply the sender-based message logging idea to effectively handle all inter cluster lost messages to ensure correctness of computation.

Ključne riječi
cluster federation; cluster level; checkpoint; recovery

Hrčak ID: 71050

URI
https://hrcak.srce.hr/71050

Posjeta: 444 *