Loading…

Practical and low-overhead masking of failures of TCP-based servers

This article describes an architecture that allows a replicated service to survive crashes without breaking its TCP connections. Our approach does not require modifications to the TCP protocol, to the operating system on the server, or to any of the software running on the clients. Furthermore, it r...

Full description

Saved in:
Bibliographic Details
Published in:ACM transactions on computer systems 2009-05, Vol.27 (2), p.7
Main Authors: Zagorodnov, Dmitrii, Marzullo, Keith, Alvisi, Lorenzo, Bressoud, Thomas C
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This article describes an architecture that allows a replicated service to survive crashes without breaking its TCP connections. Our approach does not require modifications to the TCP protocol, to the operating system on the server, or to any of the software running on the clients. Furthermore, it runs on commodity hardware. We compare two implementations of this architecture (one based on primary/backup replication and another based on message logging) focusing on scalability, failover time, and application transparency. We evaluate three types of services: a file server, a Web server, and a multimedia streaming server. Our experiments suggest that the approach incurs low overhead on throughput, scales well as the number of clients increases, and allows recovery of the service in near-optimal time. [PUBLICATION ABSTRACT]
ISSN:0734-2071
1557-7333