loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2nd International Conference on Dependability of Computer Systems (DepCoS-RELCOMEX '07)
Towards Reliability and Fault-Tolerance of Distributed Stream Processing System
Szklarska Poreba, Poland
June 14-June 16
ISBN: 0-7695-2850-3
Marcin Gorawski, Silesian University of Technology
Pawel Marks, Silesian University of Technology
Not so long ago data warehouses were used to process data sets loaded periodically. We could distinguish two kinds of ETL processes: full and incremental. Now we often have to process real-time data and analyse them almost on-the-fly, so the analysis are always up to date. There are many possible applications for real-time data warehouses. In most cases two features are important: delivering data to the warehouse as quick as possible, and not losing any tuple in case of failures. In this paper we propose an architecture for gathering and processing data from geographically distributed data sources. We present theoretical analysis, mathematical model of a data source, and some rules of system modules configuration. At the end of the paper our future plans are described briefly.
Citation:
Marcin Gorawski, Pawel Marks, "Towards Reliability and Fault-Tolerance of Distributed Stream Processing System," depcos-relcomex, pp.246-253, 2nd International Conference on Dependability of Computer Systems (DepCoS-RELCOMEX '07), 2007
Usage of this product signifies your acceptance of the Terms of Use.