XXIV International Conference of the Chilean Computer Science Society (SCCC'04)
Compressing Distributed Text in Parallel with (s, c)-Dense Codes
Arica, Chile
November 11-November 12
ISBN: 0-7695-2200-9
Carolina Bonacic, Universidad de Chile; Centro de Estudios del Cuaternario CEQUA, Chile
Mauricio Mar?, Centro de Estudios del Cuaternario CEQUA, Chile; Universidad de Magallanes, Chile
Systems able to cope with very large text collections are making intensive use of distributed memory parallel computing platforms such as Clusters of PCs. This is particularly evident in Web Search Engines which must resort to parallelism in order to deal efficiently with both high rates of queries per unit time and high space requirements in the form of large numbers of small documents stored in secondary memory. Those documents can be stored in compressed format to reduce memory space and communication time. This paper proposes a parallel algorithm for compressing text in such a distributed memory environment. We show efficient performance against the usual-practice alternative of compressing the whole text on a single machine.
Citation:
Carolina Bonacic, Antonio Fari?, Mauricio Mar?, Nieves R. Brisaboa, "Compressing Distributed Text in Parallel with (s, c)-Dense Codes," sccc, pp.93-98, XXIV International Conference of the Chilean Computer Science Society (SCCC'04), 2004