IEEE Computer Society Bioinformatics Conference (CSB'02)
An Index Structure for Pattern Similarity Searching in DNA Microarray Data
Stanford, California
August 14-August 16
ISBN: 0-7695-1653-X
Haixun Wang, IBM T. J. Watson Research Center
Chang-Shing Perng, IBM T. J. Watson Research Center
Wei Fan, IBM T. J. Watson Research Center
Philip S. Yu, IBM T. J. Watson Research Center
DNA microarray technology is about to bring an explosion of gene expression data that may dwarf even the human genome sequencing projects. Researchers are motivated to identify genes whose expression levels rise and fall coherently under a set of experimental perturbations, that is, genes that exhibit fluctuations of a similar shape when conditions change. In this paper, we show that queries based on pattern correlations against large-scale microarray databases can be supported by the weighted-sequence model, an index structure designed for sequence matching. A weighted sequence is a two-dimensional structure in which each element of the sequence is associated with a weight. We transform the DNA microarray data, as well as pattern-based queries, into weighted sequences, and use subsequence matching algorithms to retrieve from the database all genes that match the query pattern. We demonstrate, using both synthetic and real-world data sets, that our method is effective and efficient.
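The notion of genes that "exhibit fluctuation of a similar shape" can be illustrated with a small brute-force sketch. This is not the weighted-sequence index described in the paper, whose construction the abstract does not detail; it only shows one plausible reading of the query semantics. It assumes that two expression profiles have the same shape when they differ by a roughly constant offset, within a tolerance `eps`. The names `shape`, `matches`, `query`, `EXPRESSION`, and the sample values are all hypothetical.

```python
from typing import Dict, List, Sequence


def shape(levels: Sequence[float]) -> List[float]:
    """Express each level relative to the first condition.

    Subtracting the first value removes a constant offset, so two
    profiles that rise and fall together yield similar shapes.
    """
    base = levels[0]
    return [v - base for v in levels]


def matches(levels: Sequence[float], pattern: Sequence[float], eps: float) -> bool:
    """Return True if `levels` follows `pattern` within `eps` at every condition.

    Assumption: similarity means equal shape up to a constant shift.
    """
    if len(levels) != len(pattern):
        return False
    return all(abs(a - b) <= eps for a, b in zip(shape(levels), shape(pattern)))


def query(db: Dict[str, Sequence[float]], pattern: Sequence[float], eps: float = 0.5) -> List[str]:
    """Linear scan over all genes; an index would avoid visiting every row."""
    return sorted(gene for gene, levels in db.items() if matches(levels, pattern, eps))


# Hypothetical expression levels for three genes under four conditions.
EXPRESSION = {
    "geneA": [1.0, 3.0, 2.0, 5.0],
    "geneB": [4.1, 6.0, 5.2, 8.0],  # same shape as geneA, shifted upward
    "geneC": [5.0, 2.0, 4.0, 1.0],  # moves in the opposite direction
}

if __name__ == "__main__":
    print(query(EXPRESSION, [0.0, 2.0, 1.0, 4.0], eps=0.3))
```

A scan like this visits every gene for every query, which is the cost an index structure such as the one proposed in the paper is meant to avoid on large-scale microarray databases.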
Citation:
Haixun Wang, Chang-Shing Perng, Wei Fan, Philip S. Yu, "An Index Structure for Pattern Similarity Searching in DNA Microarray Data," IEEE Computer Society Bioinformatics Conference (CSB'02), p. 256, 2002