loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
15th International Conference on Scientific and Statistical Database Management
Database Support for 3D-Protein Data Set Analysis
Cambridge, Massachusetts, USA
July 09-July 11
ISBN: 0-7695-1964-4
Alexander Hinneburg, University of Halle, Germany
Wolfgang Lehner, Dresden University of Technology, Germany
The progress in genome research demands for an adequate infrastructure to analyse the data sets. Database systems reflect a key technology to organize data and speed up the analysis process.
This paper discusses the role of a relational database system based on the problem of finding frequent substructures in multi-dimensional protein databases. The specific problem consists of producing a set of association rules regarding frequent substructures with different lengths and gaps between the amino acid residues of a protein. From a database point of view, the process of finding association rules building the base for a more in-depth analysis of the data material is split into two parts. The first part performs a discretization of the conformational angle space of a single amino acid residue by computing the nearest neighbour of a given set of representatives. The second part consists in adapting a well-known association rule algorithm to determine the frequent substructures. Both steps within this comprehensive analysis task requires substantial support of the underlying database in order to reduce the programming overhead at the application level.
Citation:
Alexander Hinneburg, Wolfgang Lehner, "Database Support for 3D-Protein Data Set Analysis," ssdbm, pp.161, 15th International Conference on Scientific and Statistical Database Management, 2003
Usage of this product signifies your acceptance of the Terms of Use.