loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Data Engineering (ICDE'00)
San Diego, California
February 28-March 03
ISBN: 0-7695-0506-6
Stefan Berchtold, University of Halle
Daniel A. Keim, University of Halle
During recent years, a variety of new database applications has been developed which substantially differ from conventional database applications. For example, new database applications such as data warehousing produce very large relations which require a multidimensional view on the data, and in areas such as multimedia and CAD a content-based search is essential which is often implemented using some kind of feature vectors. All the new applications have in common that the underlying database system has to support the processing of queries on large amounts of high-dimensional data. Now, we may ask what the difference is between processing low- and high-dimensional data. A result of recent research activities is that basically none of the querying and indexing techniques, which provide good results on low-dimensional data, also performs sufficiently well on higher-dimensional data. The problem of dealing with high-dimensional spaces has therefore been addressed in a variety of recent database research projects. The goal of the tutorial is to spread the knowledge about high-dimensional spaces and the proposed techniques to a large community of both, researchers and practitioners 3/4 researchers who are interested in querying and indexing techniques for high-dimensional data, and practitioners who are interested in the state-of-the art of database support for their applications. Also, the tutorial will be very interesting for non-database computer scientists because the problem of dealing with high-dimensional spaces has a large number of other applications such as robot motion planning, optimization problems, and visualization techniques. Therefore, a large part of the tutorial is dedicated to convey the understanding of the effects occurring in these spaces.The tutorial is structured as follows: In the first section, we describe two examples of new database applications, which demonstrate the need for efficient query processing techniques in high-dimensional spaces. In the second section, we discuss the effects occurring in high-dimensional spaces - first from a pure mathematical point of view and then from a database perspective. Next, we describe the different approaches for modeling the costs of processing queries on high-dimensional data. The description of the different approaches demonstrates nicely what happens if we ignore the special properties of high-dimensional spaces. In the fourth section, we then provide a structured overview of the proposed querying and indexing techniques, discussing their advantages and drawbacks. In this section, we also cover a number of additional techniques dealing with optimization and parallelization. In concluding the tutorial, we try to stir further research activities by presenting a number of interesting research problems.
Citation:
Stefan Berchtold, Daniel A. Keim, "Tutorial 5: Indexing High-Dimensional Spaces: Database Support for Next Decade's Applications," icde, pp.698, 16th International Conference on Data Engineering (ICDE'00), 2000
Usage of this product signifies your acceptance of the Terms of Use.