loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
ACS/IEEE 2005 International Conference on Computer Systems and Applications (AICCSA'05)
Scalability of OAT
Cairo, Egypt
January 03-January 06
ISBN: 0-7803-8735-X
J. Mizher, Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
M.H. Dunham, Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
Lin Lu, Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
Yongqiao Xiao, Lab. d'Informatique pour la Mecanique, Sci. de lngenieur, France
Summary form only given. Mining user access patterns from clickstream data has attracted much attention from the research community. However, the scalability testing of corresponding mining algorithms has been virtually ignored. Memory requirements of these algorithms may be quite large due to the fact that in-memory data structures whose size depends on the number and length of patterns is often assumed. Due to the importance of the scalability of algorithms to the usefulness of the Web usage mining (WUM) techniques, we propose two new sampling techniques, continuous and random, which can be applied to static sized test datasets to examine WUM algorithm scalability. We illustrate the usefulness of these scalability approaches by performing scalability tests using the online adaptive traversal (OAT) pattern mining algorithm. These experiments show that indeed the OAT algorithm adjusts to the amount of memory and time requirements grow at a linear rate. This paper has several results: 1. The OAT algorithm is shown to be scalable in both space and time. The time grows at a linear rate, while the space adapts to available memory through compression. 2. Two sampling techniques are presented which facilitate the performance of scalability experiments against fixed size Web logs. 3. The impact of spiders crawling on the Web can have a disastrous impact on programs running to collect WUM statistics and patterns.
Citation:
J. Mizher, M.H. Dunham, Lin Lu, Yongqiao Xiao, "Scalability of OAT," aiccsa, pp.48-I, ACS/IEEE 2005 International Conference on Computer Systems and Applications (AICCSA'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.