The Community for Technology Leaders
2002 IEEE International Conference on Data Mining, 2002. Proceedings. (2002)
Maebashi City, Japan
Dec. 9, 2002 to Dec. 12, 2002
ISBN: 0-7695-1754-4
TABLE OF CONTENTS

Program Committee (PDF)

pp. xviii

Non-PC Reviewers (PDF)

pp. xxi
Main-Track Regular Papers

Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing (Abstract)

Naoki Abe , I.B.M. T. J. Watson Research Center
Edwin Pednault , I.B.M. T. J. Watson Research Center
Haixun Wang , I.B.M. T. J. Watson Research Center
Bianca Zadrozny , I.B.M. T. J. Watson Research Center
Wei Fan , I.B.M. T. J. Watson Research Center
Chid Apte , I.B.M. T. J. Watson Research Center
pp. 3

Investigative Profiling with Computer Forensic Log Data and Association Rules (Abstract)

Tamas Abraham , Defence Science and Technology Organisation
Olivier de Vel , Defence Science and Technology Organisation
pp. 11

Text Document Categorization by Term Association (Abstract)

Maria-Luiza Antonie , University of Alberta
Osmar R. Zaïane , University of Alberta
pp. 19

Online Algorithms for Mining Semi-structured Data Stream (Abstract)

Tatsuya Asai , Kyushu University
Hiroki Arimura , Kyushu University and PRESTO
Kenji Abe , Kyushu University
Shinji Kawasoe , Kyushu University
pp. 27

A Lazy Approach to Pruning Classification Rules (Abstract)

Elena Baralis , Politecnico di Torino
Paolo Garza , Politecnico di Torino
pp. 35

High Performance Data Mining Using the Nearest Neighbor Join (Abstract)

Christian Böhm , University for Health Informatics and Technology
Florian Krebs , University of Munich
pp. 43

Mining General Temporal Association Rules for Items with Different Exhibition Periods (Abstract)

Cheng-Yue Chang , National Taiwan University
Ming-Syan Chen , National Taiwan University
Chang-Hung Lee , National Taiwan University
pp. 59

Learning with Progressive Transductive Support Vector Machine (Abstract)

Yisong Chen , Peking University
Guoping Wang , Peking University
Shihai Dong , Peking University
pp. 67

Evolutionary Time Series Segmentation for Stock Data Mining (Abstract)

Fu-lai Chung , Hong Kong Polytechnic University
Tak-chung Fu , Hong Kong Polytechnic University
Robert Luk , Hong Kong Polytechnic University
Vincent Ng , Hong Kong Polytechnic University
pp. 83

Using functional PCA for cardiac motion exploration (Abstract)

Denis Clot , Universit? Claude Bernard Lyon 1
pp. 91

Unsupervised Segmentation of Categorical Time Series into Episodes (Abstract)

Paul Cohen , University of Massachusetts
Brent Heeringa , University of Massachusetts
Niall Adams , Imperial College
pp. 99

Speed-up Iterative Frequent Itemset Mining with Constraint Changes (Abstract)

Gao Cong , National University of Singapore
Bing Liu , National University of Singapore
pp. 107

Feature Selection for Clustering - A Filter Solution (Abstract)

Manoranjan Dash , Northwestern University
Kiseok Choi , Northwestern University
Peter Scheuermann , Northwestern University
Huan Liu , Arizona State University
pp. 115

A Theory of Inductive Query Answering (Abstract)

Luc DE RAEDT , University of Freiburg
Manfred JAEGER , University of Freiburg, Univ. of Helsinki, and MPI Informatik
Sau Dan LEE , University of Freiburg
Heikki MANNILA , Univ. of Helsinki
pp. 123

Iterative Clustering of High Dimensional Text Data Augmented by Local Search (Abstract)

Inderjit S. Dhillon , University of Texas
Yuqiang Guan , University of Texas
J. Kogan , University of Maryland Baltimore County
pp. 131

Cluster merging and splitting in hierarchical clustering algorithms (Abstract)

Chris Ding , Lawrence Berkeley National Laboratory
Xiaofeng He , Lawrence Berkeley National Laboratory
pp. 139

Adaptive dimension reduction for clustering high dimensional data (Abstract)

Chris Ding , Lawrence Berkeley National Laboratory
Xiaofeng He , Lawrence Berkeley National Laboratory
Hongyuan Zha , Pennsylvania State University,
Horst D. Simon , Lawrence Berkeley National Laboratory
pp. 147

Modal-style operators in qualitative data analysis (Abstract)

Ivo Düntsch , Brock University
Günther Gediga , Institut f?r Evaluation und Marktanalysen
pp. 155

Progressive Modeling (Abstract)

Wei Fan , IBM T.J.Watson Research
Haixun Wang , IBM T.J.Watson Research
Philip S. Yu , IBM T.J.Watson Research
Shaw-hwa Lo , Columbia Univ.
Salvatore Stolfo , Columbia Univ.
pp. 163

Discriminative Category Matching: Efficient Text Classification for Huge Discriminative Category Matching: Efficient Text Classification for Huge (Abstract)

Gabriel Pui Cheong Fung , Chinese University of Hong Kong
Jeffrey Xu Yu , Chinese University of Hong Kong
Hongjun Lu , Hong Kong University of Science & Technology
pp. 187

Using Text Mining to Infer Semantic Attributes for Retail Data Mining (Abstract)

Rayid Ghani , Accenture Technology Labs
Andrew E. Fano , Accenture Technology Labs
pp. 195

Phrase-based Document Similarity Based on an Index Graph Model (Abstract)

Khaled M. Hammouda , University of Waterloo
Mohamed S. Kamel , University of Waterloo
pp. 203

Mining Top.K Frequent Closed Patterns without Minimum Support (Abstract)

Jiawei Han , University of Illinois at Urbana.Champaign
Jianyong Wang , University of Illinois at Urbana.Champaign
Ying Lu , University of Illinois at Urbana.Champaign
Petre Tzvetkov , University of Illinois at Urbana.Champaign
pp. 211

Mining Generalized Association Rules Using Pruning Techniques (Abstract)

Yin-Fu Huang , National Yunlin University of Science and Technology
Chiech-Ming Wu , National Yunlin University of Science and Technology
pp. 227

A Formal Model for User Preference (Abstract)

Sung Young Jung , LG Electronics Institute of Technology
Jeong-Hee Hong , LG Electronics Institute of Technology
Taek-Soo Kim , LG Electronics Institute of Technology
pp. 235

Convex Hull Ensemble Machine (Abstract)

Yongdai Kim , Ewha Womans University
pp. 243

Discovering Frequent Geometric Subgraphs (Abstract)

Michihiro Kuramochi , University of Minnesota
George Karypis , University of Minnesota
pp. 258

Adapting classification rule induction to subgroup discovery (Abstract)

Nada Lavrac , Jozef Stefan Institute
Peter Flach , University of Bristol
Branko Kavsek , Jozef Stefan Institute
Ljupco Todorovski , Jozef Stefan Institute
pp. 266

Linear Causal Model Discovery Using the MML criterion (Abstract)

Gang Li , Deakin University
Honghua Dai , Deakin University
Yiqing Tu , Deakin University
pp. 274

O-Cluster: Scalable Clustering of Large High Dimensional Data Sets (Abstract)

Boriana L. Milenova , Oracle Data Mining Technologies
Marcos M. Campos , Oracle Data Mining Technologies
pp. 290

Employing Discrete Bayes Error rate for discretization and feature selection (Abstract)

Ankush Mittal , National University of Singapore
Loong-Fah Cheong , National University of Singapore
pp. 298

Feature Selection Algorithms: A Survey and Experimental Evaluation (Abstract)

Luis Carlos Molina , Universitat Polit?cnica de Catalunya
Lluís Belanche , Universitat Polit?cnica de Catalunya
Àngela Nebot , Universitat Polit?cnica de Catalunya
pp. 306

Mining Association Rules from Stars (Abstract)

Eric Ka Ka Ng , Chinese University of Hong Kong
Ada Wai-Chee Fu , Chinese University of Hong Kong
Ke Wang , Simon Fraser University
pp. 322

PERUSE: An Unsupervised Algorithm for Finding Recurrig Patterns in Time Series (Abstract)

Tim Oates , University of Maryland Baltimore County
pp. 330

Adaptive and Resource-Aware Mining of Frequent Sets (Abstract)

P. Palmerini , Universit? Ca' Foscari and Consiglio Nazionale delle Ricerche
R. Perego , Consiglio Nazionale delle Ricerche
F. Silvestri , Consiglio Nazionale delle Ricerche and Universit? di Pisa
pp. 338

A new implementation technique for fast Spectral based document retrieval systems (Abstract)

Laurence A. F. Park , University of Melbourne
Marimuthu Palaniswami , University of Melbourne
Kotagiri Ramamohanarao , University of Melbourne
pp. 346

Efficient Discovery of Common Substructures in Macromolecules (Abstract)

Srinivasan Parthasarathy , Ohio State University
Matt Coatney , Ohio State University
pp. 362

Mining Motifs in Massive Time Series Databases (Abstract)

Pranav Patel , University of California - Riverside
Eamonn Keogh , University of California - Riverside
Jessica Lin , University of California - Riverside
Stefano Lonardi , University of California - Riverside
pp. 370

On Computing Condensed Frequent Pattern Bases (Abstract)

Jian Pei , State Univ. of New York at Buffalo
Guozhu Dong , Wright State Univ.
Wei Zou , Jiangxi Normal Univ.
Jiawei Han , Univ. of Illinois
pp. 378

Automatic Web Page Classification in a Dynamic and Hierarchical Way (Abstract)

XIAOGANG PENG , Louisiana Tech University
BEN CHOI , Louisiana Tech University
pp. 386

User-directed Exploration of Mining Space with Multiple Attributes (Abstract)

Chang-Shing Perng , IBM Thomas J. Watson Research Center
Haixun Wang , IBM Thomas J. Watson Research Center
Sheng Ma , IBM Thomas J. Watson Research Center
Joseph L. Hellerstein , IBM Thomas J. Watson Research Center
pp. 394

On a Capacity Control Using Boolean Kernels for the Learning of Boolean Functions (Abstract)

Ken Sadohara , National Institute of Advanced Industrial Science and Technology
pp. 410

Objective-Oriented Utility-Based Association Mining (Abstract)

Yi-Dong Shen , Chinese Academy of Sciences
Zhong Zhang , Simon Fraser University
Qiang Yang , Hong Kong University of Science and Technology
pp. 426

A Self-Organizing Map with Expanding Force for Data Clustering and Visualization (Abstract)

Wing-Ho Shum , Chinese University of Hong Kong
Hui-Dong Jin , Chinese University of Hong Kong
Kwong-Sak Leung , Chinese University of Hong Kong
Man-Leung Wong , Lingnan University
pp. 434

On the Mining of Substitution Rules for Statistically Dependent Items (Abstract)

Wei-Guang Teng , National Taiwan University
Ming-Jyh Hsieh , National Taiwan University
Ming-Syan Chen , National Taiwan University
pp. 442

TreeFinder: a First Step towards XML Data Mining (Abstract)

Alexandre Termier , LRI - CNRS UMR
Marie-Christine Rousset , LRI - CNRS UMR
Michèl Sebag , LRI - CNRS UMR
pp. 450

Computing Frequent Graph Patterns from Semistructured Data (Abstract)

N. Vanetik , Ben Gurion University
E. Gudes , Ben Gurion University
S. E. Shimony , Ben Gurion University
pp. 458

Predicting Rare Events In Temporal Domains (Abstract)

Ricardo Vilalta , University of Houston
Sheng Ma , IBM T.J. Watson Center
pp. 474

Mining Associations by Pattern Structure in Large Relational Tables (Abstract)

Haixun Wang , IBM T. J. Watson Research Center
Chang-Shing Perng , IBM T. J. Watson Research Center
Sheng Ma , IBM T. J. Watson Research Center
Philip S. Yu , IBM T. J. Watson Research Center
pp. 482

Adapting Information Extraction Knowledge For Unseen Web Sites (Abstract)

Tak-Lam Wong , Chinese University of Hong Kong
Wai Lam , Chinese University of Hong Kong
pp. 506

From Path Tree To Frequent Patterns: A Framework for Mining Frequent Patterns (Abstract)

Yabo Xu , Chinese University of Hong Kong
Jeffrey Xu Yu , Chinese University of Hong Kong
Guimei Liu , Hong Kong University of Science and Technology
Hongjun Lu , Hong Kong University of Science and Technology
pp. 514

Mining Case Bases for Action Recommendation (Abstract)

Qiang Yang , Hong Kong University of Science and Technology
Hong Cheng , Hong Kong University of Science and Technology
pp. 522

Heterogeneous Learner for Web Page Classification (Abstract)

Hwanjo Yu , University of Illinois
Kevin Chen-Chuan Chang , University of Illinois
Jiawei Han , University of Illinois
pp. 538

Using Category-Based Adherence to Cluster Market-Basket Data (Abstract)

Ching-Huang Yun , National Taiwan University
Kun-Ta Chuang , National Taiwan University
Ming-Syan Chen , National Taiwan University
pp. 546

A Comparison Study on Algorithms for Incremental Update of Frequent Sequences (Abstract)

Minghua Zhang , The University of Hong Kong
Ben Kao , The University of Hong Kong
Chi-Lap Yip , The University of Hong Kong
pp. 554

On Active Learning for Data Acquisition (Abstract)

Zhiqiang Zheng , University of Pennsylvania
Balaji Padmanabhan , University of Pennsylvania
pp. 562

SmartMiner: A Depth First Algorithm Guided by Tail Information for Mining Maximal Frequent Itemsets (Abstract)

Qinghua Zou , University of California-LA
Wesley W. Chu , University of California-LA
Baojing Lu , North Dakota State University
pp. 570
Main-Track Short Papers

Neighborgram Clustering Interactive Exploration of Cluster Neighborhoods (Abstract)

Michael R. Berthold , Data Analysis Research Lab, Tripos Inc.
Bernd Wiswedel , Data Analysis Research Lab, Tripos Inc.
David E. Patterson , Data Analysis Research Lab, Tripos Inc.
pp. 581

A New Algorithm for Learning Parameters of a Bayesian Network from Distributed Data (Abstract)

R. Chen , Washington State University
K. Sivakumar , Washington State University
pp. 585

Optimal Projections of High Dimensional Data (Abstract)

Emilio Corchado , Universidad de Burgos
Colin Fyfe , The University of Paisley
pp. 589

Generating an informative cover for association rules (Abstract)

Laurentiu Cristofor , University of Massachusetts at Boston
Dan Simovici , University of Massachusetts at Boston
pp. 597

Extraction Techniques for Mining Services from Web Sources (Abstract)

Hasan Davulcu , SUNY Stony Brook
Saikat Mukherjee , SUNY Stony Brook
I.V. Ramakrishnan , SUNY Stony Brook
pp. 601

An Algebraic Approach to Data Mining: Some Examples (Abstract)

Robert L. Grossman , University of Illinois at Chicago
Richard G. Larson , University of Illinois at Chicago
pp. 613

Wavelet Based UXO Detection (Abstract)

S. Hodgson , University of New England
N. Dunstan , University of New England
R. Murison , University of New England
pp. 617

Ensemble Modeling Through Multiplicative Adjustment of Class Probability (Abstract)

Se June Hong , IBM T.J. Watson Research Center
Jonathan Hosking , IBM T.J. Watson Research Center
Ramesh Natarajan , IBM T.J. Watson Research Center
pp. 621

Mining A Set of Coregulated RNA Sequences (Abstract)

Yuh-Jyh Hu , National Chiao Tung University
pp. 625

Association Analysis with One Scan of Databases (Abstract)

Hao Huang , Colorado School of Mines
Xindong Wu , University of Vermont
Richard Relue , Colorado School of Mines
pp. 629

Considering Both Intra-Pattern and Inter-Pattern Anomalies for Intrusion Detection (Abstract)

Ning Jiang , University of Central Florida
Kien A. Hua , University of Central Florida
Simon Sheu , National Tsing Hua University
pp. 637

On Evaluating Performance of Classifiers for Rare Classes (Abstract)

Mahesh V. Joshi , IBM T. J. Watson Research Center
pp. 641

Learning from Order Examples (Abstract)

Toshihiro Kamishima , National Institute of Advanced Industrial Science and Technology (AIST)
Shotaro Akaho , National Institute of Advanced Industrial Science and Technology (AIST)
pp. 645

A Personalized Music Filtering System Based on Melody Style Classification (Abstract)

Fang-Fei Kuo , National Cheng Chi University
Man-Kwan Shan , National Cheng Chi University
pp. 649

Improving Medical/Biological Data Classification Performance by Wavelet Preprocessing (Abstract)

Qi Li , University of Delaware
Tao Li , University of Rochester
Shenghuo Zhu , University of Rochester
Chandra Kambhamettu , University of Delaware
pp. 657

Progressive and Interactive Analysis of Event Data Using Event Miner (Abstract)

Sheng Ma , IBM T.J. Watson Research Center
Joseph L. Hellerstein , IBM T.J. Watson Research Center
Chang-shing Perng , IBM T.J. Watson Research Center
Genady Grabarnik , IBM T.J. Watson Research Center
pp. 661

Toward XML-Based Knowledge Discovery Systems (Abstract)

Rosa Meo , Universit? degli Studi di Torino
Giuseppe Psaila , Universit? degli Studi di Bergamo
pp. 665

Using Sequential and Non-Sequential Patterns in Predictive Web Usage Mining Tasks (Abstract)

Bamshad Mobasher , DePaul University
Honghua Dai , DePaul University
Tao Luo , DePaul University
Miki Nakagawa , DePaul University
pp. 669

Exploring the Parameter State Space of Stacking (Abstract)

Alexander K. Seewald , Austrian Research Institute for Artificial Intelligence
pp. 685

Mining Associated Implication Networks: Computational Intermarket Analysis (Abstract)

Phil Tse , Hong Kong Baptist University
Jiming Liu , Hong Kong Baptist University
pp. 689

Maintenance of Sequential Patterns for Record Modification Using Pre-large Sequences (Abstract)

Ching-Yao Wang , National Chiao-Tung University
Tzung-Pei Hong , National University of Kaohsiung
Shian-Shyong Tseng , National Chiao-Tung University
pp. 693

Concept Tree Based Clustering Visualization with Shaded Similarity Matrices (Abstract)

Jun Wang , University of Illinois at Urbana-Champaign
Bei Yu , University of Illinois at Urbana-Champaign
Les Gasser , University of Illinois at Urbana-Champaign
pp. 697

\Delta B <sup>+</sup> Tree: Indexing 3D Point Sets for Pattern Discovery (Abstract)

Xiong Wang , California State University, Fullerton
pp. 701

An Incremental Approach to Building a Cluster Hierarchy (Abstract)

Dwi H. Widyantoro , Texas A&M University
Thomas R. Ioerger , Texas A&M University
John Yen , The Pennsylvania State University
pp. 705

A Comparative Study of RNN for Outlier Detection in Data Mining (Abstract)

Graham Williams , CSIRO Enterprise Data Mining
Rohan Baxter , CSIRO Enterprise Data Mining
Hongxing He , CSIRO Enterprise Data Mining
Simon Hawkins , CSIRO Enterprise Data Mining
Lifang Gu , CSIRO Enterprise Data Mining
pp. 709

Mixtures of ARMA Models for Model-Based Time Series Clustering (Abstract)

Yimin Xiong , Hong Kong University of Science and Technology
Dit-Yan Yeung , Hong Kong University of Science and Technology
pp. 717

gSpan: Graph-Based Substructure Pattern Mining (Abstract)

Xifeng Yan , University of Illinois at Urbana-Champaign
Jiawei Han , University of Illinois at Urbana-Champaign
pp. 721

InfoMiner+: Mining Partial Periodic Patterns with Gap Penalties (Abstract)

Jiong Yang , UIUC
Wei Wang , UNC-Chapel Hill
Philip S. Yu , IBM T. J. Watson Research Center
pp. 725

FD_Mine: Discovering Functional Dependencies in a Database Using Equivalences (Abstract)

Hong Yao , University of Regina
Howard J. Hamilton , University of Regina
Cory J. Butz , University of Regina
pp. 729

Mining Genes in DNA Using GeneScout (Abstract)

Michael M. Yin , New Jersey Institute of Technology
Jason T. L. Wang , New Jersey Institute of Technology
pp. 733

Clustering Spatial Data when Facing Physical Constraints (Abstract)

Osmar R. ZaÏane , University of Alberta
Chi-Hoon Lee , University of Alberta
pp. 737

Mining Surveillance Video for Independent Motion Detection (Abstract)

Zhongfei (Mark) Zhang , State University of New York (SUNY) at Binghamton
pp. 741

Adaptive Parallel Sentences Mining from Web Bilingual News Collection (Abstract)

Bing Zhao , Carnegie Mellon University
Stephan Vogel , Carnegie Mellon University
pp. 745
Industry-Track Papers

Telecommunications Strategic Marketing - KDD and Economic Modeling (Abstract)

Stefano Cazzella , DIS - Univ. di Roma "La Sapienza"
Luigi Dragone , DIS - Univ. di Roma "La Sapienza"
Stefano M. Trisolini , TELECOM Italia
pp. 751

Mining Online Users? Access Records for Web Business Intelligence (Abstract)

Simon Fong , Universidade de Macau
Serena Chan , e-Business Development Team
pp. 759

Discovery of Interesting Association Rules from Livelink Web Log Data (Abstract)

Xiangji Huang , University of Waterloo
Aijun An , York University
Nick Cercone , Dalhousie University
Gary Promhouse , Open Text Corporation
pp. 763

Mining Optimal Actions for Profitable CRM (Abstract)

Charles X. Ling , The University of Western Ontario
Tielin Chen , The University of Western Ontario
Qiang Yang , Hong Kong University of Science and Technology
Jie Cheng , Canadian Imperial Bank of Commerce (CIBC)
pp. 767

Visually Mining Web User Clickpaths (Abstract)

Teresa Mah , Microsoft Corporation
Ying Li , Microsoft Corporation
pp. 771

Experimentation and Self Learning in Continuous Database Marketing (Abstract)

James E Pearce , MarketEaze Solutions
Geoffrey I Webb , Monash University
Robin N Shaw , Deakin University
Brian Garner , Deakin University
pp. 775

Demand Forecasting by the Neural Network with Discrete Fourier Transform (Abstract)

Mariko Yohda , The Japan Research Institute, Limited
Makiko Saito-Arita , The Japan Research Institute, Limited
Akira Okada , The Japan Research Institute, Limited
Ryota Suzuki , Hitotsubashi University
Yoshitsugu Kakemoto , The Japan Research Institute, Limited
pp. 779

Author's Index (PDF)

pp. 785
82 ms
(Ver 3.3 (11022016))