The Community for Technology Leaders
2013 IEEE 13th International Conference on Data Mining (2005)
Houston, Texas
Nov. 27, 2005 to Nov. 30, 2005
ISSN: 1550-4786
ISBN: 0-7695-2278-5
TABLE OF CONTENTS
Introduction

Welcome to ICDM 2005 (PDF)

pp. xvi,xvii

Conference organization (PDF)

pp. xviii-xix
Introduction

Program Committee (PDF)

pp. xxi-xxiv

Non-PC Reviewers (PDF)

pp. xxv-xxvi

Invited Talks (PDF)

pp. 837-838

Tutorials (PDF)

pp. 839

Workshops (PDF)

pp. 840

Panel Session (PDF)

pp. 841
Regular Papers

Handling Generalized Cost Functions in the Partitioning Optimization Problem through Sequential Binary Programming (Abstract)

Adrian Becker , University of Pennsylvania
Alan S. Abrahams , University of Pennsylvania
Daniel Fleder , University of Pennsylvania
Ian C. MacMillan , University of Pennsylvania
pp. 3-9

Online Hierarchical Clustering in a Data Warehouse Environment (Abstract)

Peer Kröger , University of Munich
Hans-Peter Kriegel , University of Munich
Elke Achtert , University of Munich
Christian Böhm , University of Munich
pp. 10-17

eMailSift: Email Classification Based on Structure and Content (Abstract)

Manu Aery , University of Texas at Arlington
Sharma Chakravarthy , University of Texas at Arlington
pp. 18-25

Classifier Fusion Using Shared Sampling Distribution for Boosting (Abstract)

Jing Peng , Tulane University
Raja Iqbal , Tulane University
Costin Barbu , Tulane University
pp. 34-41

Improving Automatic Query Classification via Semi-Supervised Learning (Abstract)

Aleksander Kołcz , America Online, Inc.
Abdur Chowdhury , America Online, Inc.
David D. Lewis , America Online, Inc.
Ophir Frieder , Information Retrieval Laboratory
Eric C. Jensen , Information Retrieval Laboratory
Steven M. Beitzel , Information Retrieval Laboratory
pp. 42-49

ViVo: Visual Vocabulary Construction for Mining Biomedical Images (Abstract)

Christos Faloutsos , Carnegie Mellon University
Ambuj K. Singh , University of California at Santa Barbara
Mark R. Verardo , University of California at Santa Barbara
Jia-Yu Pan , Carnegie Mellon University
Hyungjeong Yang , Chonnam National University
Arnab Bhattacharya , University of California at Santa Barbara
Vebjorn Ljosa , University of California at Santa Barbara
pp. 50-57

Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping (Abstract)

Sugato Basu , University of Texas at Austin
Mikhail Bilenko , University of Texas at Austin
Mehran Sahami , Google Inc.
pp. 58-65

Using Information-Theoretic Measures to Assess Association Rule Interestingness (Abstract)

Régis Gras , Polytechnic School of Nantes University
Fabrice Guillet , Polytechnic School of Nantes University
Henri Briand , Polytechnic School of Nantes University
Julien Blanchard , Polytechnic School of Nantes University
pp. 66-73

Shortest-Path Kernels on Graphs (Abstract)

Hans-Peter Kriegel , Ludwig-Maximilians-University Munich
Karsten M. Borgwardt , Ludwig-Maximilians-University Munich
pp. 74-81

Mining Frequent Spatio-Temporal Sequential Patterns (Abstract)

David W. Cheung , University of Hong Kong
Nikos Mamoulis , University of Hong Kong
Huiping Cao , University of Hong Kong
pp. 82-89

Modeling Multiple Time Series for Anomaly Detection (Abstract)

Matthew V. Mahoney , Florida Institute of Technology
Philip K. Chan , Florida Institute of Technology
pp. 90-97

Summarization — Compressing Data into an Informative Representation (Abstract)

Varun Chandola , University of Minnesota
Vipin Kumar , University of Minnesota
pp. 98-105

Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values (Abstract)

Hung-Leng Chen , National Taiwan University
Ming-Syan Chen , National Taiwan University
Kun-Ta Chuang , National Taiwan University
pp. 106-113

Making Subsequence Time Series Clustering Meaningful (Abstract)

Jason R. Chen , Australian National University
pp. 114-121

Usage-Based PageRank for Web Personalization (Abstract)

Magdalini Eirinaki , Athens University of Economics and Business
Michalis Vazirgiannis , Athens University of Economics and Business
pp. 130-137

WARP: Time Warping for Periodicity Detection (Abstract)

Walid G. Aref , Purdue University
Mohamed G. Elfeky , Google Inc.
Ahmed K. Elmagarmid , Purdue University
pp. 138-145

Bifold Constraint-Based Mining by Simultaneous Monotone and Anti-Monotone Checking (Abstract)

Paul Nalos , University of Alberta Edmonton
Osmar R. Zaïane , University of Alberta Edmonton
Mohammad El-Hajj , University of Alberta Edmonton
pp. 146-153

Effective Estimation of Posterior Probabilities: Explaining the Accuracy of Randomized Decision Tree Approaches (Abstract)

Ed Greengrass , US Department of Defense
Wei Fan , IBM T.J.Watson Research
Joe McCloskey , US Department of Defense
Kevin Drummey , US Department of Defense
Philip S. Yu , IBM T.J.Watson Research
pp. 154-161

A Thorough Experimental Study of Datasets for Frequent Itemsets (Abstract)

Fabien De Marchi , Laboratoire LIRIS, UMR CNRS and Université Lyon I
Frédéric Flouvat , Laboratoire LIMOS, UMR CNRS and Université Clermont-Ferrand II
Jean-Marc Petit , Laboratoire LIRIS, UMR CNRS and INSA Lyon
pp. 162-169

AMIOT: Induced Ordered Tree Mining in Tree-Structured Databases (Abstract)

Hiroyuki Kawano , Nanzan University
Shohei Hido , Kyoto University
pp. 170-177

Hierarchy-Regularized Latent Semantic Indexing (Abstract)

Kai Yu , Siemens Corporate Technology
Yi Huang , University of Munich
Hans-Peter Kriegel , University of Munich
Shipeng Yu , University of Munich
Matthias Schubert , University of Munich
Volker Tresp , Siemens Corporate Technology
pp. 178-185

Extracting Frequent Subsequences from a Single Long Data Sequence: A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm (Abstract)

Koji Iwanuma , University of Yamanashi
Ryuichi Ishihara , University of Yamanashi
Yo Takano , University of Yamanashi
Hidetomo Nabeshima , University of Yamanashi
pp. 186-193

Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints (Abstract)

James Bailey , University of Melbourne
Guozhu Dong , Wright State University
Xiaonan Ji , University of Melbourne
pp. 194-201

Learning Instance Greedily Cloning Naive Bayes for Ranking (Abstract)

Harry Zhang , University of New Brunswick
Liangxiao Jiang , China University of Geosciences
pp. 202-209

An Algorithm for In-Core Frequent Itemset Mining on Streaming Data (Abstract)

Gagan Agrawal , Ohio State University
Ruoming Jin , Kent State University
pp. 210-217

Stability of Feature Selection Algorithms (Abstract)

Alexandros Kalousis , University of Geneva
Julien Prados , University of Geneva
Melanie Hilario , University of Geneva
pp. 218-225

HOT SAX: Efficiently Finding the Most Unusual Time Series Subsequence (Abstract)

Eamonn Keogh , University of California at Riverside
Jessica Lin , University of California at Riverside
Ada Fu , Chinese University of Hong Kong
pp. 226-233

Orthogonal Neighborhood Preserving Projections (Abstract)

E. Kokiopoulou , University of Minnesota
Y. Saad , University of Minnesota
pp. 234-241

Higher-Order Web Link Analysis Using Multilinear Algebra (Abstract)

Brett W. Bader , Sandia National Laboratories
Joseph P. Kenny , Sandia National Laboratories
Tamara G. Kolda , Sandia National Laboratories
pp. 242-249

A Generic Framework for Efficient Subspace Clustering of High-Dimensional Data (Abstract)

Peer Kröger , University of Munich
Hans-Peter Kriegel , University of Munich
Sebastian Wurst , University of Munich
Matthias Renz , University of Munich
pp. 250-257

Effective and Efficient Distributed Model-Based Clustering (Abstract)

Alexey Pryakhin , University of Munich
Peer Kröger , University of Munich
Matthias Schubert , University of Munich
Hans-Peter Kriegel , University of Munich
pp. 258-265

Finding Maximal Frequent Itemsets over Online Data Streams Adaptively (Abstract)

Wonsuk Lee , Yonsei University
Daesu Lee , Yonsei University
pp. 266-273

CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns (Abstract)

Tariqul Hoque , University of Manitoba
Carson Kai-Sang Leung , University of Manitoba
Quamrul I. Khan , University of Manitoba
pp. 274-281

Combining Multiple Clusterings by Soft Correspondence (Abstract)

Philip S. Yu , IBM T. J. Watson Research Center
Bo Long , State University of New York at Binghamton
Zhongfei (Mark) Zhang , State University of New York at Binghamton
pp. 282-289

Training Support Vector Machines Using Gilbert?s Algorithm (Abstract)

Shawn Martin , Sandia National Laboratories
pp. 306-313

A Heterogeneous Field Matching Method for Record Linkage (Abstract)

Craig A. Knoblock , University of Southern California
Martin Michalowski , University of Southern California
Claude Nanjo , Fetch Technologies
Steven N. Minton , Fetch Technologies
Matthew Michelson , University of Southern California
pp. 314-321

Leveraging Relational Autocorrelation with Latent Group Models (Abstract)

Jennifer Neville , University of Massachusetts at Amherst
David Jensen , University of Massachusetts at Amherst
pp. 322-329

Balancing Exploration and Exploitation: A New Algorithm for Active Machine Learning (Abstract)

Thomas Osugi , University of Nebraska
Deng Kun , University of Nebraska
Stephen Scott , University of Nebraska
pp. 330-337

Finding Representative Set from Massive Data (Abstract)

Wei Wang , University of North Carolina at Chapel Hill
Feng Pan , University of North Carolina at Chapel Hill
Anthony K. H. Tung , National University of Singapore
Jiong Yang , Case Western Reserve University
pp. 338-345

Parameter-Free Spatial Data Mining Using MDL (Abstract)

Panayiotis Tsaparas , University of Helsinki
Aristides Gionis , University of Helsinki
Heikki Mannila , University of Helsinki
Risto A. Väisänen , University of Helsinki
Spiros Papadimitriou , Carnegie Mellon University
Christos Faloutsos , Carnegie Mellon University
pp. 346-353

Discovering Frequent Arrangements of Temporal Intervals (Abstract)

Stan Sclaroff , Boston University
George Kollios , Boston University
Panagiotis Papapetrou , Boston University
Dimitrios Gunopulos , University of California at Riverside
pp. 354-361

Mining Patterns of Change in Remote Sensing Image Databases (Abstract)

Dalton M. Valeriano , National Institute for Space Research
Ricardo Cartaxo M. Souza , National Institute for Space Research
Maria Isabel S. Escada , National Institute for Space Research
Marcelino Pereira S. Silva , Rio Grande do Norte State University and National Institute for Space Research
Gilberto Câmara , National Institute for Space Research
pp. 362-369

Ranking-Based Evaluation of Regression Models (Abstract)

Claudia Perlich , IBM T. J. Watson Research Center
Saharon Rosset , IBM T. J. Watson Research Center
Bianca Zadrozny , IBM T. J. Watson Research Center
pp. 370-377

Multi-Stage Classification (Abstract)

Ted E. Senator , DARPA/IPTO
pp. 386-393

Learning Functional Dependency Networks Based on Genetic Programming (Abstract)

Wing-Ho Shum , Chinese University of Hong Kong
Kwong-Sak Leung , Chinese University of Hong Kong
Man-Leung Wong , Lingnan University
pp. 394-401

Generalizing the Notion of Confidence (Abstract)

Vipin Kumar , University of Minnesota
Michael Steinbach , University of Minnesota
pp. 402-409

Neighborhood Formation and Anomaly Detection in Bipartite Graphs (Abstract)

Jimeng Sun , Carnegie Mellon University
Huiming Qu , University of Pittsburgh
Christos Faloutsos , Carnegie Mellon University
Deepayan Chakrabarti , Yahoo! Research
pp. 418-425

A Border-Based Approach for Hiding Sensitive Frequent Itemsets (Abstract)

Xingzhi Sun , University of Queensland
Philip S. Yu , IBM T. J. Watson Research Center
pp. 426-433

X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown (Abstract)

Zoltán Szamonek , Hungarian Academy of Sciences and Eötvös University Budapest
Csaba Szepesvári , Hungarian Academy of Sciences
pp. 434-441

A Random Walk through Human Associations (Abstract)

Raz Tamir , Hebrew University of Jerusalem
pp. 442-449

Supervised Tensor Learning (Abstract)

Xindong Wu , University of Vermont
Xuelong Li , University of London
Weiming Hu , Chinese Academy of Science
Stephen Maybank , University of London
Dacheng Tao , University of London
pp. 450-457

A Bernoulli Relational Model for Nonlinear Embedding (Abstract)

Gang Wang , Hong Kong University of Science and Technology
Zhihua Zhang , Hong Kong University of Science and Technology
Hui Zhang , Xi?an Jiaotong University
Frederick H. Lochovsky , Hong Kong University of Science and Technology
pp. 458-465

Template-Based Privacy Preservation in Classification Problems (Abstract)

Ke Wang , Simon Fraser University
Philip S. Yu , IBM T. J. Watson Research Center
Benjamin C. M. Fung , Simon Fraser University
pp. 466-473

On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams (Abstract)

Haixun Wang , IBM T. J. Watson Research Center
Baile Shi , Fudan University
Xiaochen Wu , Fudan University
Wei Wang , Fudan University
Peng Wang , Fudan University
pp. 474-481

Approximate Inverse Frequent Itemset Mining: Privacy, Complexity, and Approximation (Abstract)

Xintao Wu , University of North Carolina at Charlotte
Yongge Wang , University of North Carolina at Charlotte
pp. 482-489

Atomic Wedgie: Efficient Query Filtering for Streaming Times Series (Abstract)

Eamonn Keogh , University of California at Riverside
Agenor Mafra-Neto , ISCA Technologies
Helga Van Herle , University of California at Los Angeles
Li Wei , University of California at Riverside
pp. 490-497

Discriminatively Trained Markov Model for Sequence Classification (Abstract)

Vasant Honavar , Iowa State University
Adrian Silvescu , Iowa State University
Oksana Yakhnenko , Iowa State University
pp. 498-505

Integrating Hidden Markov Models and Spectral Analysis for Sensory Time Series Clustering (Abstract)

Qiang Yang , Hong Kong University of Science and Technology
Jie Yin , Hong Kong University of Science and Technology
pp. 506-513

Discriminant Analysis: A Unified Approach (Abstract)

Peng Zhang , Tulane University
Norbert Riedel , Tulane University
pp. 514-521

Sharing Classifiers among Ensembles from Related Problem Domains (Abstract)

Samuel Burer , University of Iowa
W. Nick Street , University of Iowa
Yi Zhang , University of Iowa
pp. 522-529

A Visual Data Mining Framework for Convenient Identification of Useful Knowledge (Abstract)

Kaidi Zhao , University of Illinois at Chicago
Weimin Xiao , Motorola Labs
Bing Liu , University of Illinois at Chicago
Thomas M. Tirpak , Motorola Labs
pp. 530-537

Efficient Text Classification by Weighted Proximal SVM (Abstract)

Dong Zhuang , Beijing Institute of Technology
Jun Yan , Peking University
Zheng Chen , Microsoft Research Asia
Benyu Zhang , Microsoft Research Asia
Ying Chen , Beijing Institute of Technology
Qiang Yang , Hong Kong University of Science and Technology
pp. 538-545
Short Papers

A Rule Evaluation Support Method with Learning Models Based on Objective Rule Evaluation Indexes (Abstract)

Shusaku Tsumoto , Shimane University
Takahira Yamaguchi , Keio University
Miho Ohsaki , Doshisha University
Hidenao Abe , Shimane University
pp. 549-552

Mining Chains of Relations (Abstract)

Taneli Mielikäinen , University of Helsinki
Foto Afrati , National Technical University of Athens
Heikki Mannila , University of Helsinki
Panayiotis Tsaparas , University of Helsinki
Gautam Das , National Technical University of Athens
Aristides Gionis , University of Helsinki
pp. 553-556

Blocking Anonymity Threats Raised by Frequent Itemset Mining (Abstract)

Fosca Giannotti , ISTI - CNR
Dino Pedreschi , University of Pisa
Maurizio Atzori , ISTI - CNR and University of Pisa
Francesco Bonchi , ISTI - CNR
pp. 561-564

Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience (Abstract)

Christoph F. Eick , University of Houston
Ricardo Vilalta , University of Houston
Chun-Sheng Chen , University of Houston
Abraham Bagherjeiran , University of Houston
pp. 565-568

Semi-Supervised Mixture of Kernels via LPBoost Methods (Abstract)

Murat Dundar , Siemens Medical Solutions
Jinbo Bi , Siemens Medical Solutions
Glenn Fung , Siemens Medical Solutions
Bharat Rao , Siemens Medical Solutions
pp. 569-572

A Levelwise Search Algorithm for Interesting Subspace Clusters (Abstract)

Haiyun Bian , University of Cincinnati
Raj Bhatnagar , University of Cincinnati
pp. 573-576

Segment-Based Injection Attacks against Collaborative Filtering Recommender Systems (Abstract)

Bamshad Mobasher , DePaul University
Chad Williams , DePaul University
Runa Bhaumik , DePaul University
Robin Burke , DePaul University
pp. 577-580

On Feature Selection through Clustering (Abstract)

Dan A. Simovici , University of Massachusetts at Boston
Richard Butterworth , University of Massachusetts at Boston
pp. 581-584

Sequential Pattern Mining in Multiple Streams (Abstract)

Xingquan Zhu , University of Vermont
Xindong Wu , University of Vermont
Gong Chen , University of Vermont
pp. 585-588

Privacy Preserving Data Classification with Rotation Perturbation (Abstract)

Ling Liu , Georgia Institute of Technology
Keke Chen , Georgia Institute of Technology
pp. 589-592

A Computational Framework for Taxonomic Research: Diagnosing Body Shape within Fish Species Complexes (Abstract)

Henry L. Bart, Jr. , Tulane University
Huimin Chen , Tulane University
Yixin Chen , University of New Orleans
Shuqing Huang , Tulane University
pp. 593-596

Obtaining Best Parameter Values for Accurate Classification (Abstract)

Paul Leng , University of Liverpool
Frans Coenen , University of Liverpool
pp. 597-600

Process Diagnosis via Electrical-Wafer-Sorting Maps Classification (Abstract)

Guido Miraglia , ST Microelectronics
Giuseppe De Nicolao , University of Pavia
Oliver M. Donzelli , ST Microelectronics
Federico Di Palma , University of Pavia
pp. 601-604

An Improved Categorization of Classifier?s Sensitivity on Sample Selection Bias (Abstract)

Wei Fan , IBM T. J. Watson Research
Bianca Zadrozny , IBM T. J. Watson Research
Philip S. Yu , IBM T. J. Watson Research
Ian Davidson , State University of New York at Albany
pp. 605-608

Fast Frequent String Mining Using Suffix Arrays (Abstract)

Volker Heun , Universität München
Johannes Fischer , Universität München
Stefan Kramer , TU München
pp. 609-612

Privacy-Preserving Frequent Pattern Mining across Private Databases (Abstract)

Raymond Chi-Wing Wong , Chinese University of Hong Kong
Ke Wang , Simon Fraser University
Ada Wai-Chee Fu , Chinese University of Hong Kong
pp. 613-616

CoLe: A Cooperative Data Mining Approach and Its Application to Early Diabetes Detection (Abstract)

Robert C. James , Aechidna Health Informatics
Jie Gao , University of Calgary
Jörg Denzinger , University of Calgary
pp. 617-620

Feature Selection for Building Cost-Effective Data Stream Classifiers (Abstract)

X. Sean Wang , University of Vermont
Like Gao , University of Vermont
pp. 621-624

A Scalable Collaborative Filtering Framework Based on Co-Clustering (Abstract)

Srujana Merugu , University of Texas at Austin
Thomas George , Texas A & M University
pp. 625-628

Text Classification with Evolving Label-Sets (Abstract)

Ganesh Ramakrishnan , IBM India Research Lab
Shantanu Godbole , Indian Institute of Technology - Bombay
Sunita Sarawagi , Indian Institute of Technology - Bombay
pp. 629-632

A Framework for Semi-Supervised Learning Based on Subjective and Objective Clustering Criteria (Abstract)

D. Gunopulos , University of California at Riverside
M. Vazirgiannis , Athens University of Economics and Business
M. Halkidi , University of California at Riverside and Athens University of Economics and Business
N. Kumar , University of California at Riverside
C. Domeniconi , George Mason University
pp. 637-640

Focused Community Discovery (Abstract)

Philip S. Yu , IBM T.J. Watson Research Center
Kirsten Hildrum , IBM T.J. Watson Research Center
pp. 641-644

Suppressing Data Sets to Prevent Discovery of Association Rules (Abstract)

Yücel Saygin , Sabanci University
Mehmet Keskinöz , Sabanci University
Ali Inan , Sabanci University
Ayça Azgin Hintoğlu , Sabanci University
pp. 645-648

Triple Jump Acceleration for the EM Algorithm (Abstract)

Chun-Nan Hsu , Academia Sinica
Bou-Ho Yang , Academia Sinica and Chang Gung University
Han-Shen Huang , Academia Sinica
pp. 649-652

Partial Ensemble Classifiers Selection for Better Ranking (Abstract)

Charles X. Ling , University of Western Ontario
Jin Huang , University of Western Ontario
pp. 653-656

Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis (Abstract)

Tsuyoshi Idé , IBM Research, Tokyo Research Laboratory
pp. 657-660

Mining Ontological Knowledge from Domain-Specific Text Documents (Abstract)

Xing Jiang , Nanyang Technological University
Ah-Hwee Tan , Nanyang Technological University
pp. 665-668

Mining Patterns That Respond to Actions (Abstract)

Ada Wai-Chee Fu , Chinese University of Hong Kong
Alexander Tuzhilin , New York University
Ke Wang , Simon Fraser University
Yuelong Jiang , Simon Fraser University
pp. 669-672

Supervised Ordering — An Empirical Survey (Abstract)

Hideto Kazawa , Nippon Telegraph and Telephone Corporation
Toshihiro Kamishima , National Institute of Advanced Industrial Science and Technology
Shotaro Akaho , National Institute of Advanced Industrial Science and Technology
pp. 673-676

Categorization and Keyword Identification of Unlabeled Documents (Abstract)

Ning Kang , George Mason University
Carlotta Domeniconi , George Mason University
Daniel Barbará , George Mason University
pp. 677-680

Gradual Model Generator for Single-Pass Clustering (Abstract)

Pasi Fränti , University of Joensuu
Ismo Kärkkäinen , University of Joensuu
pp. 681-684

Making Logistic Regression a Core Data Mining Tool with TR-IRLS (Abstract)

Andrew W. Moore , Carnegie Mellon University
Paul Komarek , Carnegie Mellon University
pp. 685-688

Hierarchical Density-Based Clustering of Uncertain Data (Abstract)

Hans-Peter Kriegel , University of Munich
Martin Pfeifle , University of Munich
pp. 689-692

Semi-Supervised Clustering with Metric Learning Using Relative Comparisons (Abstract)

Nimit Kumar , IBM India Research Lab
Deepa Paranjpe , IBM India Research Lab
Krishna Kummamuru , IBM India Research Lab
pp. 693-696

On Learning Asymmetric Dissimilarity Measures (Abstract)

Krishna Kummamuru , IBM India Research Lab
Rakesh Agrawal , IBM Almaden Research Center
Raghu Krishnapuram , IBM India Research Lab
pp. 697-700

Partial Elastic Matching of Time Series (Abstract)

Vasileios Megalooikonomou , Temple University
E. Keogh , University of California at Riverside
Qiang Wang , Temple University
Rolf Lakaemper , Temple University
C. A. Ratanamahatana , University of California at Riverside
Longin Jan Latecki , Temple University
pp. 701-704

CLUGO: A Clustering Algorithm for Automated Functional Annotations Based on Gene Ontology (Abstract)

Jan-Ming Ho , Academia Sinica
Ming-Syan Chen , National Taiwan University
In-Yee Lee , National Taiwan University and Academia Sinica
pp. 705-708

An Optimal Linear Time Algorithm for Quasi-Monotonic Segmentation (Abstract)

Yuhong Yan , National Research Council of Canada
Daniel Lemire , University of Quebec at Montreal
Martin Brooks , National Research Council of Canada
pp. 709-712

Average Number of Frequent (Closed) Patterns in Bernouilli and Markovian Databases (Abstract)

Loïck Lhote , Université de Caen Basse-Normandie
François Rioult , Université de Caen Basse-Normandie
Arnaud Soulet , Université de Caen Basse-Normandie
pp. 713-716

Predicting Software Escalations with Maximum ROI (Abstract)

Charles X. Ling , University of Western Ontario
Shengli Sheng , University of Western Ontario
Nazim H. Madhavji , University of Western Ontario
Tilmann Bruckhaus , Sun Microsystems, Inc.
pp. 717-720

Mining Approximate Frequent Itemsets from Noisy Data (Abstract)

Andrew Nobel , University of North Carolina at Chapel Hill
Jinze Liu , University of North Carolina at Chapel Hill
Wei Wang , University of North Carolina at Chapel Hill
Susan Paulsen , University of North Carolina at Chapel Hill
Jan Prins , University of North Carolina at Chapel Hill
pp. 721-724

Text Representation: From Vector to Tensor (Abstract)

Wenyin Liu , City University of Hong Kong
Benyu Zhang , Microsoft Research Asia
Jun Yan , Peking University
Zheng Chen , Microsoft Research Asia
Fengshan Bai , Tsinghua University
Ning Liu , Tsinghua University
Leefeng Chien , Academia Sinica
pp. 725-728

Parallel Algorithms for Distance-Based and Density-Based Outliers (Abstract)

Elio Lozano , University of Puerto Rico
Edgar Acuña , University of Puerto Rico
pp. 729-732

Bit Reduction Support Vector Machine (Abstract)

Andrew Remsen , University of South Florida
Lawrence O. Hall , University of South Florida
Tong Luo , University of South Florida
Dmitry B. Goldgof , University of South Florida
pp. 733-736

Spatial Clustering of Chimpanzee Locations for Neighborhood Identification (Abstract)

Shashi Shekhar , University of Minnesota
Jaideep Srivastava , University of Minnesota
Sandeep Mane , University of Minnesota
Anne Pusey , University of Minnesota
Carson Murray , University of Minnesota
pp. 737-740

A Graph-Ranking Algorithm for Geo-Referencing Documents (Abstract)

Mário J. Silva , Universidade de Lisboa
Bruno Martins , Universidade de Lisboa
pp. 741-744

An Expected Utility Approach to Active Feature-Value Acquisition (Abstract)

Maytal Saar-Tsechansky , University of Texas at Austin
Raymond Mooney , University of Texas at Austin
Prem Melville , University of Texas at Austin
Foster Provost , New York University
pp. 745-748

Automatically Mining Result Records from Search Engine Response Pages (Abstract)

Saygin Celebi , University of Louisiana at Lafayette
Jayasimha Reddy Katukuri , University of Louisiana at Lafayette
Dheerendranath Mundluru , University of Louisiana at Lafayette
pp. 749-752

Efficiently Mining Frequent Closed Partial Orders (Abstract)

Philip S. Yu , IBM T.J. Watson Research Center
Jian Pei , Simon Fraser University
Ke Wang , Simon Fraser University
Haixun Wang , IBM T.J. Watson Research Center
Jian Liu , State University of New York at Buffalo
Jianyong Wang , Tsinghua University
pp. 753-756

CLUMP: A Scalable and Robust Framework for Structure Discovery (Abstract)

Joydeep Ghosh , University of Texas at Austin
Kunal Punera , University of Texas at Austin
pp. 757-760

Face Recognition Using Landmark-Based Bidimensional Regression (Abstract)

David Marx , University of Nebraska - Lincoln
Ashok Samal , University of Nebraska - Lincoln
Jiazheng Shi , University of Nebraska - Lincoln
pp. 765-768

Instability of Classifiers on Categorical Data (Abstract)

Arno Siebes , Universiteit Utrecht
Muhammad Subianto , Universiteit Utrecht
Ad Feelders , Universiteit Utrecht
pp. 769-772

Pruning Social Networks Using Structural Properties and Descriptive Attributes (Abstract)

Lisa Singh , Georgetown University
Louis Licamele , University of Maryland at College Park
Lise Getoor , University of Maryland at College Park
pp. 773-776

Bias Analysis in Text Classification for Highly Skewed Data (Abstract)

Lei Tang , Arizona State University
Huan Liu , Arizona State University
pp. 781-784

Efficient Mining of High Branching Factor Attribute Trees (Abstract)

Hiroshi Motoda , Osaka University
Marie-Christine Rousset , CNRS, Université Paris-Sud and INRIA
Michèle Sebag , CNRS, Université Paris-Sud and INRIA
Alexandre Termier , Osaka University
Takashi Washio , Osaka University
Kouzou Ohara , Osaka University
pp. 785-788

Anomaly Intrusion Detection Using Multi-Objective Genetic Fuzzy System and Agent-Based Evolutionary Computation Framework (Abstract)

Chi-Ho Tsang , City University of Hong Kong
Hanli Wang , City University of Hong Kong
Sam Kwong , City University of Hong Kong
pp. 789-792

Hot Item Mining and Summarization from Multiple Auction Web Sites (Abstract)

Tak-Lam Wong , Chinese University of Hong Kong
Wai Lam , Chinese University of Hong Kong
pp. 797-800

Merging Interface Schemas on the Deep Web via Clustering Aggregation (Abstract)

Wensheng Wu , University of Illinois at Urbana-Champaign
AnHai Doan , University of Illinois at Urbana-Champaign
Clement Yu , University of Illinois at Chicago
pp. 801-804

On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis (Abstract)

Cyrus Shahabi , University of Southern California
Kiyoung Yang , University of Southern California
pp. 805-808

Speculative Markov Blanket Discovery for Optimal Feature Selection (Abstract)

Dimitris Margaritis , Iowa State University
Sandeep Yaramakala , Iowa State University
pp. 809-812

A Join-Less Approach for Co-Location Pattern Mining: A Summary of Results (Abstract)

Mete Celik , University of Minnesota
Jin Soung Yoo , University of Minnesota
Shashi Shekhar , University of Minnesota
pp. 813-816

Learning through Changes: An Empirical Study of Dynamic Behaviors of Probability Estimation Trees (Abstract)

Kun Zhang , Tulane University
Zujia Xu , Dillard University
Jing Peng , Tulane University
Bill Buckles , Tulane University
pp. 817-820

Visualizing Global Manifold Based on Distributed Local Data Abstractions (Abstract)

William K. Cheung , Hong Kong Baptist University
Xiaofeng Zhang , Hong Kong Baptist University
pp. 821-824

Bagging with Adaptive Costs (Abstract)

W. Nick Street , University of Iowa
Yi Zhang , University of Iowa
pp. 825-828

Example-Based Robust Outlier Detection in High Dimensional Datasets (Abstract)

Cui Zhu , University of Tsukuba
Hiroyuki Kitagawa , University of Tsukuba
Christos Faloutsos , Carnegie Mellon University
pp. 829-832

CTC — Correlating Tree Patterns for Classification (Abstract)

Albrecht Zimmermann , Albert-Ludwigs-University Freiburg
Björn Bringmann , Albert-Ludwigs-University Freiburg
pp. 833-836
Author Index

Author Index (PDF)

pp. 843-846
89 ms
(Ver 3.3 (11022016))