Invited speakers (PDF)
Welcome from Conference Chairs (PDF)
Preface (PDF)
Conference Organization (PDF)
Steering Committee (PDF)
Program Committee (PDF)
Non-PC Reviewers (PDF)
Corporate Sponsors (PDF)
Tutorials (PDF)
Neuroscience: New Insights for AI? (Abstract)
Data Mining Methods for Modeling Gene Expression Regulation and Their Applications (PDF)
An Information Theoretic Approach to Detection of Minority Subsets in Database (Abstract)
Learning to Use a Learned Model: A Two-Stage Approach to Classification (Abstract)
Hierarchical Classification by Expected Utility Maximization (Abstract)
Cluster Ranking with an Application to Mining Mailbox Networks (Abstract)
Large Scale Detection of Irregularities in Accounting Data (Abstract)
Adaptive Blocking: Learning to Scale Up Record Linkage (Abstract)
Adaptive Parallel Graph Mining for CMP Architectures (Abstract)
Meta Clustering (Abstract)
Mixed-Drove Spatio-Temporal Co-occurence Pattern Mining: A Summary of Results (Abstract)
Tolerance Closed Frequent Itemsets (Abstract)
Active Learning to Maximize Area Under the ROC Curve (Abstract)
Rapid Identification of Column Heterogeneity (Abstract)
Data Mining Approaches to Criminal Career Analysis (Abstract)
Biclustering Protein Complex Interactions with a Biclique Finding Algorithm (Abstract)
STAGGER: Periodicity Mining of Data Streams Using Expanding Sliding Windows (Abstract)
Turning Clusters into Patterns: Rectangle-Based Discriminative Data Description (Abstract)
Converting Output Scores from Outlier Detection Algorithms into Probability Estimates (Abstract)
Personalization in Context: Does Context Matter When Building Personalized Customer Models? (Abstract)
Bregman Bubble Clustering: A Robust, Scalable Framework for Locating Multiple, Dense Regions in Data (Abstract)
Optimal Segmentation Using Tree Models (Abstract)
Mining for Tree-Query Associations in a Graph (Abstract)
Keyphrase Extraction Using Semantic Networks Structure Analysis (Abstract)
Subjectivity Categorization of Weblog with Part-of-Speech Based Smoothing (Abstract)
Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval (Abstract)
Improving Personalization Solutions through Optimal Segmentation of Customer Bases (Abstract)
Secure Distributed k-Anonymous Pattern Mining (Abstract)
Dimension Reduction for Supervised Ordering (Abstract)
A Parameterized Probabilistic Model of Network Evolution for Supervised Link Prediction (Abstract)
Incremental Mining of Frequent Query Patterns from XML Queries for Caching (Abstract)
The Relationships Among Various Nonnegative Matrix Factorization Methods for Clustering (Abstract)
Integrating Features from Different Sources for Music Information Retrieval (Abstract)
How Bayesians Debug (Abstract)
On the Use of Structure and Sequence-Based Features for Protein Classification and Retrieval (Abstract)
Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy (Abstract)
P3C: A Robust Projected Clustering Algorithm (Abstract)
Frequent Closed Itemset Mining Using Prefix Graphs with an Efficient Flow-Based Pruning Strategy (Abstract)
Efficient Clustering of Uncertain Data (Abstract)
A Data Mining Approach for Capacity Building of Stakeholders in Integrated Flood Management (Abstract)
Local Correlation Tracking in Time Series (Abstract)
Who Thinks Who Knows Who? Socio-cognitive Analysis of Email Networks (Abstract)
An Efficient Reference-Based Approach to Outlier Detection in Large Datasets (Abstract)
Using an Ensemble of One-Class SVM Classifiers to Harden Payload-based Anomaly Detection Systems (Abstract)
Relational Ensemble Classification (Abstract)
Discovering Partial Orders in Binary Data (Abstract)
Stability Region Based Expectation Maximization for Model-based Clustering (Abstract)
Co-clustering Documents and Words Using Bipartite Isoperimetric Graph Partitioning (Abstract)
Latent Dirichlet Co-Clustering (Abstract)
Latent Friend Mining from Blog Data (Abstract)
The PDD Framework for Detecting Categories of Peculiar Data (Abstract)
Entity Resolution with Markov Logic (Abstract)
Boosting Kernel Models for Regression (Abstract)
Boosting for Learning Multiple Classes with Imbalanced Class Distribution (Abstract)
What is the Dimension of Your Binary Data? (Abstract)
Fast Random Walk with Restart and Its Applications (Abstract)
Anytime Classification Using the Nearest Neighbor Algorithm with Applications to Stream Mining (Abstract)
Lazy Associative Classification (Abstract)
Geometrically Inspired Itemset Mining (Abstract)
Finding "Who Is Talking to Whom" in VoIP Networks via Progressive Stream Clustering (Abstract)
Comparison of Descriptor Spaces for Chemical Compound Retrieval and Classification (Abstract)
Regularized Least Absolute Deviations Regression and an Efficient Algorithm for Parameter Tuning (Abstract)
LOCI: Load Shedding through Class-Preserving Data Acquisition (Abstract)
SAXually Explicit Images: Finding Unusual Shapes (Abstract)
A Novel Scalable Algorithm for Supervised Subspace Learning (Abstract)
Forecasting Skewed Biased Stochastic Ozone Days: Analyses and Solutions (Abstract)
Identifying Follow-Correlation Itemset-Pairs (Abstract)
On the Lower Bound of Local Optimums in K-Means Algorithm (Abstract)
Fast On-line Kernel Learning for Trees (Abstract)
bitSPADE: A Lattice-based Sequential Pattern Mining Algorithm Using Bitmap Representation (Abstract)
Decision Trees for Functional Variables (Abstract)
Semantic Kernels for Text Classification Based on Topological Measures of Feature Similarity (Abstract)
Mining Maximal Generalized Frequent Geographic Patterns with Knowledge Constraints (Abstract)
Pattern Mining in Frequent Dynamic Subgraphs (Abstract)
Discovery of Collocation Episodes in Spatiotemporal Data (Abstract)
Getting the Most Out of Ensemble Selection (Abstract)
Diverse Topic Phrase Extraction through Latent Semantic Analysis (Abstract)
AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery (Abstract)
Belief Propagation in Large, Highly Connected Graphs for 3D Part-Based Object Recognition (Abstract)
A Framework for Regional Association Rule Mining in Spatial Datasets (Abstract)
Mining Generalized Graph Patterns Based on User Examples (Abstract)
An Experimental Investigation of Graph Kernels on a Collaborative Recommendation Task (Abstract)
A Balanced Ensemble Approach to Weighting Classifiers for Text Classification (Abstract)
Detection of Interdomain Routing Anomalies Based on Higher-Order Path Analysis (Abstract)
Star-Structured High-Order Heterogeneous Data Co-clustering Based on Consistent Information Theory (Abstract)
GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space (Abstract)
A Feature Selection and Evaluation Scheme for Computer Virus Detection (Abstract)
Constructing Ensembles for Better Ranking (Abstract)
TRIAS--An Algorithm for Mining Iceberg Tri-Lattices (Abstract)
Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems (Abstract)
COSMIC: Conceptually Specified Multi-Instance Clusters (Abstract)
Direct Marketing When There Are Voluntary Buyers (Abstract)
DSTree: A Tree Structure for the Mining of Frequent Sets from Data Streams (Abstract)
Searching for Pattern Rules (Abstract)
Adding Semantics to Email Clustering (Abstract)
Gradual Cube: Customize Profile on Mobile OLAP (Abstract)
CoMiner: An Effective Algorithm for Mining Competitors from the Web (Abstract)
Multi-Tier Granule Mining for Representations of Multidimensional Association Rules (Abstract)
Social Capital in Friendship-Event Networks (Abstract)
Exploratory Under-Sampling for Class-Imbalance Learning (Abstract)
The Influence of Class Imbalance on Cost-Sensitive Learning: An Empirical Study (Abstract)
Similarity of Temporal Query Logs Based on ARIMA Model (Abstract)
Probabilistic Segmentation and Analysis of Horizontal Cells (Abstract)
Mining Correlation between Motifs and Gene Expression (Abstract)
High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets (Abstract)
On Trajectory Representation for Scientific Features (Abstract)
NewsCATS: A News Categorization and Trading System (Abstract)
Improving Grouped-Entity Resolution Using Quasi-Cliques (Abstract)
Fast Relevance Discovery in Time Series (Abstract)
Probabilistic Enhanced Mapping with the Generative Tabular Model (Abstract)
Object Identification with Constraints (Abstract)
High-Performance Unsupervised Relation Extraction from Large Corpora (Abstract)
Cluster Based Core Vector Machine (Abstract)
Enhancing Text Clustering Using Concept-based Mining Model (Abstract)
Detecting Link Spam Using Temporal Information (Abstract)
Minimum Enclosing Spheres Formulations for Support Vector Ordinal Regression (Abstract)
Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment (Abstract)
Boosting the Feature Space: Text Classification for Unstructured Data on the Web (Abstract)
Plagiarism Detection in arXiv (Abstract)
Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams (Abstract)
Automatic Single-Organ Segmentation in Computed Tomography Images (Abstract)
Improving Nearest Neighbor Classifier Using Tabu Search and Ensemble Distance Metrics (Abstract)
Comparisons of K-Anonymization and Randomization Schemes under Linking Attacks (Abstract)
MARGIN: Maximal Frequent Subgraph Mining (Abstract)
Resource Management for Networked Classifiers in Distributed Stream Mining Systems (Abstract)
A Simple Yet Effective Data Clustering Algorithm (Abstract)
Entropy-based Concept Shift Detection (Abstract)
Recommendation on Item Graphs (Abstract)
Solution Path for Semi-Supervised Classification with Manifold Regularization (Abstract)
Semi-Supervised Kernel Regression (Abstract)
Mining Complex Time-Series Data by Learning Markovian Models (Abstract)
Temporal Data Mining in Dynamic Feature Spaces (Abstract)
Discover Bayesian Networks from Incomplete Data Using a Hybrid Evolutionary Algorithm (Abstract)
Distances and (Indefinite) Kernels for Sets of Objects (Abstract)
Deploying Approaches for Pattern Refinement in Text Mining (Abstract)
TOP-COP: Mining TOP-K Strongly Correlated Pairs in Large Databases (Abstract)
Manifold Clustering of Shapes (Abstract)
Linear and Non-Linear Dimensional Reduction via Class Representatives for Text Classification (Abstract)
Adaptive Kernel Principal Component Analysis with Unsupervised Learning of Kernels (Abstract)
Rule-Based Platform for Web User Profiling (Abstract)
Semantic Smoothing for Model-based Document Clustering (Abstract)
Corrective Classification: Classifier Ensembling with Corrective and Diverse Base Learners (Abstract)
Speedup Clustering with Hierarchical Ranking (Abstract)
Query-Sensitive Similarity Measure for Content-Based Image Retrieval (Abstract)
Author Index (PDF)