Using Self-Similarity to Cluster Large Data Sets
Extracting Share Frequent Itemsets with Infrequent Subsets
Building Decision Trees with Constraints
Sampling and Subsampling for Cluster Analysis in Data Mining: With Applications to Sky Survey Data
Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries
XTRACT: Learning Document Type Descriptors from XML Document Collections
Cluster Detection in Databases: The Adaptive Matched Filter Algorithm and Implementation
A Taxonomy of Dirty Data
Data Squashing by Empirical Likelihood
Guest Editorial
DualMiner: A Dual-Pruning Algorithm for Itemsets with Constraints
Analysis of Pattern Discovery in Sequences Using a Bayes Error Framework
A Sequential Monte Carlo Method for Bayesian Analysis of Massive Datasets
Customer Lifetime Value Models for Decision Support
Guest Editorial
On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration
Bursty and Hierarchical Structure in Streams
Model-Based Clustering and Visualization of Navigation Patterns on a Web Site