EvoMiner: frequent subtree mining in phylogenetic databases
A fragment-based iterative consensus clustering algorithm with a robust similarity
Tackling representation, annotation and classification challenges for temporal knowledge base population
Explaining prediction models and individual predictions with feature contributions
Efficient mining of discriminative co-clusters from gene expression data
High utility K-anonymization for social network publishing
Surfacing code in the dark: an instant clone search approach
Bridging structured and unstructured data via hybrid semantic search and interactive ontology-enhanced query formulation
Parallel matrix factorization for recommender systems
Multi-document summarization via Archetypal Analysis of the content-graph joint model
A graph-theoretic approach to optimize keyword queries in relational databases
PAKDD’12 best paper: generating balanced classifier-independent training samples from unlabeled data
Special issue on big data research in China
Learning to annotate via social interaction analytics
Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index
Dynamic and fast processing of queries on large-scale RDF data
A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
Data mining-based flatness pattern prediction for cold rolling process with varying operating condition
AsyIter: tolerating computational skew of synchronous iterative applications via computing decomposition
Data-based adaptive online prediction model for plant-wide production indices
Security-aware intermediate data placement strategy in scientific cloud workflows
Spatial–temporal compression and recovery in a wireless sensor network in an underground tunnel environment
A new semantic relatedness measurement using WordNet features
Finding peculiar compositions of two frequent strings with background texts
Tuple MapReduce and Pangool: an associated implementation
Email mining: tasks, common techniques, and tools
Improving class probability estimates for imbalanced data
Imprecise prior knowledge incorporating into one-class classification
Mining non-derivable hypercliques
Closed motifs for streaming time series classification
Clustering data streams using grid-based synopsis
A dissimilarity function for geospatial polygons
Automatic ranking of retrieval models using retrievability measure
Improving NCD accuracy by combining document segmentation and document distortion