1384-5810

Data Mining and Knowledge Discovery (DATAMINE) - January 2022, issue 1 论文列表

本期论文列表
Cost-sensitive ensemble learning: a unifying framework

End-to-end deep representation learning for time series clustering: a comparative study

Inferring range of information diffusion based on historical frequent items

Mint: MDL-based approach for Mining INTeresting Numerical Pattern Sets

Topic change point detection using a mixed Bayesian model

Matrix sketching for supervised classification with imbalanced classes

Generalized core maintenance of dynamic bipartite graphs

Temporal state change Bayesian networks for modeling of evolving multivariate state sequences: model, structure discovery and parameter estimation

Expected passes

Controlling hallucinations at word level in data-to-text generation

An efficient procedure for mining egocentric temporal motifs

Mining sequences with exceptional transition behaviour of varying order using quality measures based on information-theoretic scoring functions

Sequential stratified regeneration: MCMC for large state spaces with an application to subgraph count estimation

Strengthening ties towards a highly-connected world

A recurrent neural network architecture to model physical activity energy expenditure in older people

Individualized passenger travel pattern multi-clustering based on graph regularized tensor latent dirichlet allocation

Simplification of genetic programs: a literature survey

An adaptive meta-heuristic for music plagiarism detection based on text similarity and clustering

Interpreting deep learning models with marginal attribution by conditioning on quantiles

Exploiting second-order dissimilarity representations for hierarchical clustering and visualization

Grouped feature importance and combined features effect plot

Developing Biceps to completely compute in subquadratic time a new generic type of bicluster in dense and sparse matrices

Extended missing data imputation via GANs for ranking applications

Conclusive local interpretation rules for random forests

SPEck: mining statistically-significant sequential patterns efficiently with exact sampling

Dynamic cyber risk estimation with competitive quantile autoregression

Efficient binary embedding of categorical data using BinSketch

An eager splitting strategy for online decision trees in ensembles

INK: knowledge graph embeddings for node classification

Sequence graph transform (SGT): a feature embedding function for sequence data mining

Provable randomized rounding for minimum-similarity diversification

Synwalk: community detection via random walk modelling

Robust regression via error tolerance

Counterfactual inference with latent variable and its application in mental health care

PAC-Bayesian lifelong learning for multi-armed bandits

Introducing the contrast profile: a novel time series primitive that allows real world classification

XEM: An explainable-by-design ensemble method for multivariate time series classification

Weighted sparse simplex representation: a unified framework for subspace clustering, constrained clustering, and active learning

Who can receive the pass? A computational model for quantifying availability in soccer

PETSC: pattern-based embedding for time series classification

Novel features for time series analysis: a complex networks approach

Using p-values for the comparison of classifiers: pitfalls and alternatives

Interpretability, personalization and reliability of a machine learning based clinical decision support system

Sufficient dimension reduction for average causal effect estimation

Ranking with submodular functions on a budget

The area under the ROC curve as a measure of clustering quality

Dynamic self-paced sampling ensemble for highly imbalanced and class-overlapped data classification

MultiRocket: multiple pooling operators and transformations for fast and effective time series classification

VPint: value propagation-based spatial interpolation

The minimum description length principle for pattern mining: a survey

EmbAssi: embedding assignment costs for similarity search in large graph databases

Dynamic slate recommendation with gated recurrent units and Thompson sampling

Coupled block diagonal regularization for multi-view subspace clustering

TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions

Human-in-the-loop handling of knowledge drift

Robust subgroup discovery

Neural content-aware collaborative filtering for cold-start music recommendation

SOKNL: A novel way of integrating K-nearest neighbours with adaptive random forest regression for data streams