SIGKDD(KDD) 2005论文列表 - Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 21-24, 2005.| 数据学习 (DataLearner)

SIGKDD(KDD) 2005 论文列表

Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 21-24, 2005.

Pattern-based similarity search for microarray data.

Haixun Wang Jian Pei Philip S. Yu

A multinomial clustering model for fast simulation of computer architecture designs.

Kaushal Sanghai Ting Su Jennifer G. Dy David R. Kaeli

Short term performance forecasting in enterprise systems.

Rob Powers Moisés Goldszmidt Ira Cohen

Mining rare and frequent events in multi-camera surveillance video using self-organizing maps.

Valery A. Petrushin

Disease progression modeling from historical clinical databases.

Ronald K. Pearson Robert J. Kingan Alan Hochberg

Automated detection of frontal systems from numerical model-generated data.

Xiang Li Rahul Ramachandran Sara J. Graves Sunil Movva Bilahari Akkiraju David Emmitt Steven Greco Robert Atlas Joseph Terry Juan-Carlos Jusem

An integrated framework on mining logs files for computing system management.

Tao Li Feng Liang Sheng Ma Wei Peng

Mining risk patterns in medical data.

Jiuyong Li Ada Wai-Chee Fu Hongxing He Jie Chen Huidong Jin Damien McAullay Graham J. Williams Ross Sparks Chris Kelman

Data mining in the chemical industry.

Alex N. Kalos Tim Rey

Generation of synthetic data sets for evaluating the accuracy of knowledge discovery systems.

Daniel R. Jeske Behrokh Samadi Pengyue J. Lin Lan Ye Sean Cox Rui Xiao Ted Younglove Minh Ly Douglas Holt Ryan Rich

Failure detection and localization in component based systems by online tracking.

Haifeng Chen Guofei Jiang Cristian Ungureanu Kenji Yoshihira

Fast window correlations over uncooperative time series.

Richard Cole Dennis E. Shasha Xiaojian Zhao

CLICKS: an effective algorithm for mining subspace clusters in categorical datasets.

Mohammed Javeed Zaki Markus Peters Ira Assent Thomas Seidl

Pattern lattice traversal by selective jumps.

Osmar R. Zaïane Mohammad El-Hajj

Building connected neighborhood graphs for isometric data embedding.

Li Yang

A generalized framework for mining spatio-temporal patterns in scientific data.

Hui Yang Srinivasan Parthasarathy Sameep Mehta

Combining proactive and reactive predictions for data streams.

Ying Yang Xindong Wu Xingquan Zhu

Formulating distance functions via the kernel trick.

Gang Wu Edward Y. Chang Navneet Panda

Regression error characteristic surfaces.

Luís Torgo

Mining comparable bilingual text corpora for cross-language information integration.

Tao Tao ChengXiang Zhai

A hybrid unsupervised approach for document clustering.

Mihai Surdeanu Jordi Turmo Alicia Ageno

Evaluating similarity measures: a large-scale study in the orkut social network.

Ellen Spertus Mehran Sahami Orkut Buyukkokten

Density-based clustering of uncertain data.

Hans-Peter Kriegel Martin Pfeifle

Key semantics extraction by dependency tree mining.

Satoshi Morinaga Hiroki Arimura Takahiro Ikeda Yosuke Sakao Susumu Akamine

Optimizing time series discretization for knowledge discovery.

Fabian Mörchen Alfred Ultsch

Efficient computations via scalable sparse kernel partial least squares and boosted latent features.

Michinari Momma

Estimating missed actual positives using independent classifiers.

Sandeep Mane Jaideep Srivastava San-Yih Hwang

Adversarial learning.

Daniel Lowd Christopher Meek

Co-clustering by block value decomposition.

Bo Long Zhongfei (Mark) Zhang Philip S. Yu

A fast kernel-based multilevel algorithm for graph clustering.

Inderjit S. Dhillon Yuqiang Guan Brian Kulis

Determining an author's native language by mining a text for errors.

Moshe Koppel Jonathan Schler Kfir Zigdon

Information retrieval based on collaborative filtering with latent interest semantic map.

Noriaki Kawamae Katsumi Takahashi

A maximum entropy web recommendation system: combining collaborative and content features.

Xin Jin Yanzan Zhou Bamshad Mobasher

Discovering frequent topological structures from graph datasets.

Ruoming Jin Chao Wang Dmitrii Polshakov Srinivasan Parthasarathy Gagan Agrawal

Simultaneous optimization of complex mining tasks with a knowledgeable cache.

Ruoming Jin Kaushik Sinha Gagan Agrawal

Privacy-preserving distributed k-means clustering over arbitrarily partitioned data.

Geetha Jagannathan Rebecca N. Wright

Application of kernels to link analysis.

Takahiko Ito Masashi Shimbo Taku Kudo Yuji Matsumoto

Maximal boasting.

Cinda Heeren Leonard Pitt

Unweaving a web of documents.

Ramanathan V. Guha Ravi Kumar D. Sivakumar Ravi Sundaram

Creating social networks to improve peer-to-peer networking.

Andrew S. Fast David D. Jensen Brian Neil Levine

Parallel mining of closed sequential patterns.

Shengnan Cong Jiawei Han David A. Padua

LIPED: HMM-based life profiles for adaptive event detection.

Chien Chin Chen Meng Chang Chen Ming-Syan Chen

Web mining from competitors' websites.

Xin Chen Yi-fang Brook Wu

Scalable discovery of hidden emails from large folders.

Giuseppe Carenini Raymond T. Ng Xiaodong Zhou

Integration of profile hidden Markov model output into association rule mining.

Christopher Besemann Anne Denton

Model-based overlapping clustering.

Arindam Banerjee Chase Krumpelman Joydeep Ghosh Sugato Basu Raymond J. Mooney

Towards exploratory test instance specific algorithms for high dimensional classification.

Charu C. Aggarwal

Learning to predict train wheel failures.

Chunsheng Yang Sylvain Létourneau

Enhancing the lift under budget constraints: an application in the mutual fund industry.

Lian Yan Michael Fassino Patrick Baldasare

Dynamic syslog mining for network failure monitoring.

Kenji Yamanishi Yuko Maruyama

Email data cleaning.

Jie Tang Hang Li Yunbo Cao ZhaoHui Tang

Modeling and predicting personal information dissemination behavior.

Xiaodan Song Ching-Yung Lin Belle L. Tseng Ming-Ting Sun

Predicting the product purchase patterns of corporate customers.

Bhavani Raskutti Alan Herschtal

A hit-miss model for duplicate detection in the WHO drug safety database.

G. Niklas Norén Roland Orre Andrew Bate

Using relational knowledge discovery to prevent securities fraud.

Jennifer Neville Özgür Simsek David D. Jensen John Komoroske Kelly Palmer Henry G. Goldberg

Using retrieval measures to assess similarity in mining dynamic web clickstreams.

Olfa Nasraoui Cesar Cardona Carlos Rojas

Making holistic schema matching robust: an ensemble approach.

Bin He Kevin Chen-Chuan Chang

Deriving marketing intelligence from online discussion.

Natalie S. Glance Matthew Hurst Kamal Nigam Matthew Siegler Robert Stockton Takashi Tomokiyo

Price prediction and insurance for online auctions.

Rayid Ghani

An approach to spacecraft anomaly detection problem using kernel feature space.

Ryohei Fujimaki Takehisa Yairi Kazuo Machida

Finding similar files in large document repositories.

George Forman Kave Eshghi Stephane Chiocchetti

Streaming feature selection using alpha-investing.

Jing Zhou Dean P. Foster Robert A. Stine Lyle H. Ungar

A new scheme on privacy-preserving data classification.

Nan Zhang Shengquan Wang Wei Zhao

Reasoning about sets using redescription mining.

Mohammed Javeed Zaki Naren Ramakrishnan

SVM selective sampling for ranking with application to data retrieval.

Hwanjo Yu

Cross-relational clustering with user's guidance.

Xiaoxin Yin Jiawei Han Philip S. Yu

Anonymity-preserving data collection.

Zhiqiang Yang Sheng Zhong Rebecca N. Wright

Mining closed relational graphs with connectivity constraints.

Xifeng Yan Xianghong Jasmine Zhou Jiawei Han

Summarizing itemset patterns: a profile-based approach.

Xifeng Yan Hong Cheng Jiawei Han Dong Xin

Improving discriminative sequential learning with rare--but--important associations.

Xuan Hieu Phan Minh Le Nguyen Tu Bao Ho Susumu Horiguchi

Web object indexing using domain knowledge.

Muyuan Wang Zhiwei Li Lie Lu Wei-Ying Ma Naiyao Zhang

Finding partial orders from unordered 0-1 data.

Antti Ukkonen Mikael Fortelius Heikki Mannila

Probabilistic workflow mining.

Ricardo Bezerra de Andrade e Silva Jiji Zhang James G. Shanahan

Sampling-based sequential subgroup mining.

Martin Scholz

On the use of linear programming for unsupervised text classification.

Mark Sandler

Robust boosting and its relation to bagging.

Saharon Rosset

Query chains: learning to rank from implicit feedback.

Filip Radlinski Thorsten Joachims

On mining cross-graph quasi-cliques.

Jian Pei Daxin Jiang Aidong Zhang

Detection of emerging space-time clusters.

Daniel B. Neill Andrew W. Moore Maheshkumar Sabhnani Kenny Daniel

A distributed learning framework for heterogeneous data sources.

Srujana Merugu Joydeep Ghosh

Discovering evolutionary theme patterns from text: an exploration of temporal text mining.

Qiaozhu Mei ChengXiang Zhai

A general model for clustering binary data.

Tao Li

Graphs over time: densification laws, shrinking diameters and possible explanations.

Jure Leskovec Jon M. Kleinberg Christos Faloutsos

Simple and effective visual models for gene expression cancer diagnostics.

Gregor Leban Minca Mramor Ivan Bratko Blaz Zupan

Feature bagging for outlier detection.

Aleksandar Lazarevic Vipin Kumar

Combining partitions by probabilistic label aggregation.

Tilman Lange Joachim M. Buhmann

A multiple tree algorithm for the efficient association of asteroid observations.

Jeremy Kubica Andrew W. Moore Andrew J. Connolly Robert Jedicke

Local sparsity control for naive Bayes with extreme misclassification costs.

Aleksander Kolcz

Fast discovery of unexpected patterns in data, relative to a Bayesian network.

Szymon Jaroszewicz Tobias Scheffer

Nomograms for visualizing support vector machines.

Aleks Jakulin Martin Mozina Janez Demsar Ivan Bratko Blaz Zupan

Combining email models for false positive reduction.

Shlomo Hershkop Salvatore J. Stolfo

Wavelet synopsis for data streams: minimizing non-euclidean error.

Sudipto Guha Boulos Harb

The predictive power of online chatter.

Daniel Gruhl Ramanathan V. Guha Ravi Kumar Jasmine Novak Andrew Tomkins

Non-redundant clustering with conditional ensembles.

David Gondek Thomas Hofmann

Mining tree queries in a graph.

Bart Goethals Eveline Hoekx Jan Van den Bussche

Dimension induced clustering.

Aristides Gionis Alexander Hinneburg Spiros Papadimitriou Panayiotis Tsaparas

Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering.

Bin Gao Tie-Yan Liu Xin Zheng QianSheng Cheng Wei-Ying Ma

Rule extraction from linear support vector machines.

Glenn Fung Sathyakama Sandilya R. Bharat Rao

Mining images on semantics via statistical learning.

Jianping Fan Hangzai Luo Mohand-Said Hacid

Variable latent semantic indexing.

Anirban Dasgupta Ravi Kumar Prabhakar Raghavan Andrew Tomkins

A Bayesian network classifier with inverse tree structure for voxelwise magnetic resonance image analysis.

Rong Chen Edward Herskovits

The architecture of complexity: the structure and the dynamics of networks, from the web to the cell.

Albert-László Barabási

Mining the internet: the eighth wonder of the world.

Gian Fulgoni

Incentive networks.

Prabhakar Raghavan