cikm14

cikm 2002 论文列表

Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, McLean, VA, USA, November 4-9, 2002.

Parallelizing the buckshot algorithm for efficient document clustering.
Mining soft-matching association rules.
Mining coverage statistics for websource selection in a mediator.
Web-DL: an experience in building digital libraries from the web.
Knowledge discovery in patent databases.
Knowledge discovery from texts: a concept frame graph approach.
Interactive methods for taxonomy editing and validation.
Index compression vs. retrieval time of inverted files for XML documents.
High-performing feature selection for text classification.
Ginga: a self-adaptive query processing system.
Discovering the representative of a search engine.
Data fusion with estimated weights.
Features of documents relevant to task- and fact-oriented questions.
An agent-based approach to knowledge management.
A system for knowledge management in bioinformatics.
A syntactic approach for searching similarities within sentences.
A new cache replacement algorithm for the integration of web caching and prefectching.
Using specification-driven concepts for distributed data management and dissemination.
A mapping mechanism to support bitmap index and other auxiliary structures on tables stored as primary B±trees.
Automatically classifying database workloads.
The verity federated infrastructure.
Comparison of interestingness functions for learning web usage patterns.
Rule-based data quality.
Semantic technology applications for homeland security.
Thematic mapping - from unstructured documents to taxonomies.
Alternatives to the k-means algorithm that find better clusterings.
FREM: fast and robust EM clustering for large data sets.
COOLCAT: an entropy-based algorithm for categorical clustering.
Entropy-based link analysis for mining web informative structures.
Using micro information units for internet search.
Personalized web search by mapping user queries to categories.
I/O-efficient techniques for computing pagerank.
Condorcet fusion for improved retrieval.
Knowledge-based extraction of named entities.
Strategies for minimising errors in hierarchical web categorisation.
Evaluation of hierarchical clustering algorithms for document datasets.
Inferring hierarchical descriptions.
Evaluating contents-link coupled web page clustering for web search results.
Mining temporal classes from time series data.
Evaluating continuous nearest neighbor queries for streaming time series via pre-fetching.
Efficient query monitoring using adaptive multiple key hashing.
RHist: adaptive summarization over continuous data streams.
Information retrieval on the semantic web.
Discovering approximate keys in XML data.
XKvalidator: a constraint validator for XML.
A singer identification technique for content-based classification of MP3 music objects.
Harmonic models for polyphonic music retrieval.
The effectiveness study of various music information retrieval approaches.
Trajectory queries and octagons in moving object databases.
"GeoPlot": spatial data mining on video libraries.
An efficient and effective algorithm for density biased sampling.
A language modeling framework for resource selection and results merging.
Capturing term dependencies using a language model based on sentence trees.
Passage retrieval based on language models.
Knowledge and information management: Is it possible to do interesting and important research, get funded, be useful and appreciated?
Categorizing information objects from user access patterns.
Using conjunction of attribute values for classification.
Boosting to correct inductive bias in text classification.
On arabic search: improving the retrieval effectiveness via a light stemming approach.
Pruning long documents for distributed information retrieval.
Query association for effective retrieval.
Partial rollback in object-oriented/object-relational database management systems.
Intelligent knowledge discovery in peer-to-peer file sharing.
A local search mechanism for peer-to-peer networks.
XClust: clustering XML schemas for effective integration.
NeT & CoT: translating relational schemas to XML schemas using semantic constraints.
Logical and physical support for heterogeneous data.
Inferring query models by computing information flow.
The role of variance in term weighting for probabilistic information retrieval.
Detecting similar documents using salient terms.
Similarity based retrieval from sequence databases using automata as queries.
On the efficient evaluation of relaxed queries in biological databases.
How to improve the pruning ability of dynamic metric access methods.
Topic-based document segmentation with probabilistic latent semantic analysis.
Structural extraction from visual layout of documents.
AuGEAS: authoritativeness grading, estimation, and sorting.
Cooperative caching by mobile clients in push-based information systems.
A self-managing data cache for edge-of-network web applications.
Efficient prediction of web accesses on a proxy server.
An object-oriented extension of XML for autonomous web applications.
Efficient synchronization for mobile XML data.
XMLTM: efficient transaction management for XML documents.
Multi-level operator combination in XML query processing.
Query processing of streamed XML data.
Efficient evaluation of multiple queries on streaming XML data.
Vulnerabilities in similarity search based systems.
A compact and efficient image retrieval approach based on border/interior pixel classification.
Symbolic photograph content-based retrieval.
Future directions in data mining: streams, networks, self-similarity and power laws.
Semantic-based delivery of OLAP summary tables in wireless environments.
A fast filtering scheme for large database cleansing.
Batch data warehouse maintenance in dynamic environments.
Analysis of pre-computed partition top method for range top-k queries in OLAP data cubes.
Removing redundancy and inconsistency in memory-based collaborative filtering.
Meta-recommendation systems: user-controlled integration of diverse recommendations.
Topic-oriented collaborative crawling.
Searching web databases by structuring keyword-based queries.
Mining sequential patterns with constraints in large databases.
An iterative strategy for pattern discovery in high-dimensional data sets.
F4: large-scale automated forecasting using fractals.
On scalable information retrieval systems.