Design and evaluation of a parallel algorithm for inferring topic hierarchies

作者:

Highlights:

• We propose a novel parallel Algorithm for inferring topic hierarchies using HLDA.

• We use loosely-coupled parallel tasks that do not require frequent synchronization.

• The parallel Algorithm is well-suited to be run on distributed computing systems.

• The proposed Algorithm achieves a predictive accuracy on par with that of HLDA.

• The parallel Algorithm exhibits a near-linear speed-up and scales well.

摘要

•We propose a novel parallel Algorithm for inferring topic hierarchies using HLDA.•We use loosely-coupled parallel tasks that do not require frequent synchronization.•The parallel Algorithm is well-suited to be run on distributed computing systems.•The proposed Algorithm achieves a predictive accuracy on par with that of HLDA.•The parallel Algorithm exhibits a near-linear speed-up and scales well.

论文关键词:Topic modeling,Hierarchical clustering,Information retrieval,Parallel algorithm,Cluster computing,Message passing interface

论文评审过程:Received 10 September 2014, Revised 3 June 2015, Accepted 8 June 2015, Available online 25 June 2015, Version of Record 25 June 2015.

论文官网地址:https://doi.org/10.1016/j.ipm.2015.06.006