Volume 29, Number 12, December 2018
A Survey of Desktop Grid Scheduling.

Evgeny Ivashko Ilya Chernov Natalia Nikitina

Tsumiki: A Meta-Platform for Building Your Own Testbed.

Justin Cappos Yanyan Zhuang Albert Rafetseder Ivan Beschastnikh

SMGuard: A Flexible and Fine-Grained Resource Management Framework for GPUs.

Chao Yu Yuebin Bai Hailong Yang Kun Cheng Yuhao Gu Zhongzhi Luan Depei Qian

PEPS++: Towards Extreme-Scale Simulations of Strongly Correlated Quantum Many-Particle Models on Sunway TaihuLight.

Lixin He Hong An Chao Yang Fei Wang Junshi Chen Chao Wang Weihao Liang Shao-Jun Dong Qiao Sun Wenting Han Wenyuan Liu Yongjian Han Wenjun Yao

Joint Scheduling and Source Selection for Background Traffic in Erasure-Coded Storage.

Shijing Li Tian Lan Moo-Ryong Ra Rajesh Krishna Panta

Improving Medium-Grain Partitioning for Scalable Sparse Tensor Decomposition.

Seher Acer Tugba Torun Cevdet Aykanat

Exploring Customizable Heterogeneous Power Distribution and Management for Datacenter.

Longjun Liu Hongbin Sun Chao Li Yang Hu Tao Li Nanning Zheng

Execution-Efficient Response Time Analysis on Global Multiprocessor Platforms.

Quan Zhou Guohui Li Jianjun Li Chenggang Deng

ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems.

Sameh Abdulah Hatem Ltaief Ying Sun Marc G. Genton David E. Keyes

Encoding-Aware Data Placement for Efficient Degraded Reads in XOR-Coded Storage Systems: Algorithms and Evaluation.

Zhirong Shen Patrick P. C. Lee Jiwu Shu Wenzhong Guo

Developing User Perceived Value Based Pricing Models for Cloud Markets.

Peijin Cong Liying Li Junlong Zhou Kun Cao Tongquan Wei Mingsong Chen Shiyan Hu

Competitiveness of a Non-Linear Block-Space GPU Thread Map for Simplex Domains.

Cristobal A. Navarro Matthieu Vernier Benjamin Bustos Nancy Hitschfeld

Coalescing and Deduplicating Incremental Checkpoint Files for Restore-Express Multi-Level Checkpointing.

Purushottam Sigdel Nian-Feng Tzeng

Analysis and Design Techniques towards High-Performance and Energy-Efficient Dense Linear Solvers on GPUs.

Ahmad Abdelfattah Azzam Haidar Stanimire Tomov Jack J. Dongarra

An Efficient and Fair Multi-Resource Allocation Mechanism for Heterogeneous Servers.

Jalal Khamse-Ashari Ioannis Lambadaris George Kesidis Bhuvan Urgaonkar Yiqiang Q. Zhao

Adaptive Scheduling Parallel Jobs with Dynamic Batching in Spark Streaming.

Dazhao Cheng Xiaobo Zhou Yu Wang Changjun Jiang

A Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementation.

Feroz Zahid Amir Taherkordi Ernst Gunnar Gran Tor Skeie Bjørn Dag Johnsen

A Flattened Metadata Service for Distributed File Systems.

Siyang Li Fenlin Liu Jiwu Shu Youyou Lu Tao Li Yang Hu


Volume 29, Number 11, November 2018
Towards Stable Flow Scheduling in Data Centers.

Tong Zhang Fengyuan Ren Ran Shu

SnapFiner: A Page-Aware Snapshot System for Virtual Machines.

Lei Cui Zhiyu Hao Lun Li Xiaochun Yun

Scalable Data Race Detection for Lock-Intensive Programs with Pending Period Representation.

Xiaofei Liao Minhao Lin Long Zheng Hai Jin Zhiyuan Shao

Parallel Computation of Component Trees on Distributed Memory Machines.

Markus Götz Gabriele Cavallaro Thierry Géraud Matthias Book Morris Riedel

Multi-Objective Optimization for Virtual Machine Allocation and Replica Placement in Virtualized Hadoop.

Carlos Guerrero Isaac Lera Belén Bermejo Carlos Juiz

mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection.

Yingbo Cui Shaoliang Peng Yutong Lu Xiaoqian Zhu Bingqiang Wang Chengkun Wu Xiangke Liao

MPCA SGD - A Method for Distributed Training of Deep Learning Models on Spark.

Matthias Langer Ashley Hall Zhen He Wenny Rahayu

M-Oscillating: Performance Maximization on Temperature-Constrained Multi-Core Processors.

Shi Sha Wujie Wen Shaolei Ren Gang Quan

Minimize the Make-span of Batched Requests for FPGA Pooling in Cloud Computing.

Yangming Zhao Chen Tian Zhuangdi Zhu Jie Cheng Chunming Qiao Alex X. Liu

Minimal Cost Server Configuration for Meeting Time-Varying Resource Demands in Cloud Centers.

Chubo Liu Kenli Li Keqin Li

LWPTool: A Lightweight Profiler to Guide Data Layout Optimization.

Chao Yu Probir Roy Yuebin Bai Hailong Yang Xu Liu

Extending the Cutting Stock Problem for Consolidating Services with Stochastic Workloads.

Marcus Hähnel John Martinovic Guntram Scheithauer Andreas Fischer Alexander Schill Waltenegus Dargie

Expressive Content-Based Routing in Software-Defined Networks.

Sukanya Bhowmik Muhammad Adnan Tariq Jonas Grunert Deepak Srinivasan Kurt Rothermel

Early Identification of Critical Blocks: Making Replicated Distributed Storage Systems Reliable Against Node Failures.

Juntao Fang Shenggang Wan Ping Huang Changsheng Xie Xubin He

Dynamic Resource Scheduling in Mobile Edge Cloud with Cloud Radio Access Network.

Xinhou Wang Kezhi Wang Song Wu Sheng Di Hai Jin Kun Yang Shumao Ou

Core Maintenance in Dynamic Graphs: A Parallel Approach Based on Matching.

Hai Jin Na Wang Dongxiao Yu Qiang-Sheng Hua Xuanhua Shi Xia Xie

BiGNoC: Accelerating Big Data Computing with Application-Specific Photonic Network-on-Chip Architectures.

Sai Vineel Reddy Chittamuru Dharanidhar Dang Sudeep Pasricha Rabi N. Mahapatra


Volume 29, Number 10, October 2018
A Survey on Recent OS-Level Energy Management Techniques for Mobile Processing Units.

Young Geun Kim Joonho Kong Sung Woo Chung

Workload Scheduling for Massive Storage Systems with Arbitrary Renewable Supply.

Daping Li Xiaoyang Qu Jiguang Wan Jun Wang Yang Xia Xiaozhao Zhuang Changsheng Xie

Triggered-Issuance and Triggered-Execution: A Control Paradigm to Minimize Pipeline Stalls in Distributed Controlled Coarse-Grained Reconfigurable Arrays.

Yanan Lu Leibo Liu Yangdong Deng Jian Weng Shouyi Yin Yiyu Shi Shaojun Wei

TerrierTail: Mitigating Tail Latency of Cloud Virtual Machines.

Esmail Asyabi SeyedAlireza SanaeeKohroudi Mohsen Sharifi Azer Bestavros

Penguin: Efficient Query-Based Framework for Replaying Large Scale Historical Data.

Rong Gu Yu-Fa Zhou Zhaokang Wang Chunfeng Yuan Yihua Huang

Optimizations of Unstructured Aerodynamics Computations for Many-core Architectures.

Mohammed A. Al Farhan David E. Keyes

Online Tuning of EASY-Backfilling using Queue Reordering Policies.

Éric Gaussier Jérôme Lelong Valentin Reis Denis Trystram

Multi-Level Domain-Decomposition Strategy for Solving the Eikonal Equation with the Fast-Sweeping Method.

Anup Shrestha Inanc Senocak

Massively Parallel Stencil Code Solver with Autonomous Adaptive Block Distribution.

Marco Berghoff Ivan Kondov Johannes Hötzer

Machine Learning-Based Quality-Aware Power and Thermal Management of Multistream HEVC Encoding on Multicore Servers.

Arman Iranfar Marina Zapater David Atienza

Improving Restore Performance in Deduplication-Based Backup Systems via a Fine-Grained Defragmentation Approach.

Yujuan Tan Baiping Wang Jian Wen Zhichao Yan Hong Jiang Witawas Srisa-an

Elastic Parity Logging for SSD RAID Arrays: Design, Analysis, and Implementation.

Helen H. W. Chan Yongkun Li Patrick P. C. Lee Yinlong Xu

Distributed Stream Rebalance for Stateful Operator Under Workload Variance.

Junhua Fang Rong Zhang Tom Z. J. Fu Zhenjie Zhang Aoying Zhou Xiaofang Zhou

CUDAMPF++: A Proactive Resource Exhaustion Scheme for Accelerating Homologous Sequence Search on CUDA-Enabled GPU.

Hanyu Jiang Narayan Ganesan Yu-Dong Yao

BenchBox: A User-Driven Benchmarking Framework for Fat-Client Storage Systems.

Raúl Gracia Tinedo Chenglong Zou Marc Sánchez Artigas Pedro García López

A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms.

Hamidreza Khaleghzadeh Ravi Reddy Manumachu Alexey L. Lastovetsky

A Dataflow Processor as the Basis of a Tiled Polymorphic Computing Architecture with Fine-Grain Instruction Migration.

David Hentrich Erdal Oruklu Jafar Saniie


Volume 29, Number 9, September 2018
Unleashing Fine-Grained Parallelism on Embedded Many-Core Accelerators with Lightweight OpenMP Tasking.

Giuseppe Tagliavini Daniele Cesarini Andrea Marongiu

Unifying Fixed Code Mapping, Communication, Synchronization and Scheduling Algorithms for Efficient and Scalable Loop Pipelining.

Aristeidis Mastoras Thomas R. Gross

TripleID-Q: RDF Query Processing Framework Using GPU.

Chantana Chantrapornchai Chidchanok Choksuchat

Stress-Aware Loops Mapping on CGRAs with Dynamic Multi-Map Reconfiguration.

Jiangyuan Gu Shouyi Yin Leibo Liu Shaojun Wei

RoB-Router: A Reorder Buffer Enabled Low Latency Network-on-Chip Router.

Cunlu Li Dezun Dong Zhonghai Lu Xiangke Liao

Online Auction for IaaS Clouds: Towards Elastic User Demands and Weighted Heterogeneous VMs.

Juan Li Yanmin Zhu Jiadi Yu Chengnian Long Guangtao Xue Shiyou Qian

On the Synchronization Bottleneck of OpenStack Swift-Like Cloud Storage Systems.

Mingkang Ruan Thierry Titcheu Ennan Zhai Zhenhua Li Yao Liu Jinlong E Yong Cui Hong Xu

MCL: A Cost-Efficient Nonblocking Multicast Interconnection Network.

Jun Duan Yuanyuan Yang

Implementing Snapshot Objects on Top of Crash-Prone Asynchronous Message-Passing Systems.

Carole Delporte-Gallet Hugues Fauconnier Sergio Rajsbaum Michel Raynal

Game-Based Thermal-Delay-Aware Adaptive Routing (GTDAR) for Temperature-Aware 3D Network-on-Chip Systems.

Kun-Chih Chen

Firework: Data Processing and Sharing for Hybrid Cloud-Edge Analytics.

Quan Zhang Qingyang Zhang Weisong Shi Hong Zhong

Error Resilient GPU Accelerated Image Processing for Space Applications.

R. L. Davidson Christopher P. Bridges

Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications.

Min Si Antonio J. Peña Jeff R. Hammond Pavan Balaji Masamichi Takagi Yutaka Ishikawa

Asynchronous and Exact Forward Recovery for Detected Errors in Iterative Solvers.

Luc Jaulmes Miquel Moretó Eduard Ayguadé Jesús Labarta Mateo Valero Marc Casas

Analysis of Bounds on Hybrid Vector Clocks.

Sorrachai Yingchareonthawornchai Duong N. Nguyen Sandeep S. Kulkarni Murat Demirbas

An Improved Approximation for Scheduling Malleable Tasks with Precedence Constraints via Iterative Method.

Chi-Yeh Chen

A Code Generator for Energy-Efficient Wavefront Parallelization of Uniform Dependence Computations.

Yun Zou Sanjay V. Rajopadhye


Volume 29, Number 8, August 2018
Unraveling Network-Induced Memory Contention: Deeper Insights with Machine Learning.

Taylor L. Groves Ryan E. Grant Aaron Gonzales Dorian C. Arnold

TA-Update: An Adaptive Update Scheme with Tree-Structured Transmission in Erasure-Coded Storage Systems.

Yijie Wang Xiaoqiang Pei Xingkong Ma Fangliang Xu

Symmetric Indefinite Linear Solver Using OpenMP Task on Multicore Architectures.

Ichitaro Yamazaki Jakub Kurzak Panruo Wu Mawussi Zounon Jack J. Dongarra

Sparse Geometries Handling in Lattice Boltzmann Method Implementation for Graphic Processors.

Tadeusz Tomczak Roman G. Szafran

Secure Integrated Circuit Design via Hybrid Cloud.

Xingliang Yuan Jian Weng Cong Wang Kui Ren

Eunomia: Scaling Concurrent Index Structures Under Contention Using HTM.

Weihua Zhang Xin Wang Shiyu Ji Ziyun Wei Zhaoguo Wang Haibo Chen

Scalable GPU Virtualization with Dynamic Sharing of Graphics Memory Space.

Mochi Xue Jiacheng Ma Wentai Li Kun Tian Yaozu Dong Jinyu Wu Zhengwei Qi Bingsheng He Haibing Guan

Preserving Model Privacy for Machine Learning in Distributed Systems.

Qi Jia Linke Guo Zhanpeng Jin Yuguang Fang

Performance Model of MapReduce Iterative Applications for Hybrid Cloud Bursting.

Francisco J. Clemente-Castelló Bogdan Nicolae Rafael Mayo Juan Carlos Fernández

On Random Wiring in Practicable Folded Clos Networks for Modern Datacenters.

Cristobal Camarero Carmen Martínez Ramón Beivide

Non-Preemptive Scheduling for Mixed-Criticality Real-Time Multiprocessor Systems.

Hyeongboo Baek Namyong Jung Hoon Sung Chwa Insik Shin Jinkyu Lee

Managing Risk in a Derivative IaaS Cloud.

Prateek Sharma Stephen Lee Tian Guo David E. Irwin Prashant J. Shenoy

List-Scheduling versus Cluster-Scheduling.

Huijun Wang Oliver Sinnen

Enabling Generic, Verifiable, and Secure Data Search in Cloud Services.

Jie Zhu Qi Li Cong Wang Xingliang Yuan Qian Wang Kui Ren

Efficient Realization of Householder Transform Through Algorithm-Architecture Co-Design for Acceleration of QR Factorization.

Farhad Merchant Tarun Vatwani Anupam Chattopadhyay Soumyendu Raha S. K. Nandy Ranjani Narayan

Efficient Performance-Centric Bandwidth Allocation with Fairness Tradeoff.

Li Chen Yuan Feng Baochun Li Bo Li

A Fault-Tolerant Framework for Asynchronous Iterative Computations in Cloud Environments.

Zhigang Wang Lixin Gao Yu Gu Yubin Bao Ge Yu


Volume 29, Number 7, July 2018
Virtual Network Function Placement Considering Resource Optimization and SFC Requests in Cloud Datacenter.

Defang Li Peilin Hong Kaiping Xue Jianing Pei

Strategy-Proof Mechanism for Provisioning and Allocation Virtual Machines in Heterogeneous Clouds.

Xi Liu Weidong Li Xuejie Zhang

Scalable Minimum-Cost Balanced Partitioning of Large-Scale Social Networks: Online and Offline Solutions.

Romas James Hada Hongyi Wu Miao Jin

Replication-Based Fault-Tolerance for Large-Scale Graph Processing.

Rong Chen Youyang Yao Peng Wang Kaiyuan Zhang Zhaoguo Wang Haibing Guan Binyu Zang Haibo Chen

ReCA: An Efficient Reconfigurable Cache Architecture for Storage Systems with Online Workload Characterization.

Reza Salkhordeh Shahriar Ebrahimi Hossein Asadi

Race-Condition-Aware and Hardware-Oriented Task Partitioning and Scheduling Using Entropy Maximization.

Sizhao Li Yuanzhi Zhang Hongyin Luo Yan Chen Chao Lu Donghui Guo

Quantifying the Impact of Variability and Heterogeneity on the Energy Efficiency for a Next-Generation Ultra-Green Supercomputer.

Francesco Fraternali Andrea Bartolini Carlo Cavazzoni Luca Benini

PROSA: Protocol-Driven Network on Chip Architecture.

Miguel Gorgues Alonso Jose Flich

Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints.

Markus Fidler Brenton D. Walker Yuming Jiang

MSGD: A Novel Matrix Factorization Approach for Large-Scale Collaborative Filtering Recommender Systems on GPUs.

Hao Li Kenli Li Ji-yao An Keqin Li

iDaaS: Inter-Datacenter Network as a Service.

Wenxin Li Deke Guo Keqiu Li Heng Qi Jianhui Zhang

Hybrid Transactional Replication: State-Machine and Deferred-Update Replication Combined.

Tadeusz Kobus Maciej Kokocinski Pawel T. Wojciechowski

G-CRS: GPU Accelerated Cauchy Reed-Solomon Coding.

Chengjian Liu Qiang Wang Xiaowen Chu Yiu-Wing Leung

Alleviating Memory Refresh Overhead via Data Compression for High Performance and Energy Efficiency.

Ke Zhou Wenjie Liu Kun Tang Ping Huang Xubin He

A Parallel Complex Coloring Algorithm for Scheduling of Input-Queued Switches.

Lingkang Wang Tong Ye Tony Tong Lee Weisheng Hu

A Model Predictive Controller for Managing QoS Enforcements and Microarchitecture-Level Interferences in a Lambda Platform.

M. Reza HoseinyFarahabady Albert Y. Zomaya Zahir Tari


Volume 29, Number 6, June 2018
Towards Memory-Efficient Allocation of CNNs on Processing-in-Memory Architecture.

Yi Wang Weixuan Chen Jing Yang Tao Li

SnapMig: Accelerating VM Live Storage Migration by Leveraging the Existing VM Snapshots in the Cloud.

Yaodong Yang Bo Mao Hong Jiang Yuekun Yang Hao Luo Suzhen Wu

Scheduling Stochastic Multi-Stage Jobs to Elastic Hybrid Cloud Resources.

Jie Zhu Xiaoping Li Rubén Ruiz Xiaolong Xu

Power-Aware and Performance-Guaranteed Virtual Machine Placement in the Cloud.

Hui Zhao Jing Wang Feng Liu Quan Wang Weizhan Zhang Qinghua Zheng

MIA: Metric Importance Analysis for Big Data Workload Characterization.

Zhibin Yu Wen Xiong Lieven Eeckhout Zhendong Bei Avi Mendelson Chengzhong Xu

Malleable Task-Graph Scheduling with a Practical Speed-Up Model.

Loris Marchal Bertrand Simon Oliver Sinnen Frédéric Vivien

Maelstream: Self-Organizing Media Streaming for Many-to-Many Interaction.

Lucas Provensi Abhishek Singh Frank Eliassen Roman Vitenberg

Learning-Based Memory Allocation Optimization for Delay-Sensitive Big Data Processing.

Linjiun Tsai Hubertus Franke Chung-Sheng Li Wanjiun Liao

Holistic Virtual Machine Scheduling in Cloud Datacenters towards Minimizing Total Energy.

Xiang Li Peter Garraghan Xiaohong Jiang Zhaohui Wu Jie Xu

High-Speed Transfer Optimization Based on Historical Analysis and Real-Time Tuning.

Engin Arslan Tevfik Kosar

GrapH: Traffic-Aware Graph Processing.

Christian Mayer Muhammad Adnan Tariq Ruben Mayer Kurt Rothermel

GFlink: An In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data.

Cen Chen Kenli Li Aijia Ouyang Zeng Zeng Keqin Li

EDC: Improving the Performance and Space Efficiency of Flash-Based Storage Systems with Elastic Data Compression.

Bo Mao Suzhen Wu Hong Jiang Yaodong Yang Zaifa Xi

Distributed Randomized k-Clustering Based PCID Assignment for Ultra-Dense Femtocellular Networks.

Ajay Pratap Rishabh Singhal Rajiv Misra Sajal K. Das

Context-Aware Task Migration for HART-Centric Collaboration over FiWi Based Tactile Internet Infrastructures.

Mahfuzulhoq Chowdhury Eckehard G. Steinbach Wolfgang Kellerer Martin Maier

A New Algorithm for Parallel Connected-Component Labelling on GPUs.

Daniel P. Playne Kenneth A. Hawick

A Differentiated Caching Mechanism to Enable Primary Storage Deduplication in Clouds.

Huijun Wu Chen Wang Yinjin Fu Sherif Sakr Kai Lu Liming Zhu


Volume 29, Number 5, May 2018
TokenTLB+CUP: A Token-Based Page Classification with Cooperative Usage Prediction.

Albert Esteve Alberto Ros Antonio Robles María Engracia Gómez

Reducing Cache Coherence Traffic with a NUMA-Aware Runtime Approach.

Paul Caheny Lluc Alvarez Said Derradji Mateo Valero Miquel Moretó Marc Casas

Memory Hierarchy Characterization of NoSQL Applications through Full-System Simulation.

Adrian Colaso Pablo Prieto Jose Angel Herrero Pablo Abad Fidalgo Lucia G. Menezo Valentin Puente José-Ángel Gregorio

Long-Term Multi-Resource Fairness for Pay-as-you Use Computing Systems.

Shanjiang Tang Zhaojie Niu Bingsheng He Bu-Sung Lee Ce Yu

Light Weight Write Mechanism for Cloud Data.

Mosarrat Jahan Mohsen Rezvani Qianrui Zhao Partha Sarathi Roy Kouichi Sakurai Aruna Seneviratne Sanjay K. Jha

Lattice-Based Turn Model for Adaptive Routing.

Edoardo Fusella Alessandro Cilardo

Large-Scale and Extreme-Scale Computing with Stranded Green Power: Opportunities and Costs.

Fan Yang Andrew A. Chien

Intra-Node Memory Safe GPU Co-Scheduling.

Carlos Reaño Federico Silla Dimitrios S. Nikolopoulos Blesson Varghese

G-ML-Octree: An Update-Efficient Index Structure for Simulating 3D Moving Objects Across GPUs.

Ze Deng Lizhe Wang Wei Han Rajiv Ranjan Albert Y. Zomaya

Evacuate Before Too Late: Distributed Backup in Inter-DC Networks with Progressive Disasters.

Xiaokang Xie Qing Ling Ping Lu Wei Xu Zuqing Zhu

Efficient Timing Channel Protection for Hybrid (Packet/Circuit-Switched) Network-on-Chip.

Arnab Kumar Biswas

CoreVA-MPSoC: A Many-Core Architecture with Tightly Coupled Shared and Local Data Memories.

Johannes Ax Gregor Sievers Julian Daberkow Martin Flasskamp Marten Vohrmann Thorsten Jungeblut Wayne Kelly Mario Porrmann Ulrich Rückert

CoMan: Managing Bandwidth Across Computing Frameworks in Multiplexed Datacenters.

Wenxin Li Deke Guo Alex X. Liu Keqiu Li Heng Qi Song Guo Ali Munir Xiaoyi Tao

Auditing Big Data Storage in Cloud Computing Using Divide and Conquer Tables.

Mehdi Sookhak F. Richard Yu Albert Y. Zomaya

A Write-Friendly and Cache-Optimized Hashing Scheme for Non-Volatile Memory Systems.

Pengfei Zuo Yu Hua

A Guide for Achieving High Performance with Very Small Matrices on GPU: A Case Study of Batched LU and Cholesky Factorizations.

Azzam Haidar Ahmad Abdelfattah Mawussi Zounon Stanimire Tomov Jack J. Dongarra

A Framework for the Automatic Vectorization of Parallel Sort on x86-Based Processors.

Kaixi Hou Hao Wang Wu-Chun Feng


Volume 29, Number 4, April 2018
Storage, Communication, and Load Balancing Trade-off in Distributed Cache Networks.

Mahdi Jafari Siavoshani Ali Pourmiri Seyed Pooya Shariatpanahi

Sketch Acceleration on FPGA and its Applications in Network Anomaly Detection.

Da Tong Viktor K. Prasanna

Scheduling Parallel Real-Time Recurrent Tasks on Multicore Platforms.

Risat Pathan Petros Voudouris Per Stenström

MeLoDy: A Long-Term Dynamic Quality-Aware Incentive Mechanism for Crowdsourcing.

Hongwei Wang Song Guo Jiannong Cao Minyi Guo

LVRM: On the Design of Efficient Link Based Virtual Resource Management Algorithm for Cloud Platforms.

Prasan Kumar Sahoo Chinmaya Kumar Dehury Bharadwaj Veeravalli

Loop Tiling in Large-Scale Stencil Codes at Run-Time with OPS.

István Z. Reguly Gihan R. Mudalige Michael B. Giles

Joint DVFS and Parallelism for Energy Efficient and Low Latency Software Video Decoding.

Yahia Benmoussa Eric Senn Nicolas Derouineau Nicolas Tizon Jalil Boukhobza

FA-Stack: A Fast Array-Based Stack with Wait-Free Progress Guarantee.

Yaqiong Peng Zhiyu Hao

Efficient Disk-Based Directed Graph Processing: A Strongly Connected Component Approach.

Yu Zhang Xiaofei Liao Xiang Shi Hai Jin Bingsheng He

Distributed Convergence Detection Based on Global Residual Error Under Asynchronous Iterations.

Frédéric Magoulès Guillaume Gbikpi Benissan

Computing Hierarchical Summary from Two-Dimensional Big Data Streams.

Zubair Shah Abdun Naser Mahmood Michael Barlow Zahir Tari Xun Yi Albert Y. Zomaya

Blocking Analysis for Spin Locks in Real-Time Parallel Tasks.

Son Dinh Jing Li Kunal Agrawal Christopher D. Gill Chenyang Lu

AROMa: Aging-Aware Deadlock-Free Adaptive Routing Algorithm and Online Monitoring in 3D NoCs.

Zana Ghaderi Ayed Alqahtani Nader Bagherzadeh

An Efficient In-Memory Checkpoint Method and its Practice on Fault-Tolerant HPL.

Xiongchao Tang Jidong Zhai Bowen Yu Wenguang Chen Weimin Zheng Keqin Li

A Study of Systems with Multiple Operating Levels, Probabilistic Thresholds and Hysteresis.

Alexandre Brandwajn Thomas Begin Hind Castel-Taleb Tülin Atmaca

A Hierarchical RAID Architecture Towards Fast Recovery and High Reliability.

Yongkun Li Neng Wang Chengjin Tian Si Wu Yueming Zhang Yinlong Xu

A Double Auction Mechanism to Bridge Users' Task Requirements and Providers' Resources in Two-Sided Cloud Markets.

Li Lu Jiadi Yu Yanmin Zhu Minglu Li


Volume 29, Number 3, March 2018
Errata to "Evaluation of a Heterogeneous Multicore Architecture by Design and Test of an OFDM Receiver".

Sajjad Nouri Waqar Hussain Jari Nurmi

Time- and Cost-Efficient Task Scheduling across Geo-Distributed Data Centers.

Zhiming Hu Baochun Li Jun Luo

Queue Delegation Locking.

David Klaftenegger Konstantinos Sagonas Kjell Winblad

Quadboost: A Scalable Concurrent Quadtree.

Ke-ren Zhou Guangming Tan Wei Zhou

Parallel Algorithm for Incremental Betweenness Centrality on Large Graphs.

Fuad T. Jamour Spiros Skiadopoulos Panos Kalnis

P3S: A Methodology to Analyze and Predict Application Scalability.

Javier Panadero Alvaro Wong Dolores Rexachs Emilio Luque

OrthoNoC: A Broadcast-Oriented Dual-Plane Wireless Network-on-Chip Architecture.

Sergi Abadal Josep Torrellas Eduard Alarcón Albert Cabellos-Aparicio

Metascheduling of HPC Jobs in Day-Ahead Electricity Markets.

Prakash Murali Sathish Vadhiyar

kNN-DP: Handling Data Skewness in kNN Joins Using MapReduce.

Xujun Zhao Jifu Zhang Xiao Qin

IBOM: An Integrated and Balanced On-Chip Memory for High Performance GPGPUs.

Jianfei Wang Qin Wang Li Jiang Chao Li Xiaoyao Liang Naifeng Jing

Elastic Symbiotic Scaling of Operators and Resources in Stream Processing Systems.

Federico Lombardi Leonardo Aniello Silvia Bonomi Leonardo Querzoni

Cost-Efficient and Robust On-Demand Video Transcoding Using Heterogeneous Cloud Services.

Xiangbo Li Mohsen Amini Salehi Magdy A. Bayoumi Nian-Feng Tzeng Rajkumar Buyya

Cache-Oblivious MPI All-to-All Communications Based on Morton Order.

Shigang Li Yunquan Zhang Torsten Hoefler

Automatic Detection of Large Extended Data-Race-Free Regions with Conflict Isolation.

Alexandra Jimborean Per Ekemark Jonatan Waern Stefanos Kaxiras Alberto Ros

Argobots: A Lightweight Low-Level Threading and Tasking Framework.

Sangmin Seo Abdelhalim Amer Pavan Balaji Cyril Bordage George Bosilca Alex Brooks Philip H. Carns Adrián Castelló Damien Genet Thomas Hérault Shintaro Iwasaki Prateek Jindal Laxmikant V. Kalé Sriram Krishnamoorthy Jonathan Lifflander Huiwei Lu Esteban Meneses Marc Snir Yanhua Sun Kenjiro Taura Peter H. Beckman

A Relaxation-Based Network Decomposition Algorithm for Parallel Transient Stability Simulation with Improved Convergence.

Jian Shi Brian Sullivan Mike Mazzola Babak Saravi Uttam Adhikari Tomasz Haupt

A Hardware Architecture for Radial Basis Function Neural Network Classifier.

Mahnaz Mohammadi Akhil Krishna Nalesh Sivanandan S. K. Nandy


Volume 29, Number 2, February 2018
Using Hardware-Transactional-Memory Support to Implement Thread-Level Speculation.

Juan Salamanca José Nelson Amaral Guido Araujo

Towards Bandwidth Guarantee for Virtual Clusters Under Demand Uncertainty in Multi-Tenant Clouds.

Lei Yu Haiying Shen Zhipeng Cai Ling Liu Calton Pu

Toward High Mobile GPU Performance Through Collaborative Workload Offloading.

Chao Wu Bowen Yang Wenwu Zhu Yaoxue Zhang

Neurostream: Scalable and Energy Efficient Deep Learning with Smart Memory Cubes.

Erfan Azarkhish Davide Rossi Igor Loi Luca Benini

Machine Learning-Based Temperature Prediction for Runtime Thermal Management Across System Components.

Kaicheng Zhang Akhil Guliani Seda Ogrenci Memik Gokhan Memik Kazutomo Yoshii Rajesh Sankaran Peter H. Beckman

Efficient Distributed All-Pairs Algorithms: Management Using Optimal Cyclic Quorums.

Cory J. Kleinheksel Arun K. Somani

Easy PRAM-Based High-Performance Parallel Programming with ICE.

Fady Ghanim Uzi Vishkin Rajeev Barua

Distributed Privacy-Aware Fast Selection Algorithm for Large-Scale Data.

Hao Liu Jiming Chen

Confluence: Speeding Up Iterative Distributed Operations by Key-Dependency-Aware Partitioning.

Feng Liang Francis C. M. Lau Heming Cui Cho-Li Wang

Combining Static and Dynamic Storage Management for Data Intensive Scientific Workflows.

Nicholas L. Hazekamp Nathaniel Kremer-Herman Benjamín Tovar Haiyan Meng Olivia Choudhury Scott J. Emrich Douglas Thain

Capacity Optimization for Resource Pooling in Virtualized Data Centers with Composable Systems.

An-Dee Lin Chung-Sheng Li Wanjiun Liao Hubertus Franke

Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures.

Dalal Sukkari Hatem Ltaief Mathieu Faverge David E. Keyes

An Energy Efficient VM Management Scheme with Power-Law Characteristic in Video Streaming Data Centers.

Hsueh-Wen Tseng Ting-Ting Yang Kai-Cheng Yang Pei-Shan Chen

An Auto-Tuner for OpenCL Work-Group Size on GPUs.

Thanh Tuan Dao Jaejin Lee

AIRA: A Framework for Flexible Compute Kernel Execution in Heterogeneous Platforms.

Robert Lyerly Alastair Murray Antonio Barbalace Binoy Ravindran

A Novel Network Structure with Power Efficiency and High Availability for Data Centers.

Zhenhua Li Yuanyuan Yang

A Job Sizing Strategy for High-Throughput Scientific Workflows.

Benjamín Tovar Rafael Ferreira da Silva Gideon Juve Ewa Deelman William E. Allcock Douglas Thain Miron Livny


Volume 29, Number 1, January 2018
VOLAP: A Scalable Distributed Real-Time OLAP System for High-Velocity Data.

Frank Dehne David E. Robillard Andrew Rau-Chaplin Neil Burke

Value the Recent Past: Approximate Causal Consistency for Partially Replicated Systems.

Ta Yuan Hsu Ajay D. Kshemkalyani

Towards Quality Aware Information Integration in Distributed Sensing Systems.

Wenjun Jiang Chenglin Miao Lu Su Qi Li Shaohan Hu Shiguang Wang Jing Gao Hengchang Liu Tarek F. Abdelzaher Jiawei Han Xue (Steve) Liu Yan Gao Lance M. Kaplan

Scalable Deadlock-Free Deterministic Minimal-Path Routing Engine for InfiniBand-Based Dragonfly Networks.

German Maglione Mathey Pedro Yébenes Jesús Escudero-Sahuquillo Pedro Javier García Francisco J. Quiles Eitan Zahavi

Resource Optimization Across the Cloud Stack.

Zoltán Ádám Mann

Rapid Calculation of Max-Min Fair Rates for Multi-Commodity Flows in Fat-Tree Networks.

Md Atiqul Mollah Xin Yuan Scott Pakin Michael Lang

Random Regular Graph and Generalized De Bruijn Graph with k-Shortest Path Routing.

Peyman Faizian Md Atiqul Mollah Xin Yuan Zaid Alzaid Scott Pakin Michael Lang

Optimization of Error-Bounded Lossy Compression for Hard-to-Compress HPC Data.

Sheng Di Franck Cappello

Near-Memory Acceleration for Radio Astronomy.

Leandro Fiorin Rik Jongerius Erik Vermij Jan van Lunteren Christoph Hagleitner

GraphD: Distributed Vertex-Centric Graph Processing Beyond the Memory Limit.

Da Yan Yuzhen Huang Miao Liu Hongzhi Chen James Cheng Huanhuan Wu Chengcui Zhang

Energy-Aware Virtual Machine Scheduling on Data Centers with Heterogeneous Bandwidths.

Daniel Guimaraes do Lago Edmundo R. M. Madeira Deep Medhi

Energy Efficiency Aware Task Assignment with DVFS in Heterogeneous Hadoop Clusters.

Dazhao Cheng Xiaobo Zhou Palden Lama Mike Ji Changjun Jiang

CoCloud: Enabling Efficient Cross-Cloud File Collaboration Based on Inefficient Web APIs.

Jinlong E Yong Cui Peng Wang Zhenhua Li Chaokun Zhang

Architectural Synthesis of Multi-SIMD Dataflow Accelerators for FPGA.

Yun Wu John McAllister

Adaptive Resource Allocation and Provisioning in Multi-Service Cloud Environments.

Ayoub Alsarhan Awni Itradat Ahmed Yassin Al-Dubai Albert Y. Zomaya Geyong Min

Accelerating Persistent Scatterer Pixel Selection for InSAR Processing.

Tahsin Reza Aaron Zimmer José Manuel Delgado Blasco Parwant Ghuman Tanuj kr Aasawat Matei Ripeanu

A Design Space Exploration Methodology for Parameter Optimization in Multicore Processors.

Prasanna Kansakar Arslan Munir

State of the Journal.

Manish Parashar