Volume 31, Number 12, December 2020
HeteroYARN: A Heterogeneous FPGA-Accelerated Architecture Based on YARN.

Ruixuan Li Qi Yang Yuhua Li Xiwu Gu Weijun Xiao Keqin Li

Pattern-Based Dynamic Compilation System for CGRAs With Online Configuration Transformation.

Leibo Liu Xingchen Man Jianfeng Zhu Shouyi Yin Shaojun Wei

Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight.

Ping Gao Xiaohui Duan Tingjian Zhang Meng Zhang Bertil Schmidt Xun Zhang Hongliang Sun Wusheng Zhang Lin Gan Wei Xue Haohuan Fu Weiguo Liu Guangwen Yang

Traffic-Aware Erasure-Coded Archival Schemes for In-Memory Stores.

Bin Xu Jianzhong Huang Xiao Qin Qiang Cao

Towards Unaligned Writes Optimization in Cloud Storage With High-Performance SSDs.

Jiwu Shu Fei Li Siyang Li Youyou Lu

Spatially Bursty I/O on Supercomputers: Causes, Impacts and Solutions.

Jie Yu Wenxiang Yang Fang Wang Dezun Dong Jinghua Feng Yuqi Li

GPU-Accelerated Real-Time Stereo Estimation With Binary Neural Network.

Gang Chen Haitao Meng Yucheng Liang Kai Huang

Congestion-Balanced and Welfare-Maximized Charging Strategies for Electric Vehicles.

Qiang Tang Kezhi Wang Kun Yang Yuansheng Luo

GPGPU Performance Estimation With Core and Memory Frequency Scaling.

Qiang Wang Xiaowen Chu

Dynamic Undervolting to Improve Energy Efficiency on Multicore X86 CPUs.

Panos K. Koutsovasilis Konstantinos Parasyris Christos D. Antonopoulos Nikolaos Bellas Spyros Lalis

Resource Management for Power-Constrained HEVC Transcoding Using Reinforcement Learning.

Luis Costero Arman Iranfar Marina Zapater Francisco D. Igual Katzalin Olcoz David Atienza

Scheduling Periodical Multi-Stage Jobs With Fuzziness to Elastic Cloud Resources.

Jie Zhu Xiaoping Li Rubén Ruiz Wei Li Haiping Huang Albert Y. Zomaya

Distributed Training of Deep Learning Models: A Taxonomic Perspective.

Matthias Langer Zhen He Wenny Rahayu Yanbo Xue

Scalable, Multi-Constraint, Complex-Objective Graph Partitioning.

George M. Slota Cameron Root Karen D. Devine Kamesh Madduri Sivasankaran Rajamanickam

Interval Job Scheduling With Machine Launch Cost.

Runtian Ren Yuqing Zhu Chuanyou Li Xueyan Tang

Cartesian Partitioning Models for 2D and 3D Parallel SpGEMM Algorithms.

Gunduz Vehbi Demirci Cevdet Aykanat

Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers.

Wei Chen Xiaobo Zhou Jia Rao

Memory-Efficient and Skew-Tolerant MapReduce Over MPI for Supercomputing Systems.

Tao Gao Yanfei Guo Boyu Zhang Pietro Cicotti Yutong Lu Pavan Balaji Michela Taufer


Volume 31, Number 11, November 2020
System Error Prediction for Business Support Systems in Telecommunications Networks.

En-Hau Yeh Phone Lin Xin-Xue Lin Jeu-Yih Jeng Yuguang Fang

High-Quality Shared-Memory Graph Partitioning.

Yaroslav Akhremtsev Peter Sanders Christian Schulz

Countdown Slack: A Run-Time Library to Reduce Energy Footprint in Large-Scale MPI Applications.

Daniele Cesarini Andrea Bartolini Andrea Borghesi Carlo Cavazzoni Mathieu Luisier Luca Benini

Improving MPI Collective I/O for High Volume Non-Contiguous Requests With Intra-Node Aggregation.

Qiao Kang Sunwoo Lee Kaiyuan Hou Robert B. Ross Ankit Agrawal Alok N. Choudhary Wei-keng Liao

Fully Homomorphic based Privacy-Preserving Distributed Expectation Maximization on Cloud.

Abdulatif Alabdulatif Ibrahim Khalil Albert Y. Zomaya Zahir Tari Xun Yi

Cooperative Memory Expansion via OS Kernel Support for Networked Computing Systems.

Pisacha Srinuan Xu Yuan Nian-Feng Tzeng

QWEB: High-Performance Event-Driven Web Architecture With QAT Acceleration.

Jian Li Xiaokang Hu David Qian Changzheng Wei Gordon McFadden Brian Will Ping Yu Weigang Li Haibing Guan

Time-Optimal Leader Election in Population Protocols.

Yuichi Sudo Fukuhito Ooshita Taisuke Izumi Hirotsugu Kakugawa Toshimitsu Masuzawa

Comment on "A Tag Encoding Scheme Against Pollution Attack to Linear Network Coding".

Jinyong Chang Bilin Shao Yanyan Ji Genqing Bian

Towards Usable Cloud Storage Auditing.

Fei Chen Fengming Meng Tao Xiang Hua Dai Jianqiang Li Jing Qin

Generalized Cost-Based Job Scheduling in Very Large Heterogeneous Cluster Systems.

Wasiur R. KhudaBukhsh Sounak Kar Bastian Alt Amr Rizk Heinz Koeppl

Correlation of Performance Optimizations and Energy Consumption for Stencil-Based Application on Intel Xeon Scalable Processors.

Lukasz Szustak Roman Wyrzykowski Tomasz Olas Valeria Mele

MEMPHA: Model of Exascale Message-Passing Programs on Heterogeneous Architectures.

Sina Zangbari Koohi Nor Asilah Wati Abdul Hamid Mohamed Othman Gafurjan I. Ibragimov

Errata to "On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation".

Qiong Chen Zimu Zheng Chuang Hu Dan Wang Fangming Liu

Phase-Aware Cache Partitioning to Target Both Turnaround Time and System Performance.

Lucia Pons Julio Sahuquillo Vicent Selfa Salvador Petit Julio Pons

Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS.

Shuzhou Sun Rui Zhang Hui Ma

Towards Fair and Privacy-Preserving Federated Deep Models.

Lingjuan Lyu Jiangshan Yu Karthik Nandakumar Yitong Li Xingjun Ma Jiong Jin Han Yu Kee Siong Ng

High Performance Simulation of Spiking Neural Network on GPGPUs.

Peng Qu Youhui Zhang Xiang Fei Weimin Zheng

Efficient SSD Cache for Cloud Block Storage via Leveraging Block Reuse Distances.

Ke Zhou Yu Zhang Ping Huang Hua Wang Yongguang Ji Bin Cheng Ying Liu

Abstraction Layer For Standardizing APIs of Task-Based Engines.

Rabab Alomairy Hatem Ltaief Mustafa Abduljabbar David E. Keyes


Volume 31, Number 10, October 2020
Endpoint-Flexible Coflow Scheduling Across Geo-Distributed Datacenters.

Wenxin Li Xu Yuan Keqiu Li Heng Qi Xiaobo Zhou Renhai Xu

Reconciling Time Slice Conflicts of Virtual Machines With Dual Time Slice for Clouds.

Taeklim Kim Chang Hyun Park Jaehyuk Huh Jeongseob Ahn

Data-Driven Derivation of an Analytic Model for Parallel Servers With Job Replication.

Noor Bajunaid Daniel A. Menascé

A Dynamic Multi-Objective Approach for Dynamic Load Balancing in Heterogeneous Systems.

Alberto Cabrera Pérez Alejandro Acosta Francisco Almeida Vicente Blanco Pérez

An Optimal Locality-Aware Task Scheduling Algorithm Based on Bipartite Graph Modelling for Spark Applications.

Zhongming Fu Zhuo Tang Li Yang Chubo Liu

RMWPaxos: Fault-Tolerant In-Place Consensus Sequences.

Jan Skrzypczak Florian Schintke Thorsten Schütt

An Integrated Indexing and Search Service for Distributed File Systems.

Hyogi Sim Awais Khan Sudharshan S. Vazhkudai Seung-Hwan Lim Ali Raza Butt Youngjae Kim

Fast and Accurate Traffic Measurement With Hierarchical Filtering.

Haibo Wang Hongli Xu Liusheng Huang Yutong Zhai

A Ubiquitous Machine Learning Accelerator With Automatic Parallelization on FPGA.

Chao Wang Lei Gong Xi Li Xuehai Zhou

aeSpTV: An Adaptive and Efficient Framework for Sparse Tensor-Vector Product Kernel on a High-Performance Computing Platform.

Yuedan Chen Guoqing Xiao M. Tamer Özsu Chubo Liu Albert Y. Zomaya Tao Li

Cross-Rack-Aware Updates in Erasure-Coded Data Centers: Design and Evaluation.

Zhirong Shen Patrick P. C. Lee

Improving Restore Performance for In-Line Backup System Combining Deduplication and Delta Compression.

Yucheng Zhang Ye Yuan Dan Feng Chunzhi Wang Xinyun Wu Lingyu Yan Deng Pan Shuanghong Wang

Automated Fine-Grained CPU Cap Control in Serverless Computing Platform.

Young Ki Kim M. Reza HoseinyFarahabady Young Choon Lee Albert Y. Zomaya

Integrating Task Duplication in Optimal Task Scheduling With Communication Delays.

Michael Orr Oliver Sinnen

SF-Sketch: A Two-Stage Sketch for Data Streams.

Lingtong Liu Yulong Shen Yibo Yan Tong Yang Muhammad Shahzad Bin Cui Gaogang Xie

Deterministic Data Distribution for Efficient Recovery in Erasure-Coded Storage Systems.

Liangliang Xu Min Lyu Zhipeng Li Yongkun Li Yinlong Xu

Low-Cost Datacenter Load Balancing With Multipath Transport and Top-of-Rack Switches.

Enhuan Dong Xiaoming Fu Mingwei Xu Yuan Yang


Volume 31, Number 9, September 2020
Lock-Free Parallelization for Variance-Reduced Stochastic Gradient Descent on Streaming Data.

Yaqiong Peng Zhiyu Hao Xiaochun Yun

Towards Higher Performance and Robust Compilation for CGRA Modulo Scheduling.

Zhongyuan Zhao Weiguang Sheng Qin Wang Wenzhi Yin Pengfei Ye Jinchao Li Zhigang Mao

Boosting the Performance of SSDs via Fully Exploiting the Plane Level Parallelism.

Congming Gao Liang Shi Kai Liu Chun Jason Xue Jun Yang Youtao Zhang

The Workflow Trace Archive: Open-Access Data From Public and Private Computing Infrastructures.

Laurens Versluis Roland Mathá Sacheendra Talluri Tim Hegeman Radu Prodan Ewa Deelman Alexandru Iosup

Minority Disk Failure Prediction Based on Transfer Learning in Large Data Centers of Heterogeneous Disk Systems.

Ji Zhang Ke Zhou Ping Huang Xubin He Ming Xie Bin Cheng Yongguang Ji Yinhu Wang

Heterogeneous Edge Offloading With Incomplete Information: A Minority Game Approach.

Miao Hu Zixuan Xie Di Wu Yipeng Zhou Xu Chen Liang Xiao

CURE: A High-Performance, Low-Power, and Reliable Network-on-Chip Design Using Reinforcement Learning.

Ke Wang Ahmed Louri

SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs.

Kaiwei Li Jianfei Chen Wenguang Chen Jun Zhu

Energy-Efficient Parallel Real-Time Scheduling on Clustered Multi-Core.

Ashikahmed Bhuiyan Di Liu Aamir Khan Abusayeed Saifullah Nan Guan Zhishan Guo

Customizable Scale-Out Key-Value Stores.

Ali Anwar Yue Cheng Hai Huang Jingoo Han Hyogi Sim Dongyoon Lee Fred Douglis Ali Raza Butt

Safety Enhancement for Real-Time Parallel Applications in Distributed Automotive Embedded Systems: A Stable Stopping Approach.

Guoqi Xie Gang Zeng Renfa Li

Efficient Algorithms for Delay-Aware NFV-Enabled Multicasting in Mobile Edge Clouds With Resource Sharing.

Haozhe Ren Zichuan Xu Weifa Liang Qiufen Xia Pan Zhou Omer F. Rana Alex Galis Guowei Wu

An Event-Driven Approach to Serverless Seismic Imaging in the Cloud.

Philipp A. Witte Mathias Louboutin Henryk Modzelewski Charles Jones James Selvage Felix J. Herrmann

The Design of Fast Content-Defined Chunking for Data Deduplication Based Storage Systems.

Wen Xia Xiangyu Zou Hong Jiang Yukun Zhou Chuanyi Liu Dan Feng Yu Hua Yuchong Hu Yucheng Zhang

ESetStore: An Erasure-Coded Storage System With Fast Data Recovery.

Chengjian Liu Qiang Wang Xiaowen Chu Yiu-Wing Leung Hai Liu

A Black-Box Fork-Join Latency Prediction Model for Data-Intensive Applications.

Minh Nguyen Sami Alesawi Ning Li Hao Che Hong Jiang


Volume 31, Number 8, August 2020
Bandwidth-Aware Dynamic Prefetch Configuration for IBM POWER8.

Carlos Navarro Josué Feliu Salvador Petit María Engracia Gómez Julio Sahuquillo

Location-Aware and Budget-Constrained Service Deployment for Composite Applications in Multi-Cloud Environment.

Tao Shi Hui Ma Gang Chen Sven Hartmann

Fireplug: Efficient and Robust Geo-Replication of Graph Databases.

Ray Neiheiser Luciana Rech Manuel Bravo Luís E. T. Rodrigues Miguel Correia

Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs.

Zhihao Li Haipeng Jia Yunquan Zhang Tun Chen Liang Yuan Richard W. Vuduc

T-BASIR: Finding Shutdown Bugs for Cloud-Based Applications in Cloud Spot Markets.

Abdullah Alourani Ajay D. Kshemkalyani Mark Grechanik

Accelerating Stochastic Gradient Descent Based Matrix Factorization on FPGA.

Shijie Zhou Rajgopal Kannan Viktor K. Prasanna

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures.

Peng Zhang Jianbin Fang Canqun Yang Chun Huang Tao Tang Zheng Wang

Analyzing the Performance Trade-Off in Implementing User-Level Threads.

Shintaro Iwasaki Abdelhalim Amer Kenjiro Taura Pavan Balaji

Evaluation of Stream Processing Frameworks.

Giselle van Dongen Dirk Van den Poel

FULL-KV: Flexible and Ultra-Low-Latency In-Memory Key-Value Store System Design on CPU-FPGA.

Yunhui Qiu Jinyu Xie Hankun Lv Wenbo Yin Wai-Shing Luk Lingli Wang Bowei Yu Hua Chen Xianjun Ge Zhijian Liao Xiaozhong Shi

Probabilistic Consistency Guarantee in Partial Quorum-Based Data Store.

Xin Yao Cho-Li Wang

Local-Density Subspace Distributed Clustering for High-Dimensional Data.

Yangli-ao Geng Qingyong Li Mingfei Liang Chong-Yung Chi Juan Tan Heng Huang

Power Guarantee for Electric Systems Using Real-Time Scheduling.

Eugene Kim Youngmoon Lee Liang He Kang G. Shin Jinkyu Lee

A Hybrid Update Strategy for I/O-Efficient Out-of-Core Graph Processing.

Xianghao Xu Fang Wang Hong Jiang Yongli Cheng Dan Feng Yongxuan Zhang

Accelerating Federated Learning via Momentum Gradient Descent.

Wei Liu Li Chen Yunfei Chen Wenyi Zhang

Crocus: Enabling Computing Resource Orchestration for Inline Cluster-Wide Deduplication on Scalable Storage Systems.

Prince Hamandawana Awais Khan Chang-Gyu Lee Sungyong Park Youngjae Kim


Volume 31, Number 7, July 2020
High Performance GPU Tensor Completion With Tubal-Sampling Pattern.

Tao Zhang Xiao-Yang Liu Xiaodong Wang

Cost-Aware Partitioning for Efficient Large Graph Processing in Geo-Distributed Datacenters.

Amelie Chi Zhou Bingkun Shen Yao Xiao Shadi Ibrahim Bingsheng He

Exact Distributed Load Centrality Computation: Algorithms, Convergence, and Applications to Distance Vector Routing.

Leonardo Maccari Lorenzo Ghiro Alessio Guerrieri Alberto Montresor Renato Lo Cigno

Replica Exchange MCMC Hardware With Automatic Temperature Selection and Parallel Trial.

Keivan Dabiri Mehrdad Malekmohammadi Ali Sheikholeslami Hirotaka Tamura

Performance Optimization for Relative-Error-Bounded Lossy Compression on Scientific Data.

Xiangyu Zou Tao Lu Wen Xia Xuan Wang Weizhe Zhang Haijun Zhang Sheng Di Dingwen Tao Franck Cappello

Improving Restore Performance of Packed Datasets in Deduplication Systems via Reducing Persistent Fragmented Chunks.

Yucheng Zhang Min Fu Xinyun Wu Fang Wang Qiang Wang Chunzhi Wang Xinhua Dong Hongmu Han

Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture.

Mingzhen Li Yi Liu Hailong Yang Zhongzhi Luan Lin Gan Guangwen Yang Depei Qian

Compression Ratio Modeling and Estimation across Error Bounds for Lossy Compression.

Jinzhen Wang Tong Liu Qing Liu Xubin He Huizhang Luo Weiming He

Combinatorial Auctions for Temperature-Constrained Resource Management in Manycores.

Heba Khdr Muhammad Shafique Santiago Pagani Andreas Herkersdorf Jörg Henkel

Distributed Graph Computation Meets Machine Learning.

Wencong Xiao Jilong Xue Youshan Miao Zhen Li Cheng Chen Ming Wu Wei Li Lidong Zhou

Scalable and Adaptive Data Replica Placement for Geo-Distributed Cloud Storages.

Kaiyang Liu Jun Peng Jingrong Wang Weirong Liu Zhiwu Huang Jianping Pan

Simplified Workflow Simulation on Clouds based on Computation and Communication Noisiness.

Roland Mathá Sasko Ristov Thomas Fahringer Radu Prodan

Reliability-Aware Network Service Provisioning in Mobile Edge-Cloud Networks.

Jing Li Weifa Liang Meitian Huang Xiaohua Jia

Partitioning Tree-Shaped Task Graphs for Distributed Platforms With Limited Memory.

Changjiang Gou Anne Benoit Loris Marchal

Modeling Analysis and Cost-Performance Ratio Optimization of Virtual Machine Scheduling in Cloud Computing.

Bo Wan Jiale Dang Zhetao Li Hongfang Gong Feng Zhang Sangyoon Oh

Performance-Aware Speculative Resource Oversubscription for Large-Scale Clusters.

Renyu Yang Chunming Hu Xiaoyang Sun Peter Garraghan Tianyu Wo Zhenyu Wen Hao Peng Jie Xu Chao Li

T-Caching: Enhancing Feasibility of In-Network Caching in ICN.

Sugi Lee Ikjun Yeom Dohyung Kim


Volume 31, Number 6, June 2020
Efficient Compute-Intensive Job Allocation in Data Centers via Deep Reinforcement Learning.

Deliang Yi Xin Zhou Yonggang Wen Rui Tan

Reduce Operations: Send Volume Balancing While Minimizing Latency.

M. Ozan Karsavuran Seher Acer Cevdet Aykanat

Errata to "Exploring Fault-Tolerant Erasure Codes for Scalable All-Flash Array Clusters".

Sungjoon Koh Jie Zhang Miryeong Kwon Jungyeon Yoon David Donofrio Nam Sung Kim Myoungsoo Jung

P-PFC: Reducing Tail Latency with Predictive PFC in Lossless Data Center Networks.

Chen Tian Bo Li Liulan Qin Jiaqi Zheng Jie Yang Wei Wang Guihai Chen Wanchun Dou

An Approximate Communication Framework for Network-on-Chips.

Yuechen Chen Ahmed Louri

A Value-Oriented Job Scheduling Approach for Power-Constrained and Oversubscribed HPC Systems.

Nirmal Kumbhare Aniruddha Marathe Ali Akoglu Howard Jay Siegel Ghaleb Abdulla Salim Hariri

Towards Distributed SDN: Mobility Management and Flow Scheduling in Software Defined Urban IoT.

Di Wu Xiang Nie Eskindir Asmare Dmitri I. Arkhipov Zhijing Qin Renfa Li Julie A. McCann Keqin Li

RIVA: Robust Integrity Verification Algorithm for High-Speed File Transfers.

Batyr Charyyev Engin Arslan

Turbo: Dynamic and Decentralized Global Analytics via Machine Learning.

Hao Wang Di Niu Baochun Li

On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation.

Qiong Chen Zimu Zheng Chuang Hu Dan Wang Fangming Liu

Performance Analysis of Trial and Error Algorithms.

Jérôme Gaveau Christophe J. Le Martret Mohamad Assaad

ERA-LSTM: An Efficient ReRAM-Based Architecture for Long Short-Term Memory.

Jianhui Han He Liu Mingyu Wang Zhaolin Li Youhui Zhang

SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions.

Naoki Shibata Francesco Petrogalli

Concurrent Irrevocability in Best-Effort Hardware Transactional Memory.

J. Rubén Titos Gil Ricardo Fernández Pascual Alberto Ros Manuel E. Acacio

Faster Parallel Core Maintenance Algorithms in Dynamic Graphs.

Qiang-Sheng Hua Yuliang Shi Dongxiao Yu Hai Jin Jiguo Yu Zhipeng Cai Xiuzhen Cheng Hanhua Chen

Online Deadline-Aware Task Dispatching and Scheduling in Edge Computing.

Jiaying Meng Haisheng Tan Xiang-Yang Li Zhenhua Han Bojie Li

NVGraph: Enforcing Crash Consistency of Evolving Network Analytics in NVMM Systems.

Soklong Lim Tyler Coy Zaixin Lu Bin Ren Xuechen Zhang

GRP-HEFT: A Budget-Constrained Resource Provisioning Scheme for Workflow Scheduling in IaaS Clouds.

Hamid Reza Faragardi Mohammad Reza Saleh Sedghpour Saber Fazliahmadi Thomas Fahringer Nayereh Rasouli


Volume 31, Number 5, May 2020
Approximate NoC and Memory Controller Architectures for GPGPU Accelerators.

Venkata Yaswanth Raparti Sudeep Pasricha

Exploring Token-Oriented In-Network Prioritization in Datacenter Networks.

Kexin Liu Bingchuan Tian Chen Tian Bo Li Qingyue Wang Jiaqi Zheng Jiajun Sun Yixiao Gao Wei Wang Guihai Chen Wanchun Dou Yanan Jiang Huaping Zhou Jingjie Jiang Fan Zhang Gong Zhang

gMig: Efficient vGPU Live Migration with Overlapped Software-Based Dirty Page Verification.

Qiumin Lu Xiao Zheng Jiacheng Ma Yaozu Dong Zhengwei Qi Jianguo Yao Bingsheng He Haibing Guan

Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer.

Yongmin Hu Hailong Yang Zhongzhi Luan Lin Gan Guangwen Yang Depei Qian

Decentralized Utility- and Locality-Aware Replication for Heterogeneous DHT-Based P2P Cloud Storage Systems.

Yahya Hassanzadeh-Nazarabadi Alptekin Küpçü Öznur Özkasap

Task Scheduling for Energy Consumption Constrained Parallel Applications on Heterogeneous Computing Systems.

Zhe Quan Zhi-Jie Wang Ting Ye Song Guo

Reducing the Impact of Intensive Dynamic Memory Allocations in Parallel Multi-Threaded Programs.

Daniel Langr Martin Kocicka

REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing.

Won Seob Jeong Changmin Lee Keunsoo Kim Myung Kuk Yoon Won Jeon Myoungsoo Jung Won Woo Ro

Thread-Level Locking for SIMT Architectures.

Lan Gao Yunlong Xu Rui Wang Zhongzhi Luan Zhibin Yu Depei Qian

Architectural Support for NVRAM Persistence in GPUs.

Sui Chen Lei Liu Weihua Zhang Lu Peng

Efficient Performance Estimation and Work-Group Size Pruning for OpenCL Kernels on GPUs.

Xiebing Wang Xuehai Qian Alois C. Knoll Kai Huang

Customer Perceived Value- and Risk-Aware Multiserver Configuration for Profit Maximization.

Tian Wang Junlong Zhou Gongxuan Zhang Tongquan Wei Shiyan Hu

WEED-MC: Wavelet Transform for Energy Efficient Data Gathering and Matrix Completion.

Vishal Krishna Singh Bhoomika Nathani Manish Kumar

Proofs of Physical Reliability for Cloud Storage Systems.

Li Li Loukas Lazos

A General Design for a Scalable MPI-GPU Multi-Resolution 2D Numerical Solver.

Massimiliano Turchetto Alessandro Dal Palù Renato Vacondio

Making Application-Level Crash Consistency Practical on Flash Storage.

Donghyun Kang Changwoo Min Sang Won Lee Young Ik Eom

Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer.

Teng Yu Wenlai Zhao Pan Liu Vladimir Janjic Xiaohan Yan Shicai Wang Haohuan Fu Guangwen Yang John Thomson


Volume 31, Number 4, April 2020
Towards Power Efficient High Performance Packet I/O.

Xuesong Li Wenxue Cheng Tong Zhang Fengyuan Ren Bailong Yang

COPA: Highly Cost-Effective Power Back-Up for Green Datacenters.

Yan Yin Junmin Wu Xu Zhou Lieven Eeckhout Amer Qouneh Tao Li Zhibin Yu

Online Placement and Scaling of Geo-Distributed Machine Learning Jobs via Volume-Discounting Brokerage.

Xiaotong Li Ruiting Zhou Lei Jiao Chuan Wu Yuhang Deng Zongpeng Li

Efficient Method for Parallel Computation of Geodesic Transformation on CPU.

Danijel Zlaus Domen Mongus

Towards Accurate Prediction for High-Dimensional and Highly-Variable Cloud Workloads with Deep Learning.

Zheyi Chen Jia Hu Geyong Min Albert Y. Zomaya Tarek A. El-Ghazawi

Energy-Aware Application Placement in Mobile Edge Computing: A Stochastic Optimization Approach.

Hossein Badri Tayebeh Bahreini Daniel Grosu Kai Yang

HRHS: A High-Performance Real-Time Hardware Scheduler.

Danesh Derafshi Amin Norollah Mohsen Khosroanjam Hakem Beitollahi

Combining Size-Based Load Balancing with Round-Robin for Scalable Low Latency.

Jonatha Anselmi

Reliability Aware Energy Optimized Scheduling of Non-Preemptive Periodic Real-Time Tasks on Heterogeneous Multiprocessor System.

Niraj Kumar Jaishree Mayank Arijit Mondal

A Novel Multi-Stage Forest-Based Key-Value Store for Holistic Performance Improvement.

Ziyi Lu Qiang Cao Fei Mei Hong Jiang Jingjun Li

gQoS: A QoS-Oriented GPU Virtualization with Adaptive Capacity Sharing.

Qiumin Lu Jianguo Yao Haibing Guan Ping Gao

A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems.

Shuibing He Zheng Li Jiang Zhou Yanlong Yin Xiaohua Xu Yong Chen Xian-He Sun

On Fault-Tolerant Bin Packing for Online Resource Allocation.

Chuanyou Li Xueyan Tang

Quantum Supremacy Circuit Simulation on Sunway TaihuLight.

Riling Li Bujiao Wu Mingsheng Ying Xiaoming Sun Guangwen Yang

Resource-Constrained Replication Strategies for Hierarchical and Heterogeneous Tasks.

Weng-Chon Ao Konstantinos Psounis

Hotspot-Aware Hybrid Memory Management for In-Memory Key-Value Stores.

Hai Jin Zhiwei Li Haikun Liu Xiaofei Liao Yu Zhang

cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs.

S. Kazem Shekofteh Hamid Noori Mahmoud Naghibzadeh Holger Fröning Hadi Sadoghi Yazdi

Power-Aware Allocation of Graph Jobs in Geo-Distributed Cloud Networks.

Seyyedali Hosseinalipour Anuj Nayak Huaiyu Dai


Volume 31, Number 3, March 2020
An Attribute-Based Availability Model for Large Scale IaaS Clouds with CARMA.

Hongwu Lv Jane Hillston Paul Piho Huiqiang Wang

Online Scheduling of Task Graphs on Heterogeneous Platforms.

Louis-Claude Canon Loris Marchal Bertrand Simon Frédéric Vivien

A High Throughput B+tree for SIMD Architectures.

Weihua Zhang Zhaofeng Yan Yuzhe Lin Chuanlei Zhao Lu Peng

Exploring New Opportunities to Defeat Low-Rate DDoS Attack in Container-Based Cloud Environment.

Zhi Li Hai Jin Deqing Zou Bin Yuan

HPPT-NoC: A Dark-Silicon Inspired Hierarchical TDM NoC with Efficient Power-Performance Trading.

Salma Hesham Diana Goehringer Mohamed A. Abd El Ghany

Random Priority-Based Thrashing Control for Distributed Shared Memory.

Yi-Wei Ci Michael R. Lyu Zhan Zhang De-Cheng Zuo Xiao-Zong Yang

A Novel Low Cost Interconnection Architecture Based on the Generalized Hypercube.

Guijuan Wang Cheng-Kuan Lin Jianxi Fan Baolei Cheng Xiaohua Jia

Enabling Encrypted Boolean Queries in Geographically Distributed Databases.

Xu Yuan Xingliang Yuan Yihe Zhang Baochun Li Cong Wang

Simultaneous Management of Peak-Power and Reliability in Heterogeneous Multicore Embedded Systems.

Mohsen Ansari Javad Saber-Latibari Mostafa Pasandideh Alireza Ejlali

Achieving Flexible Global Reconfiguration in NoCs Using Reconfigurable Rings.

Liang Wang Leibo Liu Jie Han Xiaohang Wang Shouyi Yin Shaojun Wei

cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs.

Tao Zhang Xiao-Yang Liu Xiaodong Wang Anwar Walid

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures.

Haidong Lan Jintao Meng Christian Hundt Bertil Schmidt Minwen Deng Xiaoning Wang Weiguo Liu Yu Qiao Shengzhong Feng

The Impact of Event Processing Flow on Asynchronous Server Efficiency.

Shungeng Zhang Qingyang Wang Yasuhiko Kanemasa Huasong Shan Liting Hu

Fault-Tolerant Routing Mechanism in 3D Optical Network-on-Chip Based on Node Reuse.

Pengxing Guo Weigang Hou Lei Guo Wei Sun Chuang Liu Hainan Bao Luan H. K. Duong Weichen Liu

A Comment on Privacy-Preserving Scalar Product Protocols as Proposed in "SPOC".

Thomas Schneider Amos Treiber

cuPC: CUDA-Based Parallel PC Algorithm for Causal Structure Learning on GPU.

Behrooz Zarebavani Foad Jafarinejad Matin Hashemi Saber Salehkaleybar

A Game-Theoretical Approach for User Allocation in Edge Computing Environment.

Qiang He Guangming Cui Xuyun Zhang Feifei Chen Shuiguang Deng Hai Jin Yanhui Li Yun Yang

Toward Designing Cost-Optimal Policies to Utilize IaaS Clouds with Online Learning.

Xiaohu Wu Patrick Loiseau Esa Hyytiä


Volume 31, Number 2, February 2020
The Network-Integrated Storage System.

Ibrahim Kettaneh Ahmed Alquraan Hatem Takruri Suli Yang Andrea C. Arpaci-Dusseau Remzi H. Arpaci-Dusseau Samer Al-Kiswany

PALE: Time Bounded Practical Agile Leader Election.

Bronislav Sidik Rami Puzis Polina Zilberman Yuval Elovici

Efficient and Portable Workgroup Size Tuning.

Chia-Lin Yu Shiao-Li Tsao

Achieving Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition.

Yinghao Yu Wei Wang Renfei Huang Jun Zhang Khaled Ben Letaief

Optimized Block-Based Algorithms to Label Connected Components on GPUs.

Stefano Allegretti Federico Bolelli Costantino Grana

iCELIA: A Full-Stack Framework for STT-MRAM-Based Deep Learning Acceleration.

Hao Yan Hebin R. Cherian Ethan C. Ahn Xuehai Qian Lide Duan

Throughput Maximization of NFV-Enabled Multicasting in Mobile Edge Cloud Networks.

Yu Ma Weifa Liang Jie Wu Zichuan Xu

A Highly Reliable Metadata Service for Large-Scale Distributed File Systems.

Jiang Zhou Yong Chen Weiping Wang Shuibing He Dan Meng

Thread Isolation to Improve Symbiotic Scheduling on SMT Multicore Processors.

Josué Feliu Julio Sahuquillo Salvador Petit Lieven Eeckhout

Coded Load Balancing in Cache Networks.

Mahdi Jafari Siavoshani Farzad Parvaresh Ali Pourmiri Seyed Pooya Shariatpanahi

Improving Overall Performance of TLC SSD by Exploiting Dissimilarity of Flash Pages.

Wenhui Zhang Qiang Cao Hong Jiang Jie Yao

Performance Modeling of Parallel Loops on Multi-Socket Platforms Using Queueing Systems.

Younghyun Cho Surim Oh Bernhard Egger

Energy and Task-Aware Partitioning on Single-ISA Clustered Heterogeneous Processors.

Ashraf Suyyagh Zeljko Zilic

Wiera: Policy-Driven Multi-Tiered Geo-Distributed Cloud Storage System.

Kwangsung Oh Nan Qin Abhishek Chandra Jon B. Weissman

Optimizing Geo-Distributed Data Analytics with Coordinated Task Scheduling and Routing.

Laiping Zhao Yanan Yang Ali Munir Alex X. Liu Yue Li Wenyu Qu

APMigration: Improving Performance of Hybrid Memory Performance via An Adaptive Page Migration Method.

Yujuan Tan Baiping Wang Zhichao Yan Witawas Srisa-an Xianzhang Chen Duo Liu

Pache: A Packet Management Scheme of Cache in Data Center Networks.

Tao Chen Xiaofeng Gao Tao Liao Guihai Chen

Editor's Note.

Manish Parashar


Volume 31, Number 1, January 2020
Data-Parallel Hashing Techniques for GPU Architectures.

Brenton Lessley Hank Childs

A Survey of Phase Classification Techniques for Characterizing Variable Application Behavior.

Keeley Criswell Tosiron Adegbija

WPaxos: Wide Area Network Flexible Consensus.

Ailidani Ailijiang Aleksey Charapko Murat Demirbas Tevfik Kosar

The Existence of Completely Independent Spanning Trees for Some Compound Graphs.

Xiao-Wen Qin Rong-Xia Hao Jou-Ming Chang

Single Restart with Time Stamps for Parallel Task Processing with Known and Unknown Processors.

Jaya Prakash Champati Ben Liang

Scheduling Parallel Real-Time Tasks on the Minimum Number of Processors.

Hyeonjoong Cho Chulgoo Kim Joohyung Sun Arvind Easwaran Ju-Derk Park Byeong-Cheol Choi

Quantum Game Analysis on Extrinsic Incentive Mechanisms for P2P Services.

Shengling Wang Weiman Sun Liran Ma Weifeng Lv Xiuzhen Cheng

Minimizing Tardiness for Data-Intensive Applications in Heterogeneous Systems: A Matching Theory Perspective.

Ke Xu Liang Lv Tong Li Meng Shen Haiyang Wang Kun Yang

GA-Par: Dependable Microservice Orchestration Framework for Geo-Distributed Clouds.

Zhenyu Wen Tao Lin Renyu Yang Shouling Ji Rajiv Ranjan Alexander B. Romanovsky Chang-Ting Lin Jie Xu

Exploiting Parallelism and Vectorisation in Breadth-First Search for the Intel Xeon Phi.

Mireya Paredes Graham D. Riley Mikel Luján

Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect.

Ang Li Shuaiwen Leon Song Jieyang Chen Jiajia Li Xu Liu Nathan R. Tallent Kevin J. Barker

Enabling Runtime SpMV Format Selection through an Overhead Conscious Method.

Weijie Zhou Yue Zhao Xipeng Shen Wang Chen

EEPC: A Framework for Energy-Efficient Parallel Control of Connected Cars.

Minghua Shen Guojie Luo Nong Xiao

Designing Energy-Efficient MPSoC with Untrustworthy 3PIP Cores.

Yidan Sun Guiyuan Jiang Siew-Kei Lam Fangxin Ning

Deep Learning Research and Development Platform: Characterizing and Scheduling with QoS Guarantees on GPU Clusters.

Zhaoyun Chen Wei Quan Mei Wen Jianbin Fang Jie Yu Chunyuan Zhang Lei Luo

Adaptive Alert Management for Balancing Optimal Performance among Distributed CSOCs using Reinforcement Learning.

Ankit Shah Rajesh Ganesan Sushil Jajodia Pierangela Samarati Hasan Cam

A Truthful and Efficient Incentive Mechanism for Demand Response in Green Datacenters.

Zhi Zhou Fangming Liu Shutong Chen Zongpeng Li