NPC’18 Proceedings and Program

Thursday, November 29th

10:10-10:25 Coffee Break | Room: N404

10:25-10:50	Lianke Qin, Yifan Gong, Tianqi Tang, Yutian Wang and Jiangming Jin Training Deep Nets with Progressive Batch Normalization on multi-GPUs \|
10:50-11:15	Yong Yu, Tian Zhi, Xuda Zhou, Shaoli Liu, Yunji Chen and Shuyao Cheng BSHIFT: A Low Cost Deep Neural Networks Accelerator \|
11:15-11:40	Jiacheng Zhao, Yisong Chang, Denghui Li, Chunwei Xia, Huimin Cui, Ke Zhang and Xiaobing Feng On Retargeting the AI Programming Framework to New Hardwares \|
11:40-12:05	Dong Han, Shengyuan Zhou, Tian Zhi, Shaoli Liu and Baiyi Wang Float-Fix: An Efficient and Hardware-Friendly Data Type for Deep Neural Network \|
12:05-12:30	Junyu Li, Ligang He and Shenyuan Ren Data Fine-pruning: A Simple Way to Accelerate Neural Network Training \|

12:30-14:00 Lunch Break

14:00-14:25	Yanqi Wang, Qi Zhang, Yi Liu and Depei Qian HPC-SFI: System-level Fault Injection for High Performance Computing Systems \|
14:25-14:50	Wenke Li, Xuanhua Shi, Hong Huang, Peng Zhao, Hai Jin, Dong Dai and Yong Chen GRAM: A GPU-based Property Graph Traversal and Query for HPC Rich Metadata Management \|
14:50-15:15	Huihui Zou, Shanjiang Tang, Ce Yu, Hao Fu and Yusen Li ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture \|
15:15-15:40	Leonard Poon GPU-Accelerated Clique Tree Propagation for Pouch Latent Tree Models \|

15:40-16:00 Coffee Break | Room: N404

16:00-16:10	Peng Jiang and Ligang He vGrouper: Optimizing the Performance of Parallel Jobs in Xen by Increasing Synchronous Execution of Virtual Machines \|
16:10-16:20	Shiqing Zhang, Yaohua Yang, Li Shen and Zhiying Wang GPU Memory Management Solution Supporting Incomplete Pages \|
16:20-16:30	Mingyi Zhu, Kejiang Ye and Cheng-Zhong Xu A Deep Learning Approach for Network Anomaly Detection based on AMF-LSTM \|
16:30-16:40	Xiao Zhang, Lanxin Kong, Shunyi Zhu, Xiaonan Zhao and Zhanhuai Li FSObserver: A Performance Measurement and Monitoring Tool for Distributed Storage Systems \|
16:40-16:50	Zhijie Yang, Lei Wang, Dong Ding, Xiangyu Zhang, Yu Deng, Shiming Li and Qiang Dou Systolic Array Based Accelerator and Algorithm Mapping for Deep Learning Algorithms \|

16:50-17:00	Yi Liu, Yunchun Li, Honggang Zhou, Hailong Yang and Wei Li A Fine-grained Performance Bottleneck Analysis Method for HDFS \|
17:00-17:10	Nan Hu, Zhiguang Chen, Yunfei Du and Yutong Lu Mimir+: An Optimized Framework of MapReduce on Heterogeneous High-Performance Computing System \|
17:10-17:20	Huiying Lan and Zidong Du DLIR: An Intermediate Representation for Deep Learning Processors \|
17:20-17:30	Xiao Zhang, Huiying Lan and Qi Guo Leveraging Subgraph Extraction for Performance Portable Programming Frameworks on DL Accelerators \|
17:30-17:40	Wenli Zhang Labeled Network Stack: A Co-Designed Stack for Low Tail-Latency and High Concurrency in Datacenter Services \|
17:40-17:50	Jinjing Zhao and Rong Jin Balancing the QOS and Security in Dijkstra Algorithm by SDN Technology \|

18:30 Reception

Friday, November 30th

10:00-10:15 Coffee Break | Room: N302

10:15-10:40	Cheng Pan, Lan Zhou, Yingwei Luo, Xiaolin Wang and Zhenlin Wang Lightweight and Accurate Memory Allocation in Key-value Cache \|
10:40-11:05	Haobo Wang, Yinliang Yue, Shuibing He and Weiping Wang KT-Store: A Key-Order and Write-Order Hybrid Key-Value Store with High Write and Range-query Performance \|
11:05-11:30	Yangyang Wang, Yunpeng Chai and Xin Wang ALOR: Adaptive Layout Optimization of Raft Groups for Heterogeneous Distributed Key-Value Stores \|
11:20-11:55	Bo Wang, Jie Tang, Rui Zhang and Wei Ding A Dependency-Aware Storage Schema Selection Mechanism for In-Memory Big Data Computing Frameworks \|
11:55-12:20	Heyang Xu, Yang Liu and Wei Wei Migration Cost and Energy-aware Virtual Machine Consolidation under Cloud Environments Considering Remaining Runtime \|

12:20-14:00 Lunch Break

14:00 Local Tour

18:30 Banquet

Saturday, December 1st

09:00-09:25	Jiang Xiao, Huichuwu Li, He Li and Hai Jin CNLoc: Channel State Information Assisted Indoor WLAN Localization Using Nomadic Access Points \|
09:25-09:50	Kang Jin, Cunlu Li, Dezun Dong and Binzhang Fu HARE: History-Aware Adaptive Routing Algorithm for Endpoint Congestion in Networks-on-Chip \|
09:50-10:15	Mingfan Li, Ke Wen and Hong An Improving the performance of distributed MXNet with RDMA \|
10:15-10:40	Chengchun Liu, Zhang Yang, Limin Xiao, Baicheng Yan, Zhihao Wang and Hongyun Tian An Efficient Method for Determing Full Point-to-point Latency Of Arbitrary Indirect HPC Networks \|

10:40-10:55 Coffee Break | Room: J205

10:55-11:20	Peng Zhao, Lei Liu, Amal Cao, Xiao Dong, Jiansong Li and Xiaobing Feng ElasticActor: An Actor System with Automatic Granularity Adjustment \|
11:20-11:45	Junhong Liu, Xin He, Weifeng Liu and Guangming Tan Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication \|
11:45-12:10	Donglin Chen, Jianbin Fang, Shizhao Chen, Chuanfu Xu and Zheng Wang Optimizing Sparse Matrix-Vector Multiplications on An ARMv8-based Many-Core Architecture \|
12:10-12:35	Yanzhen Gao, Jing Xing, Zheng Wei, Jie Ma, Xiaozhen Bao and Peiheng Zhang STrieGD: A Sampling Trie Indexed Compression Algorithm for Large Scale Gene Data \|