Abstract |
Graphs are an important abstraction used in many scientific fields. With the magnitude of graph-structured data constantly increasing, effective data analytics requires efficient and scalable graph processing systems. Although HPC systems have long been used for scientific computing, people have only recently started to assess their potential for graph processing, a workload with inherent load imbalance, lack of locality, and access irregularity. We propose ShenTu, the first general- purpose graph processing framework that can efficiently utilize an entire Petascale system to process multi-trillion edge graphs in seconds. ShenTu embodies four key innovations: hardware spe- cialization, supernode routing, on-chip sorting, and degree-aware messaging, which together enable its unprecedented performance and scalability. It can traverse a record-size 70-trillion-edge graph in seconds. ShenTu enables the processing of internet scale web graphs and shows potential for brain simulation. |
Short Bio |
Wenguang Chen is a professor in Department of Computer Science and Technology, Tsinghua University. His research interest is in parallel and distributed systems and programming systems. He received the Bachelor’s and Ph.D. degrees in computer science from Tsinghua University in 1995 and 2000 respectively. Before joining Tsinghua in 2003, he was the CTO of Opportunity International Inc. He was appointed as the associate head of Department of Computer Science and Technology from 2007 to 2014.
He has published over 50 papers in international conferences and journals like OSDI, PLDI, SC, ICS, PPoPP, EuroSys, USENIX ATC, OOPSLA, VLDB and ICSE. He is a distinguished member and distinguished speaker of CCF( China Computer Foundation), an ACM member and a Co-Chair of ACM China Council, and an IEEE member.
His recent research focus on graph processing, and proposed a series of graph processing systems, such as GridGraph(single machine out-of-core), Gemini(distributed memory) and Shentu(supercomputer), with his collaborators to provide fast and efficient graph systems on various platforms.
|