Publications

SympleGraph: distributed graph processing with precise loop-carried dependency guarantee
Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)

Prague: High-Performance Heterogeneity-Aware Asynchronous Decentralized Training
Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)

AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerators
2020 IEEE International Symposium on High Performance Computer Architecture (HPCA)

HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
2019 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Hop: Heterogeneity-aware Decentralized Training
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)

GraphQ: Scalable PIM-Based Graph Processing
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)

GraphR: Accelerating Graph Processing Using ReRAM
2018 IEEE International Symposium on High Performance Computer Architecture (HPCA)

CSE: parallel finite state machines with convergence set enumeration
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)