Cheng Li
Cheng Li
PhD Candidate in Computer Science, University of Illinois Urbana-Champaign
Verified email at illinois.edu - Homepage
Title
Cited by
Cited by
Year
Stochastic circuits for real-time image-processing applications
A Alaghi, C Li, JP Hayes
Proceedings of the 50th Annual Design Automation Conference, 1-6, 2013
1932013
Sirius: An open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers
J Hauswald, MA Laurenzano, Y Zhang, C Li, A Rovinski, A Khurana, ...
Proceedings of the Twentieth International Conference on Architectural …, 2015
1832015
DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers
J Hauswald, Y Kang, MA Laurenzano, Q Chen, C Li, T Mudge, ...
2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture …, 2015
1282015
KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism
I El Hajj, J Gómez-Luna, C Li, LW Chang, D Milojicic, W Hwu
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
222016
Designing future warehouse-scale computers for Sirius, an end-to-end voice and vision personal assistant
J Hauswald, MA Laurenzano, Y Zhang, H Yang, Y Kang, C Li, A Rovinski, ...
ACM Transactions on Computer Systems (TOCS) 34 (1), 1-32, 2016
102016
Accelerating reduction and scan using tensor core units
A Dakkak, C Li, J Xiong, I Gelado, W Hwu
Proceedings of the ACM International Conference on Supercomputing, 46-57, 2019
52019
Frustrated with replicating claims of a shared model? a solution
A Dakkak, C Li, J Xiong, WM Hwu
arXiv preprint arXiv:1811.09737, 2018
52018
Matrix factorization on GPUs with memory optimization and approximate computing
W Tan, S Chang, L Fong, C Li, Z Wang, LL Cao
Proceedings of the 47th International Conference on Parallel Processing, 1-10, 2018
52018
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function-as-a-Service
A Dakkak, C Li, SG De Gonzalo, J Xiong, W Hwu
2019 IEEE 12th International Conference on Cloud Computing (CLOUD), 372-382, 2019
42019
MLModelScope: Evaluate and Measure ML Models within AI Pipelines
A Dakkak, C Li, A Srivastava, J Xiong, WM Hwu
arXiv preprint arXiv:1811.09737, 2018
42018
RAI: a scalable project submission system for parallel programming courses
A Dakkak, C Pearson, C Li, W Hwu
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
42017
Sirius implications for future warehouse-scale computers
J Hauswald, MA Laurenzano, Y Zhang, C Li, A Rovinski, A Khurana, ...
IEEE Micro 36 (3), 42-53, 2016
32016
Challenges and Pitfalls of Reproducing Machine Learning Artifacts
C Li, A Dakkak, J Xiong, W Hwu
arXiv preprint arXiv:1904.12437, 2019
22019
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects
C Pearson, A Dakkak, S Hashash, C Li, IH Chung, J Xiong, WM Hwu
Proceedings of the 2019 ACM/SPEC International Conference on Performance …, 2019
22019
The Design and Implementation of a Scalable DL Benchmarking Platform
C Li, A Dakkak, J Xiong, W Hwu
arXiv preprint arXiv:1911.08031, 2019
12019
Benanza: Automatic uBenchmark Generation to Compute" Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs
C Li, A Dakkak, J Xiong, W Hwu
arXiv preprint arXiv:1911.06922, 2019
12019
AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers
W Zhang, W Wei, L Xu, L Jin, C Li
arXiv preprint arXiv:1909.10562, 2019
12019
Across-Stack Profiling and Characterization of Machine Learning Models on GPUs
C Li, A Dakkak, J Xiong, W Wei, L Xu, W Hwu
arXiv preprint arXiv:1908.06869, 2019
12019
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
C Li, A Dakkak, J Xiong, W Hwu
Proceedings of the ACM/SPEC International Conference on Performance …, 2020
2020
DLSpec: A Deep Learning Task Exchange Specification
A Dakkak, C Li, J Xiong, WM Hwu
arXiv preprint arXiv:2002.11262, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–20