Junli Gu
Tesla Autopilot
Verified email at tesla.com
Title
Cited by
Year
Heterogeneous system coherence for integrated CPU-GPU systems
J Power, A Basu, J Gu, S Puthoor, BM Beckmann, MD Hill, SK Reinhardt, ...
Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013
Cited by: 200, Year: 2013
PPEP: Online performance, power, and energy prediction framework and DVFS space exploration
B Su, J Gu, L Shen, W Huang, JL Greathouse, Z Wang
2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 445-457, 2014
Cited by: 111, Year: 2014
WADE: Writeback-aware dynamic cache management for NVM-based main memory system
Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez
ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013
Cited by: 58, Year: 2013
Implementing a leading loads performance predictor on commodity processors
B Su, JL Greathouse, J Gu, M Boyer, L Shen, Z Wang
2014 USENIX Annual Technical Conference (USENIX ATC 14), 2014
Cited by: 46, Year: 2014
OpenCL Caffe: Accelerating and enabling a cross platform machine learning framework
J Gu, Y Liu, Y Gao, M Zhu
Proceedings of the 4th International Workshop on OpenCL, 1-5, 2016
Cited by: 38, Year: 2016
Implementation and evaluation of deep neural networks (DNN) on mainstream heterogeneous systems
J Gu, M Zhu, Z Zhou, F Zhang, Z Lin, Q Zhang, M Breternitz
Proceedings of 5th Asia-Pacific Workshop on Systems, 1-7, 2014
Cited by: 34, Year: 2014
NAIS: Neural architecture and implementation search and its applications in autonomous driving
C Hao, Y Chen, X Liu, A Sarwari, D Sew, A Dhar, B Wu, D Fu, J Xiong, ...
2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2019
Cited by: 24, Year: 2019
A hybrid GPU + FPGA system design for autonomous driving cars
C Hao, A Sarwari, Z Jin, H Abu-Haimed, D Sew, Y Li, X Liu, B Wu, D Fu, ...
2019 IEEE International Workshop on Signal Processing Systems (SiPS), 121-126, 2019
Cited by: 19, Year: 2019
Methods and apparatus related to data processors and caches incorporated in data processors
Z Wang, X Yuan, J Gu, Y Xu, SC Shan, S Mu, T Cao
US Patent 9,317,448, 2016
Cited by: 13, Year: 2016
Self-supervised learning of depth and ego-motion with differentiable bundle adjustment
Y Shi, J Zhu, Y Fang, K Lien, J Gu
arXiv preprint arXiv:1909.13163, 2019
Cited by: 11, Year: 2019
Moving data between caches in a heterogeneous processor system
J Gu, BM Beckmann, Y Xie
US Patent 9,652,390, 2017
Cited by: 10, Year: 2017
Optimizing a parallel video encoder with message passing and a shared memory architecture
J Gu, Y Sun
Tsinghua Science and Technology 16 (4), 393-398, 2011
Cited by: 9, Year: 2011
Structure-attentioned memory network for monocular depth estimation
J Zhu, Y Shi, M Ren, Y Fang, KC Lien, J Gu
arXiv preprint arXiv:1909.04594, 2019
Cited by: 5, Year: 2019
MOPED: Orchestrating interprocess message data on CMPs
J Gu, SS Lumetta, R Kumar, Y Sun
2011 IEEE 17th International Symposium on High Performance Computer …, 2011
Cited by: 4, Year: 2011
Enhancing lifetime of non-volatile cache by reducing intra-block write variation
Z Wang, Y Xie, Y Xu, J Gu, T Cao
US Patent 9,767,043, 2017
Cited by: 2, Year: 2017
iCHAT: inter-cache hardware-assistant data transfer for heterogeneous chip multiprocessors
J Gu, BM Beckmann, T Cao, Y Hu
2014 9th IEEE International Conference on Networking, Architecture, and …, 2014
Cited by: 2, Year: 2014
Accelerating data movement on future chip multi-processors
J Gu, R Kumar, SS Lumetta, Y Sun
Proceedings of the Second International Forum on Next-Generation Multicore …, 2010
Cited by: 2, Year: 2010
MOPED: Accelerating data communication on future CMPs
J Gu, Y Sun, SS Lumetta, R Kumar
IEEE Micro 31 (4), 42-50, 2011
Cited by: 1, Year: 2011
Enhancing lifetime of non-volatile cache by injecting random replacement policy
Z Wang, Y Xie, Y Xu, J Gu, T Cao
US Patent 9,792,228, 2017
Year: 2017
Thermal-aware compiler for parallel instruction execution in processors
Y Xie, J Gu
US Patent 9,639,359, 2017
Year: 2017