Jianbin Fang
Title
Cited by
Year
A comprehensive performance comparison of CUDA and OpenCL
J Fang, AL Varbanescu, H Sips
2011 International Conference on Parallel Processing, 216-225, 2011
Cited by: 429; Year: 2011
Test-Driving Intel Xeon Phi
J Fang, H Sips, L Zhang, C Xu, C Yonggang, AL Varbanescu
The 5th ACM/SPEC International Conference on Performance Engineering, 2014
Cited by: 121; Year: 2014
Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer
C Xu, X Deng, L Zhang, J Fang, G Wang, Y Jiang, W Cao, Y Che, Y Wang, ...
Journal of Computational Physics 278, 275-297, 2014
Cited by: 84; Year: 2014
Performance gaps between OpenMP and OpenCL for multi-core CPUs
J Shen, J Fang, H Sips, AL Varbanescu
2012 41st International Conference on Parallel Processing Workshops, 116-125, 2012
Cited by: 73; Year: 2012
An empirical study of Intel Xeon Phi
J Fang, AL Varbanescu, H Sips, L Zhang, Y Che, C Xu
arXiv preprint arXiv:1310.5842, 2013
Cited by: 60; Year: 2013
Performance traps in OpenCL for CPUs
J Shen, J Fang, H Sips, AL Varbanescu
2013 21st Euromicro International Conference on Parallel, Distributed, and …, 2013
Cited by: 57; Year: 2013
An application-centric evaluation of OpenCL on multi-core CPUs
J Shen, J Fang, H Sips, AL Varbanescu
Parallel Computing 39 (12), 834-850, 2013
Cited by: 55; Year: 2013
Moving from exascale to zettascale computing: challenges and techniques
X Liao, K Lu, C Yang, J Li, Y Yuan, M Lai, L Huang, P Lu, J Fang, J Ren, ...
Frontiers of Information Technology & Electronic Engineering 19, 1236-1244, 2018
Cited by: 40; Year: 2018
Auto-tuning Streamed Applications on Intel Xeon Phi
P Zhang, J Fang, T Tang, C Yang, Z Wang
The 31st IEEE International Parallel & Distributed Processing Symposium, 2018
Cited by: 30; Year: 2018
Benchmarking Intel Xeon Phi to guide kernel design
J Fang, AL Varbanescu, H Sips, L Zhang, Y Che, C Xu
Cited by: 29; Year: 2013
Adaptive Optimization of Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures
S Chen, J Fang, D Chen, C Xu, Z Wang
The 20th IEEE International Conference on High Performance Computing and …, 2018
Cited by: 28; Year: 2018
Grover: Looking for Performance Improvement by Disabling Local Memory Usage in OpenCL Kernels
J Fang, H Sips, P Jaaskelainen, AL Varbanescu
The 43rd International Conference on Parallel Processing (ICPP’14), 2014
Cited by: 24; Year: 2014
Parallel programming models for heterogeneous many-cores: a comprehensive survey
J Fang, C Huang, T Tang, Z Wang
CCF Transactions on High Performance Computing 2, 382-400, 2020
Cited by: 23; Year: 2020
Proteus: Network-aware web browsing on heterogeneous mobile systems
J Ren, X Wang, J Fang, Y Feng, D Zhu, Z Luo, J Zheng, Z Wang
Proceedings of the 14th International Conference on emerging Networking …, 2018
Cited by: 23; Year: 2018
Optimizing Sparse Matrix-Vector Multiplications on An ARMv8-based Many-Core Architecture
D Chen, J Fang, S Chen, C Xu, Z Wang
International Journal of Parallel Programming, 2018
Cited by: 22; Year: 2018
Deep learning research and development platform: Characterizing and scheduling with QoS guarantees on GPU clusters
Z Chen, W Quan, M Wen, J Fang, J Yu, C Zhang, L Luo
IEEE Transactions on Parallel and Distributed Systems 31 (1), 34-50, 2019
Cited by: 21; Year: 2019
Aristotle: A performance impact indicator for the OpenCL kernels using local memory
J Fang, H Sips, AL Varbanescu
Scientific Programming, 239-257, 2014
Cited by: 21; Year: 2014
Parallel Computation of Non-Bonded Interactions in Drug Discovery: Nvidia GPUs vs. Intel Xeon Phi
HPS Jianbin Fang, Ana Lucia Varbanescu, Baldomero Imbernon, Jose M. Cecilia
The 2nd International Work-Conference on Bioinformatics and Biomedical …, 2014
Cited by: 20*; Year: 2014
Deep program structure modeling through multi-relational graph-based learning
G Ye, Z Tang, H Wang, D Fang, J Fang, S Huang, Z Wang
Proceedings of the ACM International conference on parallel architectures …, 2020
Cited by: 19; Year: 2020
To Compress, or Not to Compress: Characterizing Deep Learning Model Compression for Embedded Inference
Q Qing, J Ren, J Yu, L Gao, H Wang, J Zheng, Y Feng, J Fang, Z Wang
The 16th IEEE International Symposium on Parallel and Distributed Processing …, 2018
Cited by: 19; Year: 2018