Segui
Ahmad Abdelfattah
Ahmad Abdelfattah
Research Assistant Professor, Innovative Computing Laboratory, University of Tennessee
Email verificata su icl.utk.edu
Titolo
Citata da
Citata da
Anno
Performance, design, and autotuning of batched GEMM for GPUs
A Abdelfattah, A Haidar, S Tomov, J Dongarra
High Performance Computing: 31st International Conference, ISC High …, 2016
1282016
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic
A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, A Fox, ...
The International Journal of High Performance Computing Applications 35 (4 …, 2021
1102021
High-performance tensor contractions for GPUs
A Abdelfattah, M Baboulin, V Dobrev, J Dongarra, C Earl, J Falcou, ...
Procedia Computer Science 80, 108-118, 2016
762016
High-performance matrix-matrix multiplications of very small matrices
I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ...
Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016
692016
Parallel programming models for dense linear algebra on heterogeneous systems
J Dongarra, M Abalenkovs, A Abdelfattah, M Gates, A Haidar, J Kurzak, ...
Supercomputing frontiers and innovations 2 (4), 67-86, 2015
622015
Kblas: An optimized library for dense matrix-vector multiplication on gpu accelerators
A Abdelfattah, D Keyes, H Ltaief
ACM Transactions on Mathematical Software (TOMS) 42 (3), 1-31, 2016
552016
The design of fast and energy-efficient linear solvers: On the potential of half-precision arithmetic and iterative refinement techniques
A Haidar, A Abdelfattah, M Zounon, P Wu, S Pranesh, S Tomov, ...
International conference on computational science, 586-600, 2018
502018
A survey of numerical methods utilizing mixed precision arithmetic
A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, ...
arXiv preprint arXiv:2007.06674, 2020
462020
With extreme computing, the rules have changed
J Dongarra, S Tomov, P Luszczek, J Kurzak, M Gates, I Yamazaki, H Anzt, ...
Computing in Science & Engineering 19 (3), 52-62, 2017
452017
Efficient exascale discretizations: High-order finite element methods
T Kolev, P Fischer, M Min, J Dongarra, J Brown, V Dobrev, T Warburton, ...
The International Journal of High Performance Computing Applications 35 (6 …, 2021
442021
Fast batched matrix multiplication for small sizes using half-precision arithmetic on GPUs
A Abdelfattah, S Tomov, J Dongarra
2019 IEEE international parallel and distributed processing symposium (IPDPS …, 2019
422019
A novel fast and accurate pseudo-analytical simulation approach for MOAO
É Gendron, A Charara, A Abdelfattah, D Gratadour, D Keyes, H Ltaief, ...
Adaptive Optics Systems IV 9148, 2148-2160, 2014
382014
A guide for achieving high performance with very small matrices on GPU: a case study of batched LU and Cholesky factorizations
A Haidar, A Abdelfattah, M Zounon, S Tomov, J Dongarra
IEEE Transactions on Parallel and Distributed Systems 29 (5), 973-984, 2017
252017
Design, optimization, and benchmarking of dense linear algebra algorithms on AMD GPUs
C Brown, A Abdelfattah, S Tomov, J Dongarra
2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2020
242020
Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs
A Abdelfattah, A Haidar, S Tomov, J Dongarra
Proceedings of the International Conference on Supercomputing, 1-10, 2017
242017
Fast Cholesky factorization on GPUs for batch and native modes in MAGMA
A Abdelfattah, A Haidar, S Tomov, J Dongarra
Journal of Computational Science 20, 85-93, 2017
242017
A set of batched basic linear algebra subprograms and LAPACK routines
A Abdelfattah, T Costa, J Dongarra, M Gates, A Haidar, S Hammarling, ...
ACM Transactions on Mathematical Software (TOMS) 47 (3), 1-23, 2021
232021
Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices
I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ...
Parallel Computing 81, 1-21, 2019
232019
GPU algorithms for efficient exascale discretizations
A Abdelfattah, V Barra, N Beams, R Bleile, J Brown, JS Camier, R Carson, ...
Parallel Computing 108, 102841, 2021
222021
Towards half-precision computation for complex matrices: A case study for mixed precision solvers on gpus
A Abdelfattah, S Tomov, J Dongarra
2019 IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for …, 2019
222019
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20