Hatem Ltaief
Hatem Ltaief
Principal Research Scientist, KAUST
Verified email at kaust.edu.sa - Homepage
Title
Cited by
Cited by
Year
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects
E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ...
Journal of Physics: Conference Series 180 (1), 012037, 2009
4752009
Dense linear algebra solvers for multicore with GPU accelerators
S Tomov, R Nath, H Ltaief, J Dongarra
2010 IEEE International Symposium on Parallel & Distributed Processingá…, 2010
2702010
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA
G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ...
2011 IEEE International Symposium on Parallel and Distributed Processingá…, 2011
165*2011
Scheduling dense linear algebra operations on multicore processors
J Kurzak, H Ltaief, J Dongarra, RM Badia
Concurrency and Computation: Practice and Experience 22 (1), 15-44, 2010
1282010
QR factorization on a multicore node enhanced with multiple GPU accelerators
E Agullo, C Augonnet, J Dongarra, M Faverge, H Ltaief, S Thibault, ...
2011 IEEE International Parallel & Distributed Processing Symposium, 932-943, 2011
1222011
A hybridization methodology for high-performance linear algebra software for GPUs
E Agullo, C Augonnet, J Dongarra, H Ltaief, R Namyst, S Thibault, ...
GPU Computing Gems Jade Edition, 473-484, 2012
1092012
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware
E Agullo, B Hadri, H Ltaief, J Dongarrra
Proceedings of the Conference on High Performance Computing Networkingá…, 2009
952009
LU factorization for accelerator-based systems
E Agullo, C Augonnet, J Dongarra, M Faverge, J Langou, H Ltaief, ...
2011 9th IEEE/ACS International Conference on Computer Systems andá…, 2011
782011
Multicore-optimized wavefront diamond blocking for optimizing stencil updates
T Malas, G Hager, H Ltaief, H Stengel, G Wellein, D Keyes
SIAM Journal on Scientific Computing 37 (4), C439-C464, 2015
692015
Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels
A Haidar, H Ltaief, J Dongarra
Proceedings of 2011 International Conference for High Performance Computingá…, 2011
682011
A scalable high performant Cholesky factorization for multicore with GPU accelerators
H Ltaief, S Tomov, R Nath, P Du, J Dongarra
International Conference on High Performance Computing for Computationalá…, 2010
662010
Energy footprint of advanced dense numerical linear algebra using tile algorithms on multicore architectures
J Dongarra, H Ltaief, P Luszczek, VM Weaver
2012 Second International Conference on Cloud and Green Computing, 274-281, 2012
592012
Plasma users guide
E Agullo, J Dongarra, B Hadri, J Kurzak, J Langou, J Langou, H Ltaief, ...
Technical report, ICL, UTK, 2009
562009
Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures
P Luszczek, H Ltaief, J Dongarra
2011 IEEE International Parallel & Distributed Processing Symposium, 944-955, 2011
482011
Data‐driven execution of fast multipole methods
H Ltaief, R Yokota
Concurrency and Computation: Practice and Experience 26 (11), 1935-1946, 2014
442014
Trends in data locality abstractions for HPC systems
D Unat, A Dubey, T Hoefler, J Shalf, M Abraham, M Bianco, ...
IEEE Transactions on Parallel and Distributed Systems 28 (10), 3007-3020, 2017
412017
Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures
A Haidar, H Ltaief, A YarKhan, J Dongarra
Concurrency and Computation: Practice and Experience 24 (3), 305-321, 2012
412012
Achieving numerical accuracy and high performance using recursive tile LU factorization with partial pivoting
J Dongarra, M Faverge, H Ltaief, P Luszczek
Concurrency and Computation: Practice and Experience 26 (7), 1408-1431, 2014
402014
Tile QR factorization with parallel panel processing for multicore architectures
B Hadri, H Ltaief, E Agullo, J Dongarra
2010 IEEE International Symposium on Parallel & Distributed Processingá…, 2010
402010
Programming abstractions for data locality
A Tate, A Kamil, A Dubey, A Gr÷▀linger, B Chamberlain, B Goglin, ...
382014
The system can't perform the operation now. Try again later.
Articles 1–20