A class of parallel tiled linear algebra algorithms for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Parallel Computing 35 (1), 38-53, 2009 | 555 | 2009 |

Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ... Journal of Physics: Conference Series 180 (1), 012037, 2009 | 439 | 2009 |

Communication-optimal parallel and sequential QR and LU factorizations J Demmel, L Grigori, M Hoemmen, J Langou SIAM Journal on Scientific Computing 34 (1), A206-A239, 2012 | 344 | 2012 |

Parallel tiled QR factorization for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Concurrency and Computation: Practice and Experience 20 (13), 1573-1590, 2008 | 231 | 2008 |

Algorithm-based fault tolerance applied to high performance computing G Bosilca, R Delmas, J Dongarra, J Langou Journal of Parallel and Distributed Computing 69 (4), 410-416, 2009 | 192 | 2009 |

Tiled QR factorization algorithms H Bouwmeester, M Jacquelin, J Langou, Y Robert Proceedings of 2011 International Conference for High Performance Computing …, 2011 | 171 | 2011 |

Algorithm 842: A set of GMRES routines for real and complex arithmetics on high performance computers V Frayssé, L Giraud, S Gratton, J Langou ACM Transactions on Mathematical Software (TOMS) 31 (2), 228-238, 2005 | 171 | 2005 |

Accelerating scientific computations with mixed precision algorithms M Baboulin, A Buttari, J Dongarra, J Kurzak, J Langou, J Langou, ... Computer Physics Communications 180 (12), 2526-2533, 2009 | 153 | 2009 |

The impact of multicore on math software A Buttari, J Dongarra, J Kurzak, J Langou, P Luszczek, S Tomov International Workshop on Applied Parallel Computing, 1-10, 2006 | 150 | 2006 |

Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 148* | 2011 |

Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems) J Langou, J Langou, P Luszczek, J Kurzak, A Buttari, J Dongarra SC'06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, 50-50, 2006 | 146 | 2006 |

The loss of orthogonality in the Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozložník Computers & Mathematics with Applications 50 (7), 1069-1075, 2005 | 140 | 2005 |

Fault tolerant high performance computing by a coding approach Z Chen, GE Fagg, E Gabriel, J Langou, T Angskun, G Bosilca, J Dongarra Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of …, 2005 | 120 | 2005 |

Mixed precision iterative refinement techniques for the solution of dense linear systems A Buttari, J Dongarra, J Langou, J Langou, P Luszczek, J Kurzak The International Journal of High Performance Computing Applications 21 (4 …, 2007 | 119 | 2007 |

Rounding error analysis of the classical Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozložník, J van den Eshof Numerische Mathematik 101 (1), 87-100, 2005 | 119 | 2005 |

Handbook of parallel computing: models, algorithms and applications S Rajasekaran, J Reif CRC Press, 2007 | 95* | 2007 |

LU factorization for accelerator-based systems E Agullo, C Augonnet, J Dongarra, M Faverge, J Langou, H Ltaief, ... 2011 9th IEEE/ACS International Conference on Computer Systems and …, 2011 | 76 | 2011 |

A set of GMRES routines for real and complex arithmetics V Frayssé, L Giraud, S Gratton, J Langou Tech. Rep. TR/PA/97/49, CERFACS, France, 1997 | 57 | 1997 |

Communication-avoiding parallel and sequential QR factorizations J Demmel, L Grigori, M Hoemmen, J Langou CoRR abs/0806.2159, 2008 | 53 | 2008 |

Hierarchical QR factorization algorithms for multi-core clusters J Dongarra, M Faverge, T Herault, M Jacquelin, J Langou, Y Robert Parallel Computing 39 (4-5), 212-232, 2013 | 52 | 2013 |