Aurelien Bouteiller
TitleCited byYear
MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes
G Bosilca, A Bouteiller, F Cappello, S Djilali, G Fedak, C Germain, ...
SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 29-29, 2002
4202002
DAGuE: A generic distributed DAG engine for high performance computing
G Bosilca, A Bouteiller, A Danalis, T Herault, P Lemarinier, J Dongarra
Parallel Computing 38 (1-2), 37-51, 2012
3462012
MPICH-V2: a fault tolerant MPI for volatile nodes based on pessimistic sender based message logging
A Bouteiller, F Cappello, T Herault, G Krawezik, P Lemarinier, F Magniette
Proceedings of the 2003 ACM/IEEE conference on Supercomputing, 25, 2003
2592003
MPICH-V project: A multiprotocol automatic fault-tolerant MPI
A Bouteiller, T Herault, G Krawezik, P Lemarinier, F Cappello
The International Journal of High Performance Computing Applications 20 (3 …, 2006
1832006
Parsec: Exploiting heterogeneity to enhance scalability
G Bosilca, A Bouteiller, A Danalis, M Faverge, T Hérault, JJ Dongarra
Computing in Science & Engineering 15 (6), 36-45, 2013
1632013
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA
G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ...
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
148*2011
Post-failure recovery of MPI communication capability: Design and rationale
W Bland, A Bouteiller, T Herault, G Bosilca, J Dongarra
The International Journal of High Performance Computing Applications 27 (3 …, 2013
1412013
Algorithm-based fault tolerance for dense matrix factorizations
P Du, A Bouteiller, G Bosilca, T Herault, J Dongarra
Acm sigplan notices 47 (8), 225-234, 2012
1392012
Coordinated checkpoint versus message log for fault tolerant MPI
A Bouteiller, P Lemarinier, G Krawezik, F Cappello
null, 242, 2003
1192003
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
European MPI Users' Group Meeting, 193-203, 2012
982012
Improved message logging versus improved coordinated checkpointing for fault tolerant MPI
P Lemarinier, A Bouteiller, T Herault, G Krawezik, F Cappello
2004 IEEE International Conference on Cluster Computing (IEEE Cat. No …, 2004
952004
Redesigning the message logging model for high performance
A Bouteiller, G Bosilca, J Dongarra
Concurrency and Computation: Practice and Experience 22 (16), 2196-2211, 2010
832010
Unified model for assessing checkpointing protocols at extreme‐scale
G Bosilca, A Bouteiller, E Brunet, F Cappello, J Dongarra, A Guermouche, ...
Concurrency and Computation: Practice and Experience 26 (17), 2772-2791, 2014
662014
Hierarchical dag scheduling for hybrid distributed systems
W Wu, A Bouteiller, G Bosilca, M Faverge, J Dongarra
2015 IEEE International Parallel and Distributed Processing Symposium, 156-165, 2015
502015
Correlated set coordination in fault tolerant message logging protocols
A Bouteiller, T Herault, G Bosilca, JJ Dongarra
European Conference on Parallel Processing, 51-64, 2011
472011
Reasons for a pessimistic or optimistic message logging protocol in MPI uncoordinated failure, recovery
A Bouteiller, T Ropars, G Bosilca, C Morin, J Dongarra
2009 IEEE International Conference on Cluster Computing and Workshops, 1-9, 2009
432009
Kernel assisted collective intra-node mpi communication among multi-core and many-core cpus
T Ma, G Bosilca, A Bouteiller, B Goglin, JM Squyres, JJ Dongarra
2011 International Conference on Parallel Processing, 532-541, 2011
422011
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
Computing 95 (12), 1171-1184, 2013
402013
A checkpoint-on-failure protocol for algorithm-based recovery in standard MPI
W Bland, P Du, A Bouteiller, T Herault, G Bosilca, J Dongarra
European Conference on Parallel Processing, 477-488, 2012
382012
Impact of event logger on causal message logging protocols for fault tolerant MPI
A Bouteiller, B Collin, T Herault, P Lemarinier, F Cappello
19th IEEE International Parallel and Distributed Processing Symposium, 10 pp., 2005
382005
The system can't perform the operation now. Try again later.
Articles 1–20