APEnet+: a 3D Torus network optimized for GPU-based HPC Systems R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, PS Paolucci, ... Journal of Physics: Conference Series 396 (4), 042059, 2012 | 60 | 2012 |
Gpu peer-to-peer techniques applied to a cluster interconnect R Ammendola, M Bernaschi, A Biagioni, M Bisson, M Fatica, O Frezza, ... 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 47 | 2013 |
Time-decoupled parallel SystemC simulation JH Weinstock, C Schumacher, R Leupers, G Ascheid, L Tosoratto 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-4, 2014 | 45 | 2014 |
APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, PS Paolucci, ... Journal of Physics: Conference Series 331 (5), 052029, 2011 | 38 | 2011 |
NaNet: a flexible and configurable low-latency NIC for real-time trigger systems based on GPUs R Ammendola, A Biagioni, O Frezza, G Lamanna, A Lonardo, FL Cicero, ... Journal of Instrumentation 9 (02), C02023, 2014 | 29 | 2014 |
APEnet+ 34 Gbps data transmission system and custom transmission logic R Ammendola, A Biagioni, O Frezza, A Lonardo, FL Cicero, PS Paolucci, ... Journal of Instrumentation 8 (12), C12022, 2013 | 27 | 2013 |
QUonG: A GPU-based HPC system dedicated to LQCD computing R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, PS Paolucci, ... 2011 Symposium on Application Accelerators in High-Performance Computing …, 2011 | 26 | 2011 |
Dynamic many-process applications on many-tile embedded systems and HPC clusters: The EURETILE programming environment and execution platforms PS Paolucci, A Biagioni, LG Murillo, F Rousseau, L Schor, L Tosoratto, ... Journal of Systems Architecture 69, 29-53, 2016 | 23 | 2016 |
Virtual-to-Physical address translation for an FPGA-based interconnect with host and GPU remote DMA capabilities R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, PS Paolucci, ... 2013 International Conference on Field-Programmable Technology (FPT), 58-65, 2013 | 23 | 2013 |
legaSCi: Legacy SystemC model integration into parallel SystemC simulators C Schumacher, JH Weinstock, R Leupers, G Ascheid, L Tosoratto, ... IEEE 27th International Symposium on Parallel and Distributed Processing …, 2013 | 23 | 2013 |
NaNet-10: a 10GbE network interface card for the GPU-based low-level trigger of the NA62 RICH detector. R Ammendola, A Biagioni, M Fiorini, O Frezza, A Lonardo, G Lamanna, ... Journal of Instrumentation 11 (03), C03030, 2016 | 20 | 2016 |
Distributed simulation of polychronous and plastic spiking neural networks: strong and weak scaling of a representative mini-application benchmark executed on a small-scale … PS Paolucci, R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, ... arXiv preprint arXiv:1310.8478, 2013 | 19 | 2013 |
NaNet: a configurable NIC bridging the gap between HPC and real-time HEP GPU computing A Lonardo, F Ameli, R Ammendola, A Biagioni, AC Ramusino, M Fiorini, ... Journal of Instrumentation 10 (04), C04011, 2015 | 18 | 2015 |
A hierarchical watchdog mechanism for systemic fault awareness on distributed systems R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, PS Paolucci, ... Future Generation Computer Systems 53, 90-99, 2015 | 14 | 2015 |
EURETILE design flow: Dynamic and fault tolerant mapping of multiple applications onto many-tile systems L Schor, I Bacivarov, LG Murillo, PS Paolucci, F Rousseau, A El Antably, ... 2014 IEEE International Symposium on Parallel and Distributed Processing …, 2014 | 12 | 2014 |
NaNet: a low-latency NIC enabling GPU-based, real-time low level trigger systems R Ammendola, A Biagioni, R Fantechi, O Frezza, G Lamanna, FL Cicero, ... Journal of Physics: Conference Series 513 (1), 012018, 2014 | 11 | 2014 |
High-speed data transfer with FPGAs and QSFP+ modules R Ammendola, and others Nuclear Science Symposium Conference Record (NSS/MIC), 2010 IEEE, Nuclear …, 2012 | 11 | 2012 |
APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters R Ammendola, A Biagioni, O Frezza, FL Cicero, A Lonardo, P Paolucci, ... Lattice 2010, 2010 | 11 | 2010 |
legaSCi: Legacy SystemC model integration into parallel simulators C Schumacher, JH Weinstock, R Leupers, G Ascheid, L Tosoratto, ... ACM Transactions on Embedded Computing Systems (TECS) 13 (5s), 1-24, 2014 | 9 | 2014 |
Design and implementation of a modular, low latency, fault-aware, FPGA-based network interface R Ammendola, A Biagioni, O Frezza, F Lo Cicero, A Lonardo, PS Paolucci, ... Reconfigurable Computing and FPGAs (ReConFig), 2013 International Conference …, 2013 | 8 | 2013 |