Nirmal Prajapati
Title
Cited by
Year
Unity: Accelerating DNN training through joint optimization of algebraic transformations and parallelization
C Unger, Z Jia, W Wu, S Lin, M Baines, CEQ Narvaez, V Ramakrishnaiah, ...
16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022
Cited by: 56; Year: 2022
Simple, accurate, analytical time modeling and optimal tile size selection for GPGPU stencils
N Prajapati, W Ranasinghe, S Rajopadhye, R Andonov, H Djidjev, ...
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of …, 2017
Cited by: 27; Year: 2017
Optimization approach to accelerator codesign
N Prajapati, S Rajopadhye, H Djidjev, N Santhi, T Grosser, R Andonov
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2019
Cited by: 3; Year: 2019
PCOT: Cache oblivious tiling of polyhedral programs
W Ranasinghe, N Prajapati, T Yuki, S Rajopadhye
arXiv preprint arXiv:1802.00166, 2018
Cited by: 3; Year: 2018
Hybrid static/dynamic schedules for tiled polyhedral programs
T Jin, N Prajapati, W Ranasinghe, G Iooss, Y Zou, S Rajopadhye, ...
arXiv preprint arXiv:1610.07236, 2016
Cited by: 3; Year: 2016
Energy modeling and optimization for tiled nested-loop codes
N Prajapati, W Ranasinghe, V Tandrapati, R Andonov, H Djidjev, ...
2015 IEEE International Parallel and Distributed Processing Symposium …, 2015
Cited by: 3; Year: 2015
Revisiting sparse dynamic programming for the 0/1 Knapsack Problem
TI Sifat, N Prajapati, S Rajopadhye
Proceedings of the 49th International Conference on Parallel Processing, 1-10, 2020
Cited by: 2; Year: 2020
Scheduling and tiling reductions on realistic machines
N Prajapati
arXiv preprint arXiv:1801.05909, 2018
Cited by: 2; Year: 2018
Hybrid Static/Dynamic Schedules for Tiled Polyhedral Programs. CoRR abs/1610.07236 (2016)
T Jin, N Prajapati, W Ranasinghe, G Iooss, Y Zou, S Rajopadhye, ...
arXiv preprint arXiv:1610.07236, 2016
Cited by: 2; Year: 2016
Transformations for Energy Efficient Accelerated Chain Matrix Multiplication (TEE-ACM 2)
M Moraru, M Warnet, J Loiseau, V Ramakrishnaiah, N Prajapati, H Lim, ...
Supercomputing, 2022
Cited by: 1; Year: 2022
Accelerator Codesign as Non-Linear Optimization
N Prajapati, S Rajopadhye, H Djidjev, N Santhi, T Grosser, R Andonov
arXiv preprint arXiv:1712.04892, 2017
Cited by: 1; Year: 2017
Energy modeling and optimization for GPU stencil computations
N Prajapati, W Ranasinghe, V Tandrapati, R Andonov, H Djidjev, ...
Manuscript submitted for publication, 2014
Cited by: 1; Year: 2014
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques
H Abdelkhalik, S Aktar, Y Arafa, A Barai, G Chennupati, N Santhi, ...
2023 IEEE 29th International Conference on Parallel and Distributed Systems …, 2023
2023
Modeling and Characterizing Shared and Local Memories of the Ampere GPUs
H Abdelkhalik, Y Arafa, N Santhi, N Prajapati, AHA Badawy
Proceedings of the International Symposium on Memory Systems, 1-3, 2023
2023
Modeling and Characterizing Shared and Local Memories of the Ampere GPUs
A Badawy
2023
ASCR Reverse Site Visit [Slides]
I Qualters, AA Hagberg, GM Shipman, L Chacon, PS McCormick, ...
Los Alamos National Lab. (LANL), Los Alamos, NM (United States), 2022
2022
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques
S Aktar, H Abdelkhalik, NH Turja, Y Arafa, A Barai, N Panda, ...
arXiv preprint arXiv:2202.07798, 2022
2022
Position Papers for the ASCR Workshop on Reimagining Codesign
JA Ang, AA Chien, S Hammond, A Hoisie, I Karlin, S Pakin, J Shalf, ...
US Department of Energy (USDOE), Washington DC (United States). Office of …, 2021
2021
The Ristra Project: FY20/21 Milestone Report
DJ Daniel, AL Hungerford, BK Bergen, DB Bowen, TP Burke, ...
Los Alamos National Lab. (LANL), Los Alamos, NM (United States), 2020
2020
Analytical Cost Metrics: Days of Future Past
N Prajapati
Colorado State University, 2019
2019