An evaluation of vectorizing compilers S Maleki, Y Gao, MJ Garzar, T Wong, DA Padua 2011 International Conference on Parallel Architectures and Compilation …, 2011 | 318 | 2011 |
CHET: an optimizing compiler for fully-homomorphic neural-network inferencing R Dathathri, O Saarikivi, H Chen, K Laine, K Lauter, S Maleki, ... Proceedings of the 40th ACM SIGPLAN conference on programming language …, 2019 | 266 | 2019 |
Performance portability with the chapel language A Sidelnik, S Maleki, BL Chamberlain, MJ Garzar'n, D Padua 2012 IEEE 26th international parallel and distributed processing symposium …, 2012 | 66 | 2012 |
Is Moore's Party Over? MY Vardi Commun. ACM 54 (11), 5, 2011 | 66* | 2011 |
DSMR: A parallel algorithm for single-source shortest path problem S Maleki, D Nguyen, A Lenharth, M Garzarán, D Padua, K Pingali Proceedings of the 2016 International Conference on Supercomputing, 1-14, 2016 | 54 | 2016 |
Splitwise: Efficient generative llm inference using phase splitting P Patel, E Choukse, C Zhang, A Shah, Í Goiri, S Maleki, R Bianchini 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024 | 50 | 2024 |
Parallelizing dynamic programming through rank convergence S Maleki, M Musuvathi, T Mytkowicz ACM SIGPLAN Notices 49 (8), 219-232, 2014 | 49 | 2014 |
Synthesizing optimal collective algorithms Z Cai, Z Liu, S Maleki, M Musuvathi, T Mytkowicz, J Nelson, O Saarikivi Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021 | 48 | 2021 |
An empirical study of the effect of source-level loop transformations on compiler stability Z Gong, Z Chen, J Szaday, D Wong, Z Sura, N Watkinson, S Maleki, ... Proceedings of the ACM on Programming Languages 2 (OOPSLA), 1-29, 2018 | 42 | 2018 |
Breaking the computation and communication abstraction barrier in distributed machine learning workloads A Jangda, J Huang, G Liu, AHN Sabet, S Maleki, Y Miao, M Musuvathi, ... Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 39 | 2022 |
{TACCL}: Guiding Collective Algorithm Synthesis using Communication Sketches A Shah, V Chidambaram, M Cowan, S Maleki, M Musuvathi, T Mytkowicz, ... 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023 | 35 | 2023 |
Inter-disciplinary research challenges in computer systems for the 2020s A Cohen, X Shen, J Torrellas, J Tuck, Y Zhou, S Adve, I Akturk, S Bagchi, ... National Science Foundation, 2018 | 30 | 2018 |
Implementing network security measures in response to a detected cyber attack MS Musuvathi, TD Mytkowicz, S Maleki, Y Ding US Patent 10,805,317, 2020 | 29 | 2020 |
Parallel dynamic programming through rank convergence TD Mytkowicz, M Musuvathi, S Maleki US Patent 9,195,436, 2015 | 28 | 2015 |
Homomorphic evaluation of tensor programs MS Musuvathi, K Laine, KE Lauter, H Chen, OI Saarikivi, S Maleki, ... US Patent 11,177,935, 2021 | 21 | 2021 |
Determining a likelihood of a user interaction with a content element MS Musuvathi, TD Mytkowicz, S Maleki, Y Ding US Patent 11,062,226, 2021 | 20 | 2021 |
Lore: A loop repository for the evaluation of compilers Z Chen, Z Gong, JJ Szaday, DC Wong, D Padua, A Nicolau, ... 2017 IEEE International Symposium on Workload Characterization (IISWC), 219-228, 2017 | 19 | 2017 |
Parallelizing wfst speech decoders C Mendis, J Droppo, S Maleki, M Musuvathi, T Mytkowicz, G Zweig 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 19 | 2016 |
Efficient parallelization using rank convergence in dynamic programming algorithms S Maleki, M Musuvathi, T Mytkowicz Communications of the ACM 59 (10), 85-92, 2016 | 17 | 2016 |
Mscclang: Microsoft collective communication language M Cowan, S Maleki, M Musuvathi, O Saarikivi, Y Xiong Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 16 | 2023 |