Segui
Jared Casper
Jared Casper
Research Scientist, NVIDIA
Email verificata su nvidia.com
Titolo
Citata da
Citata da
Anno
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
31912016
Deep speech: Scaling up end-to-end speech recognition
A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ...
arXiv preprint arXiv:1412.5567, 2014
22312014
Megatron-lm: Training multi-billion parameter language models using model parallelism
M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro
arXiv preprint arXiv:1909.08053, 2019
7622019
An effective hybrid transactional memory system with strong isolation guarantees
CC Minh, M Trautmann, JW Chung, A McDonald, N Bronson, J Casper, ...
Proceedings of the 34th annual international symposium on Computer …, 2007
4492007
A practical concurrent binary search tree
NG Bronson, J Casper, H Chafi, K Olukotun
ACM Sigplan Notices 45 (5), 257-268, 2010
2812010
The vector-thread architecture
R Krashinsky, C Batten, M Hampton, S Gerding, B Pharris, J Casper, ...
ACM SIGARCH Computer Architecture News 32 (2), 52, 2004
2552004
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model
S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ...
arXiv preprint arXiv:2201.11990, 2022
2382022
Hardware acceleration of database operations
J Casper, K Olukotun
Proceedings of the 2014 ACM/SIGDA international symposium on Field …, 2014
2152014
Bloom: A 176b-parameter open-access multilingual language model
TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
arXiv preprint arXiv:2211.05100, 2022
1952022
A scalable, non-blocking approach to transactional memory
H Chafi, J Casper, BD Carlstrom, A McDonald, CC Minh, W Baek, ...
2007 IEEE 13th International Symposium on High Performance Computer …, 2007
1882007
Efficient large-scale language model training on gpu clusters using megatron-lm
D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ...
Proceedings of the International Conference for High Performance Computing …, 2021
1772021
Eigenbench: A simple exploration tool for orthogonal TM characteristics
S Hong, T Oguntebi, J Casper, N Bronson, C Kozyrakis, K Olukotun
IEEE International Symposium on Workload Characterization (IISWC'10), 1-11, 2010
962010
A practical FPGA-based framework for novel CMP research
S Wee, J Casper, N Njoroge, Y Tesylar, D Ge, C Kozyrakis, K Olukotun
Proceedings of the 2007 ACM/SIGDA 15th international symposium on Field …, 2007
932007
Atlas: A chip-multiprocessor with transactional memory support
N Njoroge, J Casper, S Wee, Y Teslyar, D Ge, C Kozyrakis, K Olukotun
2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007
842007
Transactional predication: high-performance concurrent sets and maps for stm
NG Bronson, J Casper, H Chafi, K Olukotun
Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of …, 2010
682010
Systems and methods for speech transcription
A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ...
US Patent 10,540,957, 2020
592020
Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro
S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ...
Using deepspeed and megatron to train megatron-turing nlg 530b, a large …, 2022
512022
Hardware acceleration of transactional memory on commodity systems
J Casper, T Oguntebi, S Hong, NG Bronson, C Kozyrakis, K Olukotun
ACM SIGPLAN Notices 46 (3), 27-38, 2011
402011
FARM: A prototyping environment for tightly-coupled, heterogeneous architectures
T Oguntebi, S Hong, J Casper, N Bronson, C Kozyrakis, K Olukotun
2010 18th IEEE Annual International Symposium on Field-Programmable Custom …, 2010
252010
Reducing activation recomputation in large transformer models
V Korthikanti, J Casper, S Lym, L McAfee, M Andersch, M Shoeybi, ...
arXiv preprint arXiv:2205.05198, 2022
152022
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20