Ben Athiwaratkun
Senior AI Scientist
Verified email at amazon.com
Title
Cited by
Year
Adversarial deep averaging networks for cross-lingual sentiment classification
X Chen, Y Sun, B Athiwaratkun, C Cardie, K Weinberger
Transactions of the Association for Computational Linguistics 6, 557-570, 2018
Cited by 346, 2018
There are many consistent explanations of unlabeled data: Why you should average
B Athiwaratkun, M Finzi, P Izmailov, AG Wilson
arXiv preprint arXiv:1806.05594, 2018
Cited by 284*, 2018
Malware classification with LSTM and GRU language models and a character-level CNN
B Athiwaratkun, JW Stokes
2017 IEEE international conference on acoustics, speech and signal …, 2017
Cited by 283, 2017
Structured prediction as translation between augmented natural languages
G Paolini, B Athiwaratkun, J Krone, J Ma, A Achille, R Anubhai, ...
arXiv preprint arXiv:2101.05779, 2021
Cited by 230, 2021
Probabilistic fasttext for multi-sense word embeddings
B Athiwaratkun, AG Wilson, A Anandkumar
arXiv preprint arXiv:1806.02901, 2018
Cited by 177, 2018
Multimodal word distributions
B Athiwaratkun, AG Wilson
arXiv preprint arXiv:1704.08424, 2017
Cited by 121, 2017
Hierarchical density order embeddings
B Athiwaratkun, AG Wilson
arXiv preprint arXiv:1804.09843, 2018
Cited by 61, 2018
Multi-lingual evaluation of code generation models
B Athiwaratkun, SK Gouda, Z Wang, X Li, Y Tian, M Tan, WU Ahmad, ...
arXiv preprint arXiv:2210.14868, 2022
Cited by 57, 2022
Augmented natural language for generative sequence labeling
B Athiwaratkun, CN Santos, J Krone, B Xiang
arXiv preprint arXiv:2009.13272, 2020
Cited by 52, 2020
Improving stability in deep reinforcement learning with weight averaging
E Nikishin, P Izmailov, B Athiwaratkun, D Podoprikhin, T Garipov, ...
Uncertainty in Artificial Intelligence Workshop on Uncertainty in Deep Learning, 2018
Cited by 45, 2018
Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, and Bing Xiang
B Athiwaratkun, SK Gouda, Z Wang, X Li, Y Tian, M Tan, WU Ahmad, ...
Multi-lingual evaluation of code generation models, 2022
Cited by 18, 2022
Generative context pair selection for multi-hop question answering
D Dua, CN Santos, P Ng, B Athiwaratkun, B Xiang, M Gardner, S Singh
arXiv preprint arXiv:2104.08744, 2021
Cited by 4, 2021
Infinite symmetric ergodic index and related examples in infinite measure
I Loh, C Silva, B Athiwaratkun
arXiv preprint arXiv:1702.01455, 2017
Cited by 3, 2017
Towards greener yet powerful code generation via quantization: An empirical study
X Wei, SK Gonugondla, S Wang, W Ahmad, B Ray, H Qian, X Li, V Kumar, ...
Proceedings of the 31st ACM Joint European Software Engineering Conference …, 2023
Cited by 1*, 2023
On io-efficient attention mechanisms: Context-aware bifurcated attention and the generalized multi-group attention
B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ...
Workshop on Efficient Systems for Foundation Models @ ICML 2023, 2023
Cited by 1, 2023
Bifurcated Attention for Single-Context Large-Batch Sampling
B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ...
arXiv preprint arXiv:2403.08845, 2024
2024
Token Alignment via Character Matching for Subword Completion
B Athiwaratkun, S Wang, M Shang, Y Tian, Z Wang, SK Gonugondla, ...
arXiv preprint arXiv:2403.08688, 2024
2024
Random token segmentation for training next token prediction models
Z Wang, T Yuchen, M Shang, P Athiwaratkun, M Tan, P Bhatia, AO Arnold, ...
US Patent App. 17/847,118, 2023
2023
Programmatically generating evaluation data sets for code generation models
P Athiwaratkun, Z Lin, R Keerthi, Z Wang, T Yuchen, H Ding, SRA Bontala, ...
US Patent App. 17/847,113, 2023
2023
Constrained prefix matching for generating next token predictions
P Athiwaratkun, T Yuchen, M Shang, Z Wang, RM Nallapati, P Bhatia, ...
US Patent App. 17/847,115, 2023
2023