Bharat Kaul
Bharat Kaul
Intel Labs
Verified email at intel.com - Homepage
Title
Cited by
Cited by
Year
Method and apparatus to manage network addresses
B Kaul, N Tulpule, M Zhu, P Krishnaswamy
US Patent App. 10/651,929, 2005
1952005
Scaledeep: A scalable compute architecture for learning and evaluating deep networks
S Venkataramani, A Ranjan, S Banerjee, D Das, S Avancha, ...
Proceedings of the 44th Annual International Symposium on Computer …, 2017
1562017
Distributed deep learning using synchronous stochastic gradient descent
D Das, S Avancha, D Mudigere, K Vaidynathan, S Sridharan, D Kalamkar, ...
arXiv preprint arXiv:1602.06709, 2016
1512016
Mixed precision training of convolutional neural networks using integer operations
D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ...
arXiv preprint arXiv:1802.00930, 2018
962018
Out-of-distribution detection using an ensemble of self supervised leave-out classifiers
A Vyas, N Jammalamadaka, X Zhu, D Das, B Kaul, TL Willke
Proceedings of the European Conference on Computer Vision (ECCV), 550-564, 2018
892018
Ternary neural networks with fine-grained quantization
N Mellempudi, A Kundu, D Mudigere, D Das, B Kaul, P Dubey
arXiv preprint arXiv:1705.01462, 2017
802017
Sigma: A sparse and irregular gemm accelerator with flexible interconnects for dnn training
E Qin, A Samajdar, H Kwon, V Nadella, S Srinivasan, D Das, B Kaul, ...
2020 IEEE International Symposium on High Performance Computer Architecture …, 2020
672020
A study of BFLOAT16 for deep learning training
D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ...
arXiv preprint arXiv:1905.12322, 2019
662019
Data structure and movement for lattice-based simulations
AG Shet, SH Sorathiya, S Krithivasan, AM Deshpande, B Kaul, ...
Physical Review E 88 (1), 013314, 2013
422013
Mixed precision training with 8-bit floating point
N Mellempudi, S Srinivasan, D Das, B Kaul
arXiv preprint arXiv:1905.12334, 2019
272019
Mixed low-precision deep learning inference using dynamic fixed point
N Mellempudi, A Kundu, D Das, D Mudigere, B Kaul
arXiv preprint arXiv:1701.08978, 2017
232017
On scale-out deep learning training for cloud and HPC
S Sridharan, K Vaidyanathan, D Kalamkar, D Das, ME Smorkalov, ...
arXiv preprint arXiv:1801.08030, 2018
222018
Exploring shared-memory optimizations for an unstructured mesh CFD application on modern parallel systems
D Mudigere, S Sridharan, A Deshpande, J Park, A Heinecke, ...
2015 IEEE International Parallel and Distributed Processing Symposium, 723-732, 2015
182015
Protocol interworking framework
B Kaul, N Tulpule, D Sawhney
US Patent App. 10/651,531, 2005
182005
Rail: Risk-averse imitation learning
A Santara, A Naik, B Ravindran, D Das, D Mudigere, S Avancha, B Kaul
arXiv preprint arXiv:1707.06658, 2017
162017
On vectorization for lattice based simulations
AG Shet, K Siddharth, SH Sorathiya, AM Deshpande, SD Sherlekar, ...
International Journal of Modern Physics C 24 (12), 1340011, 2013
162013
Improving communication performance and scalability of native applications on intel xeon phi coprocessor clusters
K Vaidyanathan, K Pamnany, DD Kalamkar, A Heinecke, M Smelyanskiy, ...
2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014
152014
Ternary residual networks
A Kundu, K Banerjee, N Mellempudi, D Mudigere, D Das, B Kaul, ...
arXiv preprint arXiv:1707.04679, 2017
112017
X-mann: A crossbar based architecture for memory augmented neural networks
A Ranjan, S Jain, JR Stevens, D Das, B Kaul, A Raghunathan
Proceedings of the 56th Annual Design Automation Conference 2019, 1-6, 2019
102019
High performance non-uniform FFT on modern x86-based multi-core systems
DD Kalamkar, JD Trzaskoz, S Sridharan, M Smelyanskiy, D Kim, ...
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
102012
The system can't perform the operation now. Try again later.
Articles 1–20