Dipankar Das

Cited by

	All	Since 2019
Citations	3380	2906
h-index	24	23
i10-index	38	35

680

340

170

510

2014201520162017201820192020202120222023202410 13 64 93 214 306 478 630 668 625 187

Public access

View all

7 articles

3 articles

available

not available

Based on funding mandates

Co-authors

Naveen MellempudiFellow, Advanced Micro DevicesVerified email at amd.com
Dheevatsa MudigereDistinguished Engineer, NVIDIAVerified email at nvidia.com
Srinivas Sridharan, PhdDistinguished Engineer, NVIDIAVerified email at nvidia.com
Pradeep DubeyIntel CorporationVerified email at intel.com
Abhisek KunduResearch Scientist, Intel Parallel Computing Labs, IndiaVerified email at intel.com
Anand RaghunathanProfessor of Electrical and Computer Engineering, Purdue UniversityVerified email at purdue.edu
Alexander HeineckeSenior Principal Engineer at Intel LabsVerified email at intel.com
Ashish RanjanResearch Staff Member, IBM T.J. Watson Research CenterVerified email at ibm.com
Swagath VenkataramaniResearch Staff Member, IBM T.J. Watson Research Center / Purdue Univ.Verified email at ibm.com
Satya Gautam VadlamudiElystar Investment ManagementVerified email at elystarinvest.com
Mikhail SmelyanskiyFacebookVerified email at intel.com
Jongsoo ParkResearch Scientist, FacebookVerified email at fb.com
Nataraj JammalamadakaPhd ScholarVerified email at research.iiit.ac.in
Evangelos GeorganasIntel Labs, Parallel Computing LabVerified email at intel.com
Narayanan SundaramFacebookVerified email at fb.com
Mostofa PatwaryApplied Deep Learning Research, NVIDIAVerified email at nvidia.com
DulloorGeorgia Institute of Technology, Kumo.AIVerified email at kumo.ai
Jacob R. StevensIntegrated Systems Lab, Purdue UniversityVerified email at purdue.edu
Apoorv VyasFAIR Labs MetaVerified email at meta.com
Theodore WillkeIntel LabsVerified email at intel.com

Dipankar Das

Intel Parallel Computing Labs, Intel Labs

Verified email at intel.com

AI Deep Learning System Design Computer Architecture Algorithms


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Graphmat: High performance graph analytics made productive N Sundaram, NR Satish, MMA Patwary, SR Dulloor, SG Vadlamudi, ... arXiv preprint arXiv:1503.07241, 2015	386	2015
Sigma: A sparse and irregular gemm accelerator with flexible interconnects for dnn training E Qin, A Samajdar, H Kwon, V Nadella, S Srinivasan, D Das, B Kaul, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020	377	2020
A study of BFLOAT16 for deep learning training D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ... arXiv preprint arXiv:1905.12322, 2019	296	2019
Scaledeep: A scalable compute architecture for learning and evaluating deep networks S Venkataramani, A Ranjan, S Banerjee, D Das, S Avancha, ... Proceedings of the 44th Annual International Symposium on Computer …, 2017	265	2017
Out-of-distribution detection using an ensemble of self supervised leave-out classifiers A Vyas, N Jammalamadaka, X Zhu, D Das, B Kaul, TL Willke Proceedings of the European conference on computer vision (ECCV), 550-564, 2018	259	2018
Reconfigurable interface-based electrical architecture D Das, VK Agrawal, S Rajappan US Patent 8,930,036, 2015	211	2015
Distributed deep learning using synchronous stochastic gradient descent D Das, S Avancha, D Mudigere, K Vaidynathan, S Sridharan, D Kalamkar, ... arXiv preprint arXiv:1602.06709, 2016	207	2016
Mixed precision training of convolutional neural networks using integer operations D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ... arXiv preprint arXiv:1802.00930, 2018	187	2018
Ternary neural networks with fine-grained quantization N Mellempudi, A Kundu, D Mudigere, D Das, B Kaul, P Dubey arXiv preprint arXiv:1705.01462, 2017	128	2017
Parallel efficient sparse matrix-matrix multiplication on multicore platforms MMA Patwary, NR Satish, N Sundaram, J Park, MJ Anderson, ... International Conference on High Performance Computing, 48-57, 2015	77	2015
Mixed precision training with 8-bit floating point N Mellempudi, S Srinivasan, D Das, B Kaul arXiv preprint arXiv:1905.12334, 2019	69	2019
Abstraction layers for scalable distributed machine learning DD Kalamkar, K Vaidyanathan, S Sridharan, D Das US Patent 11,094,029, 2021	66	2021
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading K Vaidyanathan, DD Kalamkar, K Pamnany, JR Hammond, P Balaji, ... Proceedings of the International Conference for High Performance Computing …, 2015	56	2015
Communication optimizations for distributed machine learning S Sridharan, K Vaidyanathan, D Das, C Sakthivel, ME Smorkalov US Patent 11,270,201, 2022	55	2022
Apparatuses, methods, and systems for neural networks S Venkataramani, D Das, A Ranjan, S Banerjee, S Avancha, ... US Patent App. 16/317,497, 2019	53	2019
Dynamic precision management for integer deep learning primitives N Mellempudi, D Mudigere, D Das, S Sridharan US Patent 10,643,297, 2020	47	2020
Hardware implemented point to point communication primitives for machine learning S Sridharan, K Vaidyanathan, D Das US Patent 11,488,008, 2022	46	2022
Optimized compute hardware for machine learning operations D Das, R Gramunt, M Smelyanskiy, J Corbal, D Mudigere, NK Mellempudi, ... US Patent 10,776,699, 2020	44	2020
X-mann: A crossbar based architecture for memory augmented neural networks A Ranjan, S Jain, JR Stevens, D Das, B Kaul, A Raghunathan Proceedings of the 56th Annual Design Automation Conference 2019, 1-6, 2019	41	2019
On scale-out deep learning training for cloud and hpc S Sridharan, K Vaidyanathan, D Kalamkar, D Das, ME Smorkalov, ... arXiv preprint arXiv:1801.08030, 2018	34	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors