Łukasz Kaiser

Cited by

	All	Since 2019
Citations	178984	167793
h-index	52	49
i10-index	85	71

Citations per year

Year	Citations
2016	824
2017	2958
2018	6302
2019	11270
2020	18023
2021	27436
2022	38020
2023	53371
2024	19637

Public access

6 articles available
0 articles not available

Based on funding mandates

Co-authors

Jakob Uszkoreit; Inceptive; Verified email at uszkoreit.net
Noam Shazeer; Character.ai; Verified email at character.ai
Ashish Vaswani; Startup; Verified email at fastmail.com
Aidan Gomez; Cohere; Verified email at cohere.ai
Illia Polosukhin; NEAR.AI; Verified email at near.ai
Afroz Mohiuddin; Google Inc; Verified email at google.com
Oriol Vinyals; Research Scientist at Google DeepMind; Verified email at google.com
Samy Bengio; Senior Director, AI and Machine Learning Research, Apple; Verified email at apple.com
Henryk Michalewski; Google; Verified email at google.com
Ilya Sutskever; Co-Founder and Chief Scientist of OpenAI; Verified email at openai.com
Anselm Levskaya; Research Scientist, Google; Verified email at google.com
Stephan Gouws; Senior Research Scientist, Google DeepMind; Verified email at google.com
George Tucker; Google Brain; Verified email at google.com
Quoc V. Le; Research Scientist, Google; Verified email at stanford.edu
François Chollet; Google; Verified email at google.com
Ben D Goodrich; Google; Verified email at google.com
Piotr Kozakowski; University of Warsaw; Verified email at mimuw.edu.pl
Geoffrey Hinton; Emeritus Prof. Computer Science, University of Toronto; Verified email at cs.toronto.edu
Mohammad Saleh; Google Brain; Verified email at google.com
Étienne Pot; Google; Verified email at epfl.ch

Łukasz Kaiser

OpenAI & CNRS

Verified email at openai.com

Machine Learning & Logic in Computer Science


Title	Authors	Venue	Cited by	Year
Attention is all you need	A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...	Advances in neural information processing systems 30, 2017	117989	2017
TensorFlow: Large-scale machine learning on heterogeneous systems	M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...		24249*	2015
Google's neural machine translation system: Bridging the gap between human and machine translation	Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ...	arXiv preprint arXiv:1609.08144, 2016	8396	2016
Attention is all you need	A Waswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, A Gomez, ...	NIPS, 2017	3577*	2017
Reformer: The efficient transformer	N Kitaev, Ł Kaiser, A Levskaya	arXiv preprint arXiv:2001.04451, 2020	2253	2020
Evaluating large language models trained on code	M Chen, J Tworek, H Jun, Q Yuan, HPO Pinto, J Kaplan, H Edwards, ...	arXiv preprint arXiv:2107.03374, 2021	1982	2021
Image transformer	N Parmar, A Vaswani, J Uszkoreit, L Kaiser, N Shazeer, A Ku, D Tran	International conference on machine learning, 4055-4064, 2018	1810	2018
Rethinking attention with performers	K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...	arXiv preprint arXiv:2009.14794, 2020	1277	2020
Regularizing neural networks by penalizing confident output distributions	G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton	arXiv preprint arXiv:1701.06548, 2017	1193	2017
Grammar as a foreign language	O Vinyals, Ł Kaiser, T Koo, S Petrov, I Sutskever, G Hinton	Advances in neural information processing systems 28, 2015	1111	2015
Training verifiers to solve math word problems	K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ...	arXiv preprint arXiv:2110.14168, 2021	1078	2021
Attention is all you need. arXiv 2017	A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...	arXiv preprint arXiv:1706.03762 3762, 2023	1034	2023
Multi-task sequence to sequence learning	MT Luong, QV Le, I Sutskever, O Vinyals, L Kaiser	arXiv preprint arXiv:1511.06114, 2015	927	2015
Generating wikipedia by summarizing long sequences	PJ Liu, M Saleh, E Pot, B Goodrich, R Sepassi, L Kaiser, N Shazeer	arXiv preprint arXiv:1801.10198, 2018	910	2018
Universal transformers	M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser	arXiv preprint arXiv:1807.03819, 2018	888	2018
Model-based reinforcement learning for atari	L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...	arXiv preprint arXiv:1903.00374, 2019	875	2019
TensorFlow: Large-scale machine learning on heterogeneous systems, software available from tensorflow.org (2015)	M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...	URL https://www.tensorflow.org, 2015	834	2015
Gpt-4 technical report	J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...	arXiv preprint arXiv:2303.08774, 2023	804	2023
Tensor2tensor for neural machine translation	A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...	arXiv preprint arXiv:1803.07416, 2018	611	2018
Adding gradient noise improves learning for very deep networks	A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens	arXiv preprint arXiv:1511.06807, 2015	574	2015
