Alborz Geramifard

Cited by

	All	Since 2019
Citations	1923	1084
h-index	22	16
i10-index	36	25

Citations per year

Year	Citations
2007	12
2008	12
2009	18
2010	14
2011	46
2012	89
2013	85
2014	100
2015	108
2016	92
2017	104
2018	138
2019	138
2020	157
2021	229
2022	241
2023	253
2024	64

Co-authors

Jonathan P. How	Richard C. Maclaurin Professor of Aerospace Engineering, Massachusetts Institute of Technology	Verified email at mit.edu
Nicholas Roy	MIT	Verified email at csail.mit.edu
Satwik Kottur	Research Scientist, Facebook AI	Verified email at fb.com
Seungwhan Moon	Facebook, Carnegie Mellon University	Verified email at fb.com
Ahmad Beirami	Google Research	Verified email at google.com
Michael Bowling	University of Alberta	Verified email at ualberta.ca
Paul A Crook	Research Scientist, Meta Platforms, Inc.	Verified email at fb.com
Nazim Kemal Ure	Istanbul Technical University	Verified email at itu.edu.tr
Richard S. Sutton	Keen, Amii, and University of Alberta	Verified email at richsutton.com
Rajen Subba	Google	Verified email at google.com
Girish Chowdhary	Associate Professor	Verified email at illinois.edu
Chinnadhurai Sankar	Research Lead, SliceX AI | ex-Meta AI	Verified email at fb.com
Ankita De	Facebook	Verified email at fb.com
Thomas J. Walsh	Sony AI	Verified email at sony.com
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Babak Damavandi	Meta Reality Labs	Verified email at fb.com
David Whitney	Meta	Verified email at meta.com
Christoph Dann	Research Scientist, Google	Verified email at google.com
Stefanie Tellex	Brown University	Verified email at cs.brown.edu
Will Dabney	DeepMind	Verified email at google.com

Alborz Geramifard

Research Scientist Director at Meta

Verified email at meta.com

Reinforcement Learning, Conversational AI, Planning, Brain and Cognitive Sciences


Title	Authors	Venue	Cited by	Year
Dyna-style planning with linear function approximation and prioritized sweeping	RS Sutton, C Szepesvári, A Geramifard, MP Bowling	arXiv preprint arXiv:1206.3285, 2012	227	2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning	A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How	Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013	161	2013
Decentralized control of partially observable Markov decision processes	C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer	52nd IEEE Conference on Decision and Control, 2398-2405, 2013	147	2013
Cooperative mission planning for multi-UAV teams	SS Ponda, LB Johnson, A Geramifard, JP How	Handbook of unmanned aerial vehicles 2, 1447-1490, 2015	100	2015
Incremental least-squares temporal difference learning	A Geramifard, M Bowling, RS Sutton	Proceedings of the 21st national conference on Artificial intelligence …, 2006	91	2006
RLPy: a value-function-based reinforcement learning framework for education and research	A Geramifard, C Dann, RH Klein, W Dabney, JP How	J. Mach. Learn. Res. 16 (1), 1573-1578, 2015	86	2015
Online Discovery of Feature Dependencies	A Geramifard, F Doshi, J Redding, N Roy, JP How	ICML, 881-888, 2011	81	2011
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations	S Kottur, S Moon, A Geramifard, B Damavandi	arXiv preprint arXiv:2104.08667, 2021	75	2021
Situated and interactive multimodal conversations	S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ...	arXiv preprint arXiv:2006.01460, 2020	75	2020
Overview of the ninth dialog system technology challenge: Dstc9	C Gunasekara, S Kim, LF D'Haro, A Rastogi, YN Chen, M Eric, ...	arXiv preprint arXiv:2011.06486, 2020	69	2020
iLSTD: Eligibility traces and convergence analysis	A Geramifard, M Bowling, M Zinkevich, RS Sutton	Advances in Neural Information Processing Systems 19, 2006	62	2006
On the design and use of a micro air vehicle to track and avoid adversaries	R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ...	The International Journal of Robotics Research 29 (5), 529-546, 2010	54	2010
Intelligent cooperative control architecture: a framework for performance improvement using safe learning	A Geramifard, J Redding, JP How	Journal of Intelligent & Robotic Systems 72, 83-103, 2013	52	2013
Customized movie trailers	A Geramifard	US Patent App. 14/105,428, 2015	49	2015
Reinforcement learning with misspecified model classes	J Joseph, A Geramifard, JW Roberts, JP How, N Roy	2013 IEEE International Conference on Robotics and Automation, 939-946, 2013	46	2013
UAV cooperative control with stochastic risk models	A Geramifard, J Redding, N Roy, JP How	Proceedings of the 2011 american control conference, 3393-3398, 2011	46	2011
Biased cost pathfinding	A Geramifard, P Chubak, V Bulitko	Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006	41	2006
An intelligent cooperative control architecture	J Redding, A Geramifard, A Undurti, HL Choi, JP How	Proceedings of the 2010 American control conference, 57-62, 2010	37	2010
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery	NK Ure, A Geramifard, G Chowdhary, JP How	Machine Learning and Knowledge Discovery in Databases: European Conference …, 2012	32	2012
Annotation inconsistency and entity bias in MultiWOZ	K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar	arXiv preprint arXiv:2105.14150, 2021	29	2021
