Alborz Geramifard
Alborz Geramifard
Research Manager at Facebook
Verified email at fb.com - Homepage
TitleCited byYear
Dyna-style planning with linear function approximation and prioritized sweeping
RS Sutton, C Szepesvári, A Geramifard, MP Bowling
arXiv preprint arXiv:1206.3285, 2012
1232012
A tutorial on linear function approximators for dynamic programming and reinforcement learning
A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How
Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013
862013
Decentralized control of partially observable Markov decision processes
C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer
52nd IEEE Conference on Decision and Control, 2398-2405, 2013
772013
Incremental least-squares temporal difference learning
A Geramifard, M Bowling, RS Sutton
Proceedings of the National Conference on Artificial Intelligence 21 (1), 356, 2006
742006
Online Discovery of Feature Dependencies.
A Geramifard, F Doshi, J Redding, N Roy, JP How
ICML, 881-888, 2011
702011
Cooperative mission planning for multi-UAV teams
SS Ponda, LB Johnson, A Geramifard, JP How
Handbook of unmanned aerial vehicles 2, 1447-1490, 2015
532015
iLSTD: Eligibility traces and convergence analysis
A Geramifard, M Bowling, M Zinkevich, RS Sutton
Advances in Neural Information Processing Systems, 441-448, 2007
512007
On the design and use of a micro air vehicle to track and avoid adversaries
R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ...
The International Journal of Robotics Research 29 (5), 529-546, 2010
502010
UAV cooperative control with stochastic risk models
A Geramifard, J Redding, N Roy, JP How
Proceedings of the 2011 American Control Conference, 3393-3398, 2011
362011
An intelligent cooperative control architecture
J Redding, A Geramifard, A Undurti, HL Choi, JP How
Proceedings of the 2010 American control conference, 57-62, 2010
322010
Rlpy: a value-function-based reinforcement learning framework for education and research
A Geramifard, C Dann, RH Klein, W Dabney, JP How
MIT Press, 2015
292015
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery
NK Ure, A Geramifard, G Chowdhary, JP How
Joint European conference on machine learning and knowledge discovery in …, 2012
292012
Biased Cost Pathfinding.
A Geramifard, P Chubak, V Bulitko
AIIDE, 112-114, 2006
292006
Intelligent cooperative control architecture: a framework for performance improvement using safe learning
A Geramifard, J Redding, JP How
Journal of Intelligent & Robotic Systems 72 (1), 83-103, 2013
212013
Reinforcement learning with misspecified model classes
J Joseph, A Geramifard, JW Roberts, JP How, N Roy
2013 IEEE International Conference on Robotics and Automation, 939-946, 2013
212013
Handbook of Unmanned Aerial Vehicles, chapter Linear Flight Contol Techniques for Unmanned Aerial Vehicles
JP How, E Frazzoli, G Chowdhary
Springer, 2012
212012
RLPy: The Reinforcement Learning Library for Education and Research
A Geramifard, RH Klein, P Jonathan
192013
Practical reinforcement learning using representation learning and safe exploration for large scale Markov decision processes
A Geramifard
Massachusetts Institute of Technology, 2012
162012
Batch iFDD: A scalable matching pursuit algorithm for solving MDPs
A Geramifard, TJ Walsh, N Roy, J How
Proceedings of the 29th Annual Conference on Uncertainty in Artificial …, 2013
132013
Co-ordinated tracking and planning using air and ground vehicles
A Bachrach, A Garamifard, D Gurdan, R He, S Prentice, J Stumpf, N Roy
Experimental Robotics, 137-146, 2009
112009
The system can't perform the operation now. Try again later.
Articles 1–20