Michael Littman
Title
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
Cited by: 7946; Year: 1996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
Cited by: 4023; Year: 1998
Markov games as a framework for multi-agent reinforcement learning
ML Littman
Machine learning proceedings 1994, 157-163, 1994
Cited by: 2262; Year: 1994
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman
ACM Transactions on Information Systems (TOIS) 21 (4), 315-346, 2003
Cited by: 2030; Year: 2003
Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman
AAAI 5 (2005), 1541-1546, 2005
Cited by: 1868; Year: 2005
Packet routing in dynamically changing networks: A reinforcement learning approach
JA Boyan, ML Littman
Advances in neural information processing systems, 671-678, 1994
Cited by: 865; Year: 1994
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
AAAI 94, 1023-1028, 1994
Cited by: 812; Year: 1994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
Cited by: 794; Year: 1995
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
Cited by: 718; Year: 2000
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
Cited by: 701; Year: 2013
Interactions between learning and evolution
D Ackley, M Littman
Artificial life II 10, 487-509, 1991
Cited by: 674; Year: 1991
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
Cited by: 598; Year: 2013
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
AR Cassandra, ML Littman, NL Zhang
arXiv preprint arXiv:1302.1525, 2013
Cited by: 578; Year: 2013
Friend-or-foe Q-learning in general-sum games
ML Littman
ICML 1, 322-328, 2001
Cited by: 547; Year: 2001
Predictive representations of state
ML Littman, RS Sutton
Advances in neural information processing systems, 1555-1561, 2002
Cited by: 531; Year: 2002
Computerized cross-language document retrieval using latent semantic indexing
TK Landauer, ML Littman
US Patent 5,301,109, 1994
Cited by: 497; Year: 1994
Algorithms for sequential decision making
ML Littman
Brown University, 1996
Cited by: 493; Year: 1996
Unsupervised learning of semantic orientation from a hundred-billion-word corpus
PD Turney, ML Littman
arXiv preprint cs/0212012, 2002
Cited by: 406; Year: 2002
Value-function reinforcement learning in Markov games
ML Littman
Cognitive systems research 2 (1), 55-66, 2001
Cited by: 398; Year: 2001
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
Cited by: 397; Year: 2006