Safe exploration in continuous action spaces G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa arXiv preprint arXiv:1801.08757, 2018 | 73 | 2018 |

Finite Sample Analyses for TD (0) with Function Approximation G Dalal, B Szörényi, G Thoppe, S Mannor Association for the Advancement of Artificial Intelligence (AAAI) 2018, 2018 | 64 | 2018 |

Finite sample analysis of two-timescale stochastic approximation with applications to reinforcement learning G Dalal, B Szorenyi, G Thoppe, S Mannor 31st Annual Conference on Learning Theory (COLT) 75, 1-35, 2018 | 45 | 2018 |

Beyond the one step greedy approach in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of The 35th International Conference on Machine Learning (ICML 2018), 2018 | 19 | 2018 |

Hierarchical Decision Making In Electricity Grid Management G Dalal, E Gilboa, S Mannor Proceedings of The 33rd International Conference on Machine Learning (ICML …, 2016 | 17 | 2016 |

Anomaly Detection in Large Databases Using Behavioral Patterning H Mazzawi, G Dalal, D Rozenblat, L Ein-Dor, M Ninio, O Lavi 2017 IEEE 33rd International Conference on Data Engineering (ICDE 2017), 2017 | 14 | 2017 |

Multiple-step greedy policies in approximate and online reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Advances in Neural Information Processing Systems (NIPS 2018), 5238-5247, 2018 | 13 | 2018 |

Unit commitment using nearest neighbor as a short-term proxy G Dalal, E Gilboa, S Mannor, L Wehenkel 20th Power Systems Computation Conference (PSCC'18), 2018 | 11 | 2018 |

Supervised Learning for Optimal Power Flow as a Real-Time Proxy R Canyasse, G Dalal, S Mannor IEEE PES Innovative Smart Grid Technologies (ISGT 2017) 8, 2017 | 11 | 2017 |

Chance-constrained outage scheduling using a machine learning proxy G Dalal, E Gilboa, S Mannor, L Wehenkel IEEE Transactions on Power Systems 34 (4), 2019 | 10 | 2019 |

How to combine tree-search methods in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019) 33 …, 2019 | 9 | 2019 |

A tale of two-timescale reinforcement learning with the tightest finite-time bound G Dalal, B Szorenyi, G Thoppe Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3701-3708, 2020 | 8 | 2020 |

Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning G Dalal, B Szorenyi, G Thoppe, S Mannor arXiv preprint arXiv:1703.05376, 2017 | 8 | 2017 |

Finite sample analysis for TD (0) with linear function approximation G Dalal, B Szörényi, G Thoppe, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2018), 2018 | 7 | 2018 |

Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment G Dalal, E Gilboa, S Mannor 19th Power Systems Computation Conference (PSCC'16), 2016 | 7 | 2016 |

Reinforcement learning for the unit commitment problem G Dalal, S Mannor 2015 IEEE Eindhoven PowerTech, 1-6, 2015 | 5 | 2015 |

Multiple-step greedy policies in online and approximate reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor arXiv preprint arXiv:1805.07956, 2018 | 2 | 2018 |

Convergence of online and approximate multiple-step lookahead policy iteration Y Efroni, G Dalal, B Scherrer, S Mannor The 14th European Workshop on Reinforcement Learning (EWRL 2018), 2018 | | 2018 |