Safe exploration in continuous action spaces G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa arXiv preprint arXiv:1801.08757, 2018 | 309 | 2018 |

Finite Sample Analyses for TD(0) with Function Approximation G Dalal, B Szörényi, G Thoppe, S Mannor Association for the Advancement of Artificial Intelligence (AAAI) 2018, 2018 | 150 | 2018 |

Finite sample analysis of two-timescale stochastic approximation with applications to reinforcement learning G Dalal, B Szörényi, G Thoppe, S Mannor 31st Annual Conference on Learning Theory (COLT) 75, 1-35, 2018 | 95 | 2018 |

A tale of two-timescale reinforcement learning with the tightest finite-time bound G Dalal, B Szörényi, G Thoppe Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3701-3708, 2020 | 39 | 2020 |

Beyond the one step greedy approach in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of The 35th International Conference on Machine Learning (ICML 2018), 2018 | 36 | 2018 |

Anomaly Detection in Large Databases Using Behavioral Patterning H Mazzawi, G Dalal, D Rozenblat, L Ein-Dor, M Ninio, O Lavi 2017 IEEE 33rd International Conference on Data Engineering (ICDE 2017), 2017 | 34 | 2017 |

Chance-constrained outage scheduling using a machine learning proxy G Dalal, E Gilboa, S Mannor, L Wehenkel IEEE Transactions on Power Systems 34 (4), 2019 | 33 | 2019 |

Hierarchical Decision Making In Electricity Grid Management G Dalal, E Gilboa, S Mannor Proceedings of The 33rd International Conference on Machine Learning (ICML …, 2016 | 33 | 2016 |

Multiple-step greedy policies in approximate and online reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Advances in Neural Information Processing Systems (NIPS 2018), 5238-5247, 2018 | 29 | 2018 |

How to combine tree-search methods in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019) 33 …, 2019 | 28 | 2019 |

Supervised Learning for Optimal Power Flow as a Real-Time Proxy R Canyasse, G Dalal, S Mannor IEEE PES Innovative Smart Grid Technologies (ISGT 2017) 8, 2017 | 28 | 2017 |

Unit commitment using nearest neighbor as a short-term proxy G Dalal, E Gilboa, S Mannor, L Wehenkel 20th Power Systems Computation Conference (PSCC'18), 2018 | 19 | 2018 |

Reinforcement learning for the unit commitment problem G Dalal, S Mannor 2015 IEEE Eindhoven PowerTech, 1-6, 2015 | 19 | 2015 |

Reinforcement learning for datacenter congestion control C Tessler, Y Shpigelman, G Dalal, A Mandelbaum, D Haritan Kazakov, ... ACM SIGMETRICS Performance Evaluation Review 49 (2), 43-46, 2022 | 15 | 2022 |

Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning G Dalal, B Szörényi, G Thoppe, S Mannor arXiv preprint arXiv:1703.05376, 2017 | 11 | 2017 |

Acting in Delayed Environments with Non-Stationary Markov Policies E Derman, G Dalal, S Mannor International Conference on Learning Representations (ICLR), 2021 | 10 | 2021 |

The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems A Inci, E Bolotin, Y Fu, G Dalal, S Mannor, D Nellans, D Marculescu EMC2 (The Sixth Workshop on Energy Efficient Machine Learning and Cognitive …, 2020 | 10 | 2020 |

On covariate shift of latent confounders in imitation and reinforcement learning G Tennenholtz, A Hallak, G Dalal, S Mannor, G Chechik, U Shalit arXiv preprint arXiv:2110.06539, 2021 | 8 | 2021 |

Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment G Dalal, E Gilboa, S Mannor 19th Power Systems Computation Conference (PSCC'16), 2016 | 7 | 2016 |

Finite sample analysis for TD(0) with linear function approximation G Dalal, B Szörényi, G Thoppe, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2018), 2018 | 6 | 2018 |