Philipp Moritz
Title
Cited by
Year
Trust region policy optimization
J Schulman, S Levine, P Abbeel, M Jordan, P Moritz
International conference on machine learning, 1889-1897, 2015
Cited by: 5167; Year: 2015
High-dimensional continuous control using generalized advantage estimation
J Schulman, P Moritz, S Levine, M Jordan, P Abbeel
arXiv preprint arXiv:1506.02438, 2015
Cited by: 2064; Year: 2015
Ray: A distributed framework for emerging AI applications
P Moritz, R Nishihara, S Wang, A Tumanov, R Liaw, E Liang, M Elibol, ...
13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018
Cited by: 647; Year: 2018
RLlib: Abstractions for Distributed Reinforcement Learning
E Liang, R Liaw, P Moritz, R Nishihara, R Fox, K Goldberg, J Gonzalez, ...
International Conference on Machine Learning, 3059-3068, 2018
Cited by: 424; Year: 2018
Tune: A research platform for distributed model selection and training
R Liaw, E Liang, R Nishihara, P Moritz, JE Gonzalez, I Stoica
arXiv preprint arXiv:1807.05118, 2018
Cited by: 370; Year: 2018
A linearly-convergent stochastic L-BFGS algorithm
P Moritz, R Nishihara, M Jordan
Artificial Intelligence and Statistics, 249-258, 2016
Cited by: 221; Year: 2016
Sparknet: Training deep networks in spark
P Moritz, R Nishihara, I Stoica, MI Jordan
arXiv preprint arXiv:1511.06051, 2015
Cited by: 196; Year: 2015
Ray rllib: A composable and scalable reinforcement learning library
E Liang, R Liaw, R Nishihara, P Moritz, R Fox, J Gonzalez, K Goldberg, ...
arXiv preprint arXiv:1712.09381, 85, 2017
Cited by: 108; Year: 2017
Real-time machine learning: The missing pieces
R Nishihara, P Moritz, S Wang, A Tumanov, W Paul, J Schleier-Smith, ...
Proceedings of the 16th Workshop on Hot Topics in Operating Systems, 106-110, 2017
Cited by: 58; Year: 2017
Policy gradient search: Online planning and expert iteration without search trees
T Anthony, R Nishihara, P Moritz, T Salimans, J Schulman
arXiv preprint arXiv:1904.03646, 2019
Cited by: 27; Year: 2019
Lineage stash: fault tolerance off the critical path
S Wang, J Liagouris, R Nishihara, P Moritz, U Misra, A Tumanov, I Stoica
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 338-352, 2019
Cited by: 26; Year: 2019
Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 641-656, 2021
Cited by: 6; Year: 2021
Trust Region Policy Optimization (TRPO)
J Schulman, S Levine, P Moritz, MI Jordan, P Abbeel
Cited by: 3; Year: 2020
Ray: A Distributed Execution Engine for the Machine Learning Ecosystem
PC Moritz
UC Berkeley, 2019
Cited by: 1; Year: 2019
Flexible Primitives for Distributed Deep Learning in Ray
Y Bulatov, R Nishihara, P Moritz, M Elibol, I Stoica, MI Jordan
SysML Conference, 2018
Cited by: 1; Year: 2018
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
Year: 2020