Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 5980 | 2023 |
Neural combinatorial optimization with reinforcement learning I Bello, H Pham, QV Le, M Norouzi, S Bengio arXiv preprint arXiv:1611.09940, 2016 | 1983 | 2016 |
Stand-alone self-attention in vision models P Ramachandran, N Parmar, A Vaswani, I Bello, A Levskaya, J Shlens Advances in neural information processing systems 32, 2019 | 1511* | 2019 |
Attention augmented convolutional networks I Bello, B Zoph, A Vaswani, J Shlens, QV Le Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 1387 | 2019 |
Neural optimizer search with reinforcement learning I Bello, B Zoph, V Vasudevan, QV Le International Conference on Machine Learning, 459-468, 2017 | 468 | 2017 |
Revisiting resnets: Improved training and scaling strategies I Bello, W Fedus, X Du, ED Cubuk, A Srinivas, TY Lin, J Shlens, B Zoph Advances in Neural Information Processing Systems 34, 22614-22627, 2021 | 358 | 2021 |
Lambdanetworks: Modeling long-range interactions without attention I Bello arXiv preprint arXiv:2102.08602, 2021 | 210 | 2021 |
St-moe: Designing stable and transferable sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906, 2022 | 138 | 2022 |
Designing effective sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906 2 (3), 17, 2022 | 93 | 2022 |
Seq2Slate: Re-ranking and slate optimization with RNNs I Bello, S Kulkarni, S Jain, C Boutilier, E Chi, E Eban, X Luo, A Mackey, ... arXiv preprint arXiv:1810.02019, 2018 | 89 | 2018 |
Global self-attention networks for image recognition Z Shen, I Bello, R Vemulapalli, X Jia, CH Chen arXiv preprint arXiv:2010.03019, 2020 | 40 | 2020 |
Revisiting 3d resnets for video recognition X Du, Y Li, Y Cui, R Qian, J Li, I Bello arXiv preprint arXiv:2109.01696, 2021 | 23 | 2021 |
Backprop evolution M Alber, I Bello, B Zoph, PJ Kindermans, P Ramachandran, Q Le arXiv preprint arXiv:1808.02822, 2018 | 19 | 2018 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent App. 17/145,524, 2021 | 6 | 2021 |
Systems and Methods for Slate Optimization with Recurrent Neural Networks OP Meshi, I Bello, S Kulkarni, S Jain US Patent App. 16/415,854, 2019 | 3 | 2019 |
Fully attentional computer vision J Shlens, AT Vaswani, NJ Parmar, P Ramachandran, AC Levskaya, ... US Patent App. 17/606,976, 2022 | 2 | 2022 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent 10,922,611, 2021 | 1 | 2021 |
Learning Control Policies from High-Dimensional Visual Inputs I Bello, Y Tkachenko Stanford CS231N, 2015 | 1 | 2015 |
Modeling Dependencies with Global Self-Attention Neural Networks Z Shen, R Vemulapalli, I Bello, X Jia, CH Chen US Patent App. 18/044,842, 2023 | | 2023 |
Modeling of Long-Range Interactions with Reduced Feature Materialization via Lambda Functions I Bello US Patent App. 18/011,636, 2023 | | 2023 |