
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Phasic Policy Gradient
Karl Cobbe, Jacob Hilton, Oleg Klimov, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 49
Showing 1-25 of 49 citing articles:
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk, Amy Zhang, Edward Grefenstette, et al.
Journal of Artificial Intelligence Research (2023) Vol. 76, pp. 201-264
Open Access | Times Cited: 73
Autonomous Unmanned Aerial Vehicle navigation using Reinforcement Learning: A systematic review
Fadi AlMahamid, Katarina Grolinger
Engineering Applications of Artificial Intelligence (2022) Vol. 115, pp. 105321-105321
Open Access | Times Cited: 71
Learning to drive from a world on rails
Dian Chen, Vladlen Koltun, Philipp Krähenbühl
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2021)
Open Access | Times Cited: 61
A Survey of Generalisation in Deep Reinforcement Learning
Robert Kirk, Amy Zhang, Edward Grefenstette, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 58
Machine learning meets advanced robotic manipulation
Saeid Nahavandi, Roohallah Alizadehsani, Darius Nahavandi, et al.
Information Fusion (2024) Vol. 105, pp. 102221-102221
Open Access | Times Cited: 10
Fusion of Microgrid Control With Model-Free Reinforcement Learning: Review and Vision
Buxin She, Fangxing Li, Hantao Cui, et al.
IEEE Transactions on Smart Grid (2022) Vol. 14, Iss. 4, pp. 3232-3245
Open Access | Times Cited: 29
PIRLNav: Pretraining with Imitation and RL Finetuning for OBJECTNAV
Ram Ramrakhya, Dhruv Batra, Erik Wijmans, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 19
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid, Katarina Grolinger
(2021), pp. 1-7
Open Access | Times Cited: 40
A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images
Palaiahnakote Shivakumara, Ayan Banerjee, Umapada Pal, et al.
IEEE Transactions on Image Processing (2023) Vol. 32, pp. 3552-3566
Closed Access | Times Cited: 13
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu, Siyang Jiang, Seth Austin Harding, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 28
Buffer Awareness Neural Adaptive Video Streaming for Avoiding Extra Buffer Consumption
Tianchi Huang, Chao Zhou, Rui-Xiao Zhang, et al.
IEEE INFOCOM 2022 - IEEE Conference on Computer Communications (2023), pp. 1-10
Closed Access | Times Cited: 7
Policy ensemble gradient for continuous control problems in deep reinforcement learning
Guoqiang Liu, Gang Chen, Victoria Huang
Neurocomputing (2023) Vol. 548, pp. 126381-126381
Open Access | Times Cited: 5
RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning.
Jian Hu, Haibin Wu, Seth Austin Harding, et al.
(2021)
Closed Access | Times Cited: 10
Soft Contrastive Learning With Q-Irrelevance Abstraction for Reinforcement Learning
Minsong Liu, Luntong Li, Shuai Hao, et al.
IEEE Transactions on Cognitive and Developmental Systems (2022) Vol. 15, Iss. 3, pp. 1463-1473
Closed Access | Times Cited: 7
Scaling Scaling Laws with Board Games
Andrew Jones
arXiv (Cornell University) (2021)
Open Access | Times Cited: 8
Improving scalability of multi-agent reinforcement learning with parameters sharing
Ning Yang, Bo Ding, Peichang Shi, et al.
(2022), pp. 37-42
Closed Access | Times Cited: 6
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator
Hongyao Tang, Zhaopeng Meng, Jianye Hao, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 8, pp. 8441-8449
Open Access | Times Cited: 5
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL.
Bogdan Mazoure, Ahmed M. Ahmed, Patrick MacAlpine, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 6
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
Shanchuan Wan, Yujin Tang, Yingtao Tian, et al.
(2023), pp. 4289-4298
Open Access | Times Cited: 2
Muesli: Combining Improvements in Policy Optimization.
Matteo Hessel, Ivo Danihelka, Fabio Viola, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 5
Learning to Design and Construct Bridge without Blueprint
Yunfei Li, Tao Kong, Lei Li, et al.
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2021), pp. 2398-2405
Open Access | Times Cited: 5
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto, Ruslan Salakhutdinov
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 4
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li, Tao Kong, Lei Li, et al.
2022 International Conference on Robotics and Automation (ICRA) (2022), pp. 7469-7476
Open Access | Times Cited: 3
Analyzing and Overcoming Degradation in Warm-Start Reinforcement Learning
Benjamin Wexler, Elad Sarafian, Sarit Kraus
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2022), pp. 4048-4055
Closed Access | Times Cited: 3
A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems
Yuezhongyi Sun, Boyu Yang
PeerJ Computer Science (2024) Vol. 10, pp. e2161-e2161
Open Access