
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng, Aviral Kumar, Grace Zhang, et al.
arXiv (Cornell University) (2019)
Open Access | Times Cited: 154
Xue Bin Peng, Aviral Kumar, Grace Zhang, et al.
arXiv (Cornell University) (2019)
Open Access | Times Cited: 154
Showing 26-50 of 154 citing articles:
ELAPSE: Expand Latent Action Projection Space for policy optimization in Offline Reinforcement Learning
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access
Coordinating ride-pooling with public transit using Reward-Guided Conservative Q-Learning: An offline training and online fine-tuning reinforcement learning framework
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access
Balancing Engagement and Polarization: Multi-Objective Alignment of News Content Using LLMs
Mengjie Cheng, Elie Ofek, Hema Yoganarasimhan
(2025)
Closed Access
Mengjie Cheng, Elie Ofek, Hema Yoganarasimhan
(2025)
Closed Access
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach, Xinyang Geng, Sergey Levine, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 31
Benjamin Eysenbach, Xinyang Geng, Sergey Levine, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 31
Structured World Models from Human Videos
Russell Mendonca, Shikhar Bahl, Deepak Pathak
(2023)
Open Access | Times Cited: 10
Russell Mendonca, Shikhar Bahl, Deepak Pathak
(2023)
Open Access | Times Cited: 10
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, et al.
(2023), pp. 5006-5012
Open Access | Times Cited: 9
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, et al.
(2023), pp. 5006-5012
Open Access | Times Cited: 9
Weighted Policy Constraints for Offline Reinforcement Learning
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
Efficient Offline Reinforcement Learning With Relaxed Conservatism
Longyang Huang, Botao Dong, Weidong Zhang
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5260-5272
Closed Access | Times Cited: 3
Longyang Huang, Botao Dong, Weidong Zhang
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5260-5272
Closed Access | Times Cited: 3
Direct learning of improved control policies from historical plant data
Khalid Alhazmi, S. Mani Sarathy
Computers & Chemical Engineering (2024) Vol. 185, pp. 108662-108662
Closed Access | Times Cited: 3
Khalid Alhazmi, S. Mani Sarathy
Computers & Chemical Engineering (2024) Vol. 185, pp. 108662-108662
Closed Access | Times Cited: 3
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3
Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng, Erwin Coumans, Tingnan Zhang, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 25
Xue Bin Peng, Erwin Coumans, Tingnan Zhang, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 25
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu, Aviral Kumar, Rafael Rafailov, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 21
Tianhe Yu, Aviral Kumar, Rafael Rafailov, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 21
Learning robotic navigation from experience: principles, methods and recent results
Sergey Levine, Dhruv Shah
Philosophical Transactions of the Royal Society B Biological Sciences (2022) Vol. 378, Iss. 1869
Open Access | Times Cited: 14
Sergey Levine, Dhruv Shah
Philosophical Transactions of the Royal Society B Biological Sciences (2022) Vol. 378, Iss. 1869
Open Access | Times Cited: 14
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
arXiv (Cornell University) (2018)
Closed Access | Times Cited: 26
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
arXiv (Cornell University) (2018)
Closed Access | Times Cited: 26
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen, Zijian Zhou, Zheng Wang, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 25
Xinyue Chen, Zijian Zhou, Zheng Wang, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 25
Learning to Reach Goals via Iterated Supervised Learning.
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 23
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 23
Model-Based Offline Planning
Arthur Argenson, Gabriel Dulac-Arnold
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 21
Arthur Argenson, Gabriel Dulac-Arnold
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 21
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang, Wenzhe Li, Haozhe Jiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 18
Jianhao Wang, Wenzhe Li, Haozhe Jiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 18
A Survey of Demonstration Learning
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
(2020), pp. 1-6
Open Access | Times Cited: 20
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
(2020), pp. 1-6
Open Access | Times Cited: 20
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
International Conference on Learning Representations (2021)
Closed Access | Times Cited: 16
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
International Conference on Learning Representations (2021)
Closed Access | Times Cited: 16
Offline Reinforcement Learning as Anti-exploration
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 7, pp. 8106-8114
Open Access | Times Cited: 11
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 7, pp. 8106-8114
Open Access | Times Cited: 11
Stabilizing Diffusion Model for Robotic Control With Dynamic Programming and Transition Feasibility
Haoran Li, Yaocheng Zhang, Haowei Wen, et al.
IEEE Transactions on Artificial Intelligence (2024) Vol. 5, Iss. 9, pp. 4585-4594
Closed Access | Times Cited: 2
Haoran Li, Yaocheng Zhang, Haowei Wen, et al.
IEEE Transactions on Artificial Intelligence (2024) Vol. 5, Iss. 9, pp. 4585-4594
Closed Access | Times Cited: 2
A survey of demonstration learning
André Correia, Luı́s A. Alexandre
Robotics and Autonomous Systems (2024), pp. 104812-104812
Open Access | Times Cited: 2
André Correia, Luı́s A. Alexandre
Robotics and Autonomous Systems (2024), pp. 104812-104812
Open Access | Times Cited: 2