OpenAlex Citation Counts

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng, Aviral Kumar, Grace Zhang, et al.
arXiv (Cornell University) (2019)
Open Access | Times Cited: 154

Showing 26-50 of 154 citing articles:

ELAPSE: Expand Latent Action Projection Space for policy optimization in Offline Reinforcement Learning
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access

Coordinating ride-pooling with public transit using Reward-Guided Conservative Q-Learning: An offline training and online fine-tuning reinforcement learning framework
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access

Balancing Engagement and Polarization: Multi-Objective Alignment of News Content Using LLMs
Mengjie Cheng, Elie Ofek, Hema Yoganarasimhan
(2025)
Closed Access

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach, Xinyang Geng, Sergey Levine, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 31

Structured World Models from Human Videos
Russell Mendonca, Shikhar Bahl, Deepak Pathak
(2023)
Open Access | Times Cited: 10

Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, et al.
(2023), pp. 5006-5012
Open Access | Times Cited: 9

Weighted Policy Constraints for Offline Reinforcement Learning
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9

Efficient Offline Reinforcement Learning With Relaxed Conservatism
Longyang Huang, Botao Dong, Weidong Zhang
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5260-5272
Closed Access | Times Cited: 3

Direct learning of improved control policies from historical plant data
Khalid Alhazmi, S. Mani Sarathy
Computers & Chemical Engineering (2024) Vol. 185, pp. 108662-108662
Closed Access | Times Cited: 3

The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3

Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng, Erwin Coumans, Tingnan Zhang, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 25

COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu, Aviral Kumar, Rafael Rafailov, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 21

Learning robotic navigation from experience: principles, methods and recent results
Sergey Levine, Dhruv Shah
Philosophical Transactions of the Royal Society B Biological Sciences (2022) Vol. 378, Iss. 1869
Open Access | Times Cited: 14

Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
arXiv (Cornell University) (2018)
Closed Access | Times Cited: 26

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen, Zijian Zhou, Zheng Wang, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 25

Learning to Reach Goals via Iterated Supervised Learning.
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
arXiv (Cornell University) (2019)
Closed Access | Times Cited: 23

Model-Based Offline Planning
Arthur Argenson, Gabriel Dulac-Arnold
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 21

Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang, Wenzhe Li, Haozhe Jiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 18

A Survey of Demonstration Learning
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, et al.
(2020), pp. 1-6
Open Access | Times Cited: 20

Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh, Abhishek Gupta, Ashwin Reddy, et al.
International Conference on Learning Representations (2021)
Closed Access | Times Cited: 16

Offline Reinforcement Learning as Anti-exploration
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 7, pp. 8106-8114
Open Access | Times Cited: 11

Stabilizing Diffusion Model for Robotic Control With Dynamic Programming and Transition Feasibility
Haoran Li, Yaocheng Zhang, Haowei Wen, et al.
IEEE Transactions on Artificial Intelligence (2024) Vol. 5, Iss. 9, pp. 4585-4594
Closed Access | Times Cited: 2

A survey of demonstration learning
André Correia, Luı́s A. Alexandre
Robotics and Autonomous Systems (2024), pp. 104812-104812
Open Access | Times Cited: 2

Previous Page - Page 2 - Next Page

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-functional	1 year	The cookie is set by the GDPR Cookie Consent plugin to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.

Requested Article:

Showing 26-50 of 154 citing articles:

Your Privacy