OpenAlex Citation Counts

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto, Shixiang Gu
arXiv (Cornell University) (2021)
Open Access | Times Cited: 133

Showing 1-25 of 133 citing articles:

Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University) (2021)
Open Access | Times Cited: 107

Deep reinforcement learning based energy management strategies for electrified vehicles: Recent advances and perspectives
Hongwen He, Xiangfei Meng, Yong Wang, et al.
Renewable and Sustainable Energy Reviews (2023) Vol. 192, pp. 114248-114248
Closed Access | Times Cited: 42

Parameterized Decision-Making with Multi-Modality Perception for Autonomous Driving
Yuyang Xia, Shuncheng Liu, Quanlin Yu, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 4463-4476
Closed Access | Times Cited: 17

Offline Meta-Reinforcement Learning for Industrial Insertion
Tony Z. Zhao, Jianlan Luo, Oleg Sushkov, et al.
2022 International Conference on Robotics and Automation (ICRA) (2022), pp. 6386-6393
Open Access | Times Cited: 40

Offline Meta-Reinforcement Learning for Active Pantograph Control in High-Speed Railways
Hui Wang, Zhigang Liu, Guiyang Hu, et al.
IEEE Transactions on Industrial Informatics (2024) Vol. 20, Iss. 8, pp. 10669-10679
Closed Access | Times Cited: 9

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu, Justin Fu, George Tucker, et al.
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2023), pp. 7553-7560
Open Access | Times Cited: 19

d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33

Exploration and Regularization of the Latent Action Space in Recommendation
Shuchang Liu, Qingpeng Cai, Bowen Sun, et al.
Proceedings of the ACM Web Conference 2022 (2023)
Open Access | Times Cited: 14

A Review of Uncertainty for Deep Reinforcement Learning
Owen Lockwood, Mei Si
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (2022) Vol. 18, Iss. 1, pp. 155-162
Open Access | Times Cited: 21

Improving Sample Efficiency of Multiagent Reinforcement Learning With Nonexpert Policy for Flocking Control
Yunbo Qiu, Yue Jin, Lebin Yu, et al.
IEEE Internet of Things Journal (2023) Vol. 10, Iss. 16, pp. 14014-14027
Closed Access | Times Cited: 11

Multi-agent reinforcement learning-driven adaptive controller tuning system for autonomous control of wastewater treatment plants: An offline learning approach
KiJeon Nam, SungKu Heo, ChangKyoo Yoo
Journal of Water Process Engineering (2025) Vol. 70, pp. 107059-107059
Closed Access

Mild evaluation policy via dataset constraint for offline reinforcement learning
Xue Li, Xinghong Ling
Expert Systems with Applications (2025), pp. 126842-126842
Closed Access

ELAPSE: Expand Latent Action Projection Space for policy optimization in Offline Reinforcement Learning
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access

Deep imitative reinforcement learning with gradient conflict-free for decision-making in autonomous vehicles
Zitong Shan, Jian Zhao, Wenhui Huang, et al.
Transportation Research Part C Emerging Technologies (2025) Vol. 173, pp. 105047-105047
Closed Access

Conservative reward enhancement through the nearest neighbor integration in model-based Offline Policy Optimization
Xue Li, Bangjun Wang, Xinghong Ling
Expert Systems with Applications (2025), pp. 126888-126888
Closed Access

Coordinating ride-pooling with public transit using Reward-Guided Conservative Q-Learning: An offline training and online fine-tuning reinforcement learning framework
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access

Pessimistic policy iteration with bounded uncertainty
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Expert Systems with Applications (2025), pp. 127651-127651
Closed Access

Weighted Policy Constraints for Offline Reinforcement Learning
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang, Jie Liu, Chuming Li, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 15, pp. 16908-16916
Open Access | Times Cited: 3

The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3

ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu, Zhan Ling, Fanbo Xiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 21

Value Penalized Q-Learning for Recommender Systems
Chengqian Gao, Ke Xu, Kuangqi Zhou, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), pp. 2008-2012
Open Access | Times Cited: 14

Offline Deep Reinforcement Learning and Off-Policy Evaluation for Personalized Basal Insulin Control in Type 1 Diabetes
Taiyu Zhu, Kezhi Li, Pantelis Georgiou
IEEE Journal of Biomedical and Health Informatics (2023) Vol. 27, Iss. 10, pp. 5087-5098
Open Access | Times Cited: 8

Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8

A Survey of Demonstration Learning
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7

Page 1 - Next Page

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-functional	1 year	The cookie is set by the GDPR Cookie Consent plugin to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.

Requested Article:

Showing 1-25 of 133 citing articles:

Your Privacy