
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto, Shixiang Gu
arXiv (Cornell University) (2021)
Open Access | Times Cited: 133
Scott Fujimoto, Shixiang Gu
arXiv (Cornell University) (2021)
Open Access | Times Cited: 133
Showing 1-25 of 133 citing articles:
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University) (2021)
Open Access | Times Cited: 107
Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University) (2021)
Open Access | Times Cited: 107
Deep reinforcement learning based energy management strategies for electrified vehicles: Recent advances and perspectives
Hongwen He, Xiangfei Meng, Yong Wang, et al.
Renewable and Sustainable Energy Reviews (2023) Vol. 192, pp. 114248-114248
Closed Access | Times Cited: 42
Hongwen He, Xiangfei Meng, Yong Wang, et al.
Renewable and Sustainable Energy Reviews (2023) Vol. 192, pp. 114248-114248
Closed Access | Times Cited: 42
Parameterized Decision-Making with Multi-Modality Perception for Autonomous Driving
Yuyang Xia, Shuncheng Liu, Quanlin Yu, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 4463-4476
Closed Access | Times Cited: 17
Yuyang Xia, Shuncheng Liu, Quanlin Yu, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 4463-4476
Closed Access | Times Cited: 17
Offline Meta-Reinforcement Learning for Industrial Insertion
Tony Z. Zhao, Jianlan Luo, Oleg Sushkov, et al.
2022 International Conference on Robotics and Automation (ICRA) (2022), pp. 6386-6393
Open Access | Times Cited: 40
Tony Z. Zhao, Jianlan Luo, Oleg Sushkov, et al.
2022 International Conference on Robotics and Automation (ICRA) (2022), pp. 6386-6393
Open Access | Times Cited: 40
Offline Meta-Reinforcement Learning for Active Pantograph Control in High-Speed Railways
Hui Wang, Zhigang Liu, Guiyang Hu, et al.
IEEE Transactions on Industrial Informatics (2024) Vol. 20, Iss. 8, pp. 10669-10679
Closed Access | Times Cited: 9
Hui Wang, Zhigang Liu, Guiyang Hu, et al.
IEEE Transactions on Industrial Informatics (2024) Vol. 20, Iss. 8, pp. 10669-10679
Closed Access | Times Cited: 9
Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu, Justin Fu, George Tucker, et al.
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2023), pp. 7553-7560
Open Access | Times Cited: 19
Yiren Lu, Justin Fu, George Tucker, et al.
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2023), pp. 7553-7560
Open Access | Times Cited: 19
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33
Exploration and Regularization of the Latent Action Space in Recommendation
Shuchang Liu, Qingpeng Cai, Bowen Sun, et al.
Proceedings of the ACM Web Conference 2022 (2023)
Open Access | Times Cited: 14
Shuchang Liu, Qingpeng Cai, Bowen Sun, et al.
Proceedings of the ACM Web Conference 2022 (2023)
Open Access | Times Cited: 14
A Review of Uncertainty for Deep Reinforcement Learning
Owen Lockwood, Mei Si
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (2022) Vol. 18, Iss. 1, pp. 155-162
Open Access | Times Cited: 21
Owen Lockwood, Mei Si
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (2022) Vol. 18, Iss. 1, pp. 155-162
Open Access | Times Cited: 21
Improving Sample Efficiency of Multiagent Reinforcement Learning With Nonexpert Policy for Flocking Control
Yunbo Qiu, Yue Jin, Lebin Yu, et al.
IEEE Internet of Things Journal (2023) Vol. 10, Iss. 16, pp. 14014-14027
Closed Access | Times Cited: 11
Yunbo Qiu, Yue Jin, Lebin Yu, et al.
IEEE Internet of Things Journal (2023) Vol. 10, Iss. 16, pp. 14014-14027
Closed Access | Times Cited: 11
Multi-agent reinforcement learning-driven adaptive controller tuning system for autonomous control of wastewater treatment plants: An offline learning approach
KiJeon Nam, SungKu Heo, ChangKyoo Yoo
Journal of Water Process Engineering (2025) Vol. 70, pp. 107059-107059
Closed Access
KiJeon Nam, SungKu Heo, ChangKyoo Yoo
Journal of Water Process Engineering (2025) Vol. 70, pp. 107059-107059
Closed Access
Mild evaluation policy via dataset constraint for offline reinforcement learning
Xue Li, Xinghong Ling
Expert Systems with Applications (2025), pp. 126842-126842
Closed Access
Xue Li, Xinghong Ling
Expert Systems with Applications (2025), pp. 126842-126842
Closed Access
ELAPSE: Expand Latent Action Projection Space for policy optimization in Offline Reinforcement Learning
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access
Xinchen Han, Hossam Afifi, Michel Marot
Neurocomputing (2025), pp. 129665-129665
Closed Access
Deep imitative reinforcement learning with gradient conflict-free for decision-making in autonomous vehicles
Zitong Shan, Jian Zhao, Wenhui Huang, et al.
Transportation Research Part C Emerging Technologies (2025) Vol. 173, pp. 105047-105047
Closed Access
Zitong Shan, Jian Zhao, Wenhui Huang, et al.
Transportation Research Part C Emerging Technologies (2025) Vol. 173, pp. 105047-105047
Closed Access
Conservative reward enhancement through the nearest neighbor integration in model-based Offline Policy Optimization
Xue Li, Bangjun Wang, Xinghong Ling
Expert Systems with Applications (2025), pp. 126888-126888
Closed Access
Xue Li, Bangjun Wang, Xinghong Ling
Expert Systems with Applications (2025), pp. 126888-126888
Closed Access
Coordinating ride-pooling with public transit using Reward-Guided Conservative Q-Learning: An offline training and online fine-tuning reinforcement learning framework
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access
Yulong Hu, Tingting Dong, Sen Li
Transportation Research Part C Emerging Technologies (2025) Vol. 174, pp. 105051-105051
Closed Access
Pessimistic policy iteration with bounded uncertainty
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Expert Systems with Applications (2025), pp. 127651-127651
Closed Access
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Expert Systems with Applications (2025), pp. 127651-127651
Closed Access
Weighted Policy Constraints for Offline Reinforcement Learning
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang, Jie Liu, Chuming Li, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 15, pp. 16908-16916
Open Access | Times Cited: 3
Yinmin Zhang, Jie Liu, Chuming Li, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 15, pp. 16908-16916
Open Access | Times Cited: 3
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3
Moschoula Pternea, Prerna Singh, Abir Chakraborty, et al.
Journal of Artificial Intelligence Research (2024) Vol. 80, pp. 1525-1573
Open Access | Times Cited: 3
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu, Zhan Ling, Fanbo Xiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 21
Tongzhou Mu, Zhan Ling, Fanbo Xiang, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 21
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao, Ke Xu, Kuangqi Zhou, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), pp. 2008-2012
Open Access | Times Cited: 14
Chengqian Gao, Ke Xu, Kuangqi Zhou, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), pp. 2008-2012
Open Access | Times Cited: 14
Offline Deep Reinforcement Learning and Off-Policy Evaluation for Personalized Basal Insulin Control in Type 1 Diabetes
Taiyu Zhu, Kezhi Li, Pantelis Georgiou
IEEE Journal of Biomedical and Health Informatics (2023) Vol. 27, Iss. 10, pp. 5087-5098
Open Access | Times Cited: 8
Taiyu Zhu, Kezhi Li, Pantelis Georgiou
IEEE Journal of Biomedical and Health Informatics (2023) Vol. 27, Iss. 10, pp. 5087-5098
Open Access | Times Cited: 8
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8
Jiechuan Jiang, Zongqing Lu
Frontiers in artificial intelligence and applications (2023)
Open Access | Times Cited: 8
A Survey of Demonstration Learning
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7
André Correia, Luı́s A. Alexandre
(2023)
Open Access | Times Cited: 7