OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar, Aurick Zhou, George Tucker, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 457

Showing 26-50 of 457 citing articles:

Recent advances in reinforcement learning-based autonomous driving behavior planning: A survey
Jingda Wu, Chao Huang, Hailong Huang, et al.
Transportation Research Part C Emerging Technologies (2024) Vol. 164, pp. 104654-104654
Closed Access | Times Cited: 9

Integrating reinforcement learning and large language models for crop production process management optimization and control through a new knowledge-based deep learning paradigm
Dong Chen, Yanbo Huang
Computers and Electronics in Agriculture (2025) Vol. 232, pp. 110028-110028
Closed Access | Times Cited: 1

What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar, Danfei Xu, Josiah Wong, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 52

Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang, Junning Liu, Yuzhuo Dai, et al.
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2022), pp. 4510-4520
Open Access | Times Cited: 37

DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan, Haoran Xu, Yue Zhang, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 4, pp. 4680-4688
Open Access | Times Cited: 31

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu, Justin Fu, George Tucker, et al.
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2023), pp. 7553-7560
Open Access | Times Cited: 19

SecBoost: Secrecy-Aware Deep Reinforcement Learning Based Energy-Efficient Scheme for 5G HetNets
Himanshu Sharma, Neeraj Kumar, Raj Kumar Tekchandani
IEEE Transactions on Mobile Computing (2023), pp. 1-15
Closed Access | Times Cited: 18

Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang, Xiaocong Chen, Dietmar Jannach, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023), pp. 1599-1608
Open Access | Times Cited: 17

Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook
Baihan Lin
Expert Systems with Applications (2023) Vol. 238, pp. 122254-122254
Open Access | Times Cited: 17

Offline DRL for Price-Based Demand Response: Learning From Suboptimal Data and Beyond
Tao Qian, Zeyu Liang, Chengcheng Shao, et al.
IEEE Transactions on Smart Grid (2024) Vol. 15, Iss. 5, pp. 4618-4635
Closed Access | Times Cited: 7

An optimal solutions-guided deep reinforcement learning approach for online energy storage control
Gaoyuan Xu, Jian Shi, Jiaman Wu, et al.
Applied Energy (2024) Vol. 361, pp. 122915-122915
Closed Access | Times Cited: 6

Safe chance constrained reinforcement learning for batch process control
Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del Rio‐Chanona, et al.
Computers & Chemical Engineering (2021) Vol. 157, pp. 107630-107630
Open Access | Times Cited: 33

d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33

Proximal policy optimization algorithm for dynamic pricing with online reviews
Chao Wu, Wenjie Bi, Haiying Liu
Expert Systems with Applications (2022) Vol. 213, pp. 119191-119191
Closed Access | Times Cited: 25

A statistical learning framework for spatial-temporal feature selection and application to air quality index forecasting
Zixi Zhao, Jinran Wu, Fengjing Cai, et al.
Ecological Indicators (2022) Vol. 144, pp. 109416-109416
Open Access | Times Cited: 23

Off-Policy Actor-critic for Recommender Systems
Minmin Chen, Can Xu, Vince Gatto, et al.
(2022), pp. 338-349
Open Access | Times Cited: 23

Robust path following on rivers using bootstrapped reinforcement learning
Niklas Paulig, Ostap Okhrin
Ocean Engineering (2024) Vol. 298, pp. 117207-117207
Open Access | Times Cited: 5

Real-world robot applications of foundation models: a review
Kento Kawaharazuka, Tatsuya Matsushima, Andrew Gambardella, et al.
Advanced Robotics (2024) Vol. 38, Iss. 18, pp. 1232-1254
Open Access | Times Cited: 5

COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
Avi Singh, Albert S. Yu, T. Jonathan Yang, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 33

Pessimistic Reward Models for Off-Policy Learning in Recommendation
Olivier Jeunen, Bart Goethals
(2021), pp. 63-74
Closed Access | Times Cited: 28

A Review of Uncertainty for Deep Reinforcement Learning
Owen Lockwood, Mei Si
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (2022) Vol. 18, Iss. 1, pp. 155-162
Open Access | Times Cited: 21

Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu, Xianyuan Zhan, Xiangyu Zhu
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 8, pp. 8753-8760
Open Access | Times Cited: 20

Bias in Reinforcement Learning: A Review in Healthcare Applications
Benjamin Smith, Anahita Khojandi, Rama K. Vasudevan
ACM Computing Surveys (2023) Vol. 56, Iss. 2, pp. 1-17
Closed Access | Times Cited: 12

Possibilities of reinforcement learning for nuclear power plants: Evidence on current applications and beyond
Aicheng Gong, Yangkun Chen, J.P. Zhang, et al.
Nuclear Engineering and Technology (2024) Vol. 56, Iss. 6, pp. 1959-1974
Open Access | Times Cited: 4

Evaluating differential pricing in e-commerce from the perspective of utility
Gaoyong Han, Zhiyong Feng, Shizhan Chen, et al.
Electronic Commerce Research and Applications (2024) Vol. 64, pp. 101373-101373
Closed Access | Times Cited: 4

Scroll to top