
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Critic Regularized Regression
Ziyu Wang, Alexander Novikov, Konrad Żołna, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 88
Ziyu Wang, Alexander Novikov, Konrad Żołna, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 88
Showing 1-25 of 88 citing articles:
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, et al.
Machine Learning (2021) Vol. 110, Iss. 9, pp. 2419-2468
Open Access | Times Cited: 320
Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, et al.
Machine Learning (2021) Vol. 110, Iss. 9, pp. 2419-2468
Open Access | Times Cited: 320
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto, Shixiang Gu
arXiv (Cornell University) (2021)
Open Access | Times Cited: 133
Scott Fujimoto, Shixiang Gu
arXiv (Cornell University) (2021)
Open Access | Times Cited: 133
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University) (2021)
Open Access | Times Cited: 107
Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University) (2021)
Open Access | Times Cited: 107
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair, Murtaza Dalal, Abhishek Gupta, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 62
Ashvin Nair, Murtaza Dalal, Abhishek Gupta, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 62
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad, Banghua Zhu, Cong Ma, et al.
IEEE Transactions on Information Theory (2022) Vol. 68, Iss. 12, pp. 8156-8196
Open Access | Times Cited: 40
Paria Rashidinejad, Banghua Zhu, Cong Ma, et al.
IEEE Transactions on Information Theory (2022) Vol. 68, Iss. 12, pp. 8156-8196
Open Access | Times Cited: 40
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation
Chongming Gao, Kexin Huang, Jiawei Chen, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023), pp. 238-248
Open Access | Times Cited: 25
Chongming Gao, Kexin Huang, Jiawei Chen, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023), pp. 238-248
Open Access | Times Cited: 25
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar, Danfei Xu, Josiah Wong, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 52
Ajay Mandlekar, Danfei Xu, Josiah Wong, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 52
Mouse visual cortex as a limited resource system that self-learns an ecologically-general representation
Aran Nayebi, Nathan C. L. Kong, Chengxu Zhuang, et al.
PLoS Computational Biology (2023) Vol. 19, Iss. 10, pp. e1011506-e1011506
Open Access | Times Cited: 21
Aran Nayebi, Nathan C. L. Kong, Chengxu Zhuang, et al.
PLoS Computational Biology (2023) Vol. 19, Iss. 10, pp. e1011506-e1011506
Open Access | Times Cited: 21
An empirical investigation of the challenges of real-world reinforcement learning.
Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 48
Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 48
Hyperparameter Selection for Offline Reinforcement Learning.
Tom Le Paine, Cosmin Păduraru, Andrea Michi, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 40
Tom Le Paine, Cosmin Păduraru, Andrea Michi, et al.
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 40
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33
Takuma Seno, Michita Imai
arXiv (Cornell University) (2021)
Open Access | Times Cited: 33
ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge
Sami Khairy, Gabriel Mittag, Vishak Gopal, et al.
(2024), pp. 339-345
Open Access | Times Cited: 4
Sami Khairy, Gabriel Mittag, Vishak Gopal, et al.
(2024), pp. 339-345
Open Access | Times Cited: 4
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, et al.
(2023), pp. 5006-5012
Open Access | Times Cited: 9
Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, et al.
(2023), pp. 5006-5012
Open Access | Times Cited: 9
Weighted Policy Constraints for Offline Reinforcement Learning
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
Zhiyong Peng, Changlin Han, Yadong Liu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 8, pp. 9435-9443
Open Access | Times Cited: 9
Benchmarks for Deep Off-Policy Evaluation
Justin Fu, Mohammad Norouzi, Ofir Nachum, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 22
Justin Fu, Mohammad Norouzi, Ofir Nachum, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 22
Learning robotic navigation from experience: principles, methods and recent results
Sergey Levine, Dhruv Shah
Philosophical Transactions of the Royal Society B Biological Sciences (2022) Vol. 378, Iss. 1869
Open Access | Times Cited: 14
Sergey Levine, Dhruv Shah
Philosophical Transactions of the Royal Society B Biological Sciences (2022) Vol. 378, Iss. 1869
Open Access | Times Cited: 14
Is Pessimism Provably Efficient for Offline RL
Ying Jin, Zhuoran Yang, Zhaoran Wang
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 23
Ying Jin, Zhuoran Yang, Zhaoran Wang
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 23
Model-Based Offline Planning
Arthur Argenson, Gabriel Dulac-Arnold
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 21
Arthur Argenson, Gabriel Dulac-Arnold
arXiv (Cornell University) (2020)
Closed Access | Times Cited: 21
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser, Thomas Hubert, Amol Mandhane, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 20
Julian Schrittwieser, Thomas Hubert, Amol Mandhane, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 20
Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment
Huihan Liu, Soroush Nasiriany, Lance Zhang, et al.
(2023)
Open Access | Times Cited: 7
Huihan Liu, Soroush Nasiriany, Lance Zhang, et al.
(2023)
Open Access | Times Cited: 7
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman, Carles Gelada, Marc G. Bellemare
(2020)
Closed Access | Times Cited: 19
Jacob Buckman, Carles Gelada, Marc G. Bellemare
(2020)
Closed Access | Times Cited: 19
Offline Reinforcement Learning as Anti-exploration
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 7, pp. 8106-8114
Open Access | Times Cited: 11
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2022) Vol. 36, Iss. 7, pp. 8106-8114
Open Access | Times Cited: 11
Personalization for web-based services using offline reinforcement learning
Pavlos Athanasios Apostolopoulos, Zehui Wang, Hanson Wang, et al.
Machine Learning (2024) Vol. 113, Iss. 5, pp. 3049-3071
Closed Access | Times Cited: 2
Pavlos Athanasios Apostolopoulos, Zehui Wang, Hanson Wang, et al.
Machine Learning (2024) Vol. 113, Iss. 5, pp. 3049-3071
Closed Access | Times Cited: 2
Accelerate online reinforcement learning for building HVAC control with heterogeneous expert guidances
Shichao Xu, Yangyang Fu, Yixuan Wang, et al.
(2022), pp. 89-98
Closed Access | Times Cited: 8
Shichao Xu, Yangyang Fu, Yixuan Wang, et al.
(2022), pp. 89-98
Closed Access | Times Cited: 8
A model of mood as integrated advantage
Daniel Bennett, Guy Davidson, Yael Niv
(2020)
Open Access | Times Cited: 12
Daniel Bennett, Guy Davidson, Yael Niv
(2020)
Open Access | Times Cited: 12