OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Pessimistic Reward Models for Off-Policy Learning in Recommendation
Olivier Jeunen, Bart Goethals
(2021), pp. 63-74
Closed Access | Times Cited: 28

Showing 1-25 of 28 citing articles:

CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System
Chongming Gao, Shiqi Wang, Shijun Li, et al.
ACM transactions on office information systems (2023) Vol. 42, Iss. 1, pp. 1-27
Open Access | Times Cited: 46

Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation
Chongming Gao, Kexin Huang, Jiawei Chen, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023), pp. 238-248
Open Access | Times Cited: 25

On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top- n Recommendation
Olivier Jeunen, Ivan Potapov, Aleksei Ustimenko
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2024), pp. 1222-1233
Closed Access | Times Cited: 7

Off-Policy Actor-critic for Recommender Systems
Minmin Chen, Can Xu, Vince Gatto, et al.
(2022), pp. 338-349
Open Access | Times Cited: 23

Multi-dimensional requirements for reinforcement recommendation reasoning
Yinggang Li, Xiangrong Tong, Zhongming Lv
Applied Intelligence (2025) Vol. 55, Iss. 6
Closed Access

Off-Policy Evaluation and Learning for the Future under Non-Stationarity
Tatsuhiro Shimizu, Kazuki Kawamura, Tatsumori MUROI, et al.
(2025), pp. 1256-1264
Closed Access

Ad-load Balancing via Off-policy Learning in a Content Marketplace
Hitesh Sagtani, Madan Gopal Jhawar, Rishabh Mehrotra, et al.
(2024), pp. 586-595
Open Access | Times Cited: 3

Pessimistic Decision-Making for Recommender Systems
Olivier Jeunen, Bart Goethals
ACM Transactions on Recommender Systems (2022) Vol. 1, Iss. 1, pp. 1-27
Open Access | Times Cited: 15

On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen, Siyu Wang, Julian McAuley, et al.
ACM transactions on office information systems (2024) Vol. 42, Iss. 6, pp. 1-26
Open Access | Times Cited: 2

Mitigating Exploitation Bias in Learning to Rank with an Uncertainty-aware Empirical Bayes Approach
Tao Yang, Cuize Han, Chen Luo, et al.
Proceedings of the ACM Web Conference 2022 (2024), pp. 1486-1496
Open Access | Times Cited: 2

Reinforcing Long-Term Performance in Recommender Systems with User-Oriented Exploration Policy
Changshuo Zhang, Sirui Chen, Xiao Zhang, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2024), pp. 1850-1860
Closed Access | Times Cited: 2

CONSEQUENCES — Causality, Counterfactuals and Sequential Decision-Making for Recommender Systems
Olivier Jeunen, Thorsten Joachims, Harrie Oosterhuis, et al.
(2022), pp. 654-657
Open Access | Times Cited: 8

CONSEQUENCES --- The 3rd Workshop on Causality, Counterfactuals and Sequential Decision-Making for Recommender Systems
Olivier Jeunen, Harrie Oosterhuis, Yuta Saito, et al.
(2024), pp. 1206-1209
Open Access | Times Cited: 1

Off-policy Learning over Heterogeneous Information for Recommendation
Xiangmeng Wang, Qian Li, Dianer Yu, et al.
Proceedings of the ACM Web Conference 2022 (2022), pp. 2348-2359
Closed Access | Times Cited: 7

MGPolicy
Xiangmeng Wang, Qian Li, Dianer Yu, et al.
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), pp. 1369-1378
Closed Access | Times Cited: 6

Embarrassingly shallow auto-encoders for dynamic collaborative filtering
Olivier Jeunen, Jan Van Balen, Bart Goethals
User Modeling and User-Adapted Interaction (2022) Vol. 32, Iss. 4, pp. 509-541
Closed Access | Times Cited: 5

CIPPO: Contrastive Imitation Proximal Policy Optimization for Recommendation Based on Reinforcement Learning
Weilong Chen, Shao‐Liang Zhang, Ruobing Xie, et al.
IEEE Transactions on Knowledge and Data Engineering (2024) Vol. 36, Iss. 11, pp. 5753-5767
Closed Access

Multi-Objective Recommendation via Multivariate Policy Learning
Olivier Jeunen, Jatin Mandav, Ivan Potapov, et al.
(2024), pp. 712-721
Open Access

Optimal Baseline Corrections for Off-Policy Contextual Bandits
Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, et al.
(2024), pp. 722-732
Open Access

Δ-OPE: Off-Policy Estimation with Pairs of Policies
Olivier Jeunen, Aleksei Ustimenko
(2024), pp. 878-883
Open Access

Pessimistic Off-Policy Optimization for Learning to Rank
Matej Cief, Branislav Kveton, Michal Kompan
Frontiers in artificial intelligence and applications (2024)
Open Access

ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang, Ruihong Qiu, Jiajun Liu, et al.
(2024), pp. 3269-3278
Closed Access

CONSEQUENCES — The 2nd Workshop on Causality, Counterfactuals and Sequential Decision-Making for Recommender Systems
Olivier Jeunen, Thorsten Joachims, Harrie Oosterhuis, et al.
(2023), pp. 1223-1226
Open Access | Times Cited: 1

Reward innovation for long-term member satisfaction
Gary Tang, Jiangwei Pan, H. Wang, et al.
(2023), pp. 396-399
Closed Access | Times Cited: 1

Page 1 - Next Page

Scroll to top