
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto, Christian Scheller, Ville Hautamäki
2021 IEEE Conference on Games (CoG) (2020), pp. 479-486
Open Access | Times Cited: 71
Anssi Kanervisto, Christian Scheller, Ville Hautamäki
2021 IEEE Conference on Games (CoG) (2020), pp. 479-486
Open Access | Times Cited: 71
Showing 1-25 of 71 citing articles:
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng, Ruitong Zhang, Yingtong Dou, et al.
ACM transactions on office information systems (2021) Vol. 40, Iss. 4, pp. 1-46
Open Access | Times Cited: 110
Hao Peng, Ruitong Zhang, Yingtong Dou, et al.
ACM transactions on office information systems (2021) Vol. 40, Iss. 4, pp. 1-46
Open Access | Times Cited: 110
World and Human Action Models towards gameplay ideation
Anssi Kanervisto, Dave Bignell, Linda Yilin Wen, et al.
Nature (2025) Vol. 638, Iss. 8051, pp. 656-663
Open Access | Times Cited: 2
Anssi Kanervisto, Dave Bignell, Linda Yilin Wen, et al.
Nature (2025) Vol. 638, Iss. 8051, pp. 656-663
Open Access | Times Cited: 2
Reinforcement learning based task offloading of IoT applications in fog computing: algorithms and optimization techniques
Takwa Allaoui, Kaouther Gasmi, Tahar Ezzedine
Cluster Computing (2024) Vol. 27, Iss. 8, pp. 10299-10324
Closed Access | Times Cited: 9
Takwa Allaoui, Kaouther Gasmi, Tahar Ezzedine
Cluster Computing (2024) Vol. 27, Iss. 8, pp. 10299-10324
Closed Access | Times Cited: 9
DRSIR: A Deep Reinforcement Learning Approach for Routing in Software-Defined Networking
Daniela M. Casas-Velasco, Oscar Maurício Caicedo Rendón, Nelson L. S. da Fonseca
IEEE Transactions on Network and Service Management (2021) Vol. 19, Iss. 4, pp. 4807-4820
Open Access | Times Cited: 44
Daniela M. Casas-Velasco, Oscar Maurício Caicedo Rendón, Nelson L. S. da Fonseca
IEEE Transactions on Network and Service Management (2021) Vol. 19, Iss. 4, pp. 4807-4820
Open Access | Times Cited: 44
Review of Deep Reinforcement Learning Approaches for Conflict Resolution in Air Traffic Control
Zhuang Wang, Weijun Pan, Hui Li, et al.
Aerospace (2022) Vol. 9, Iss. 6, pp. 294-294
Open Access | Times Cited: 32
Zhuang Wang, Weijun Pan, Hui Li, et al.
Aerospace (2022) Vol. 9, Iss. 6, pp. 294-294
Open Access | Times Cited: 32
A deep reinforcement learning based hyper-heuristic for modular production control
Marcel Panzer, Benedict Bender, Norbert Gronau
International Journal of Production Research (2023) Vol. 62, Iss. 8, pp. 2747-2768
Open Access | Times Cited: 15
Marcel Panzer, Benedict Bender, Norbert Gronau
International Journal of Production Research (2023) Vol. 62, Iss. 8, pp. 2747-2768
Open Access | Times Cited: 15
Automating DBSCAN via Deep Reinforcement Learning
Ruitong Zhang, Hao Peng, Yingtong Dou, et al.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (2022), pp. 2620-2630
Open Access | Times Cited: 18
Ruitong Zhang, Hao Peng, Yingtong Dou, et al.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (2022), pp. 2620-2630
Open Access | Times Cited: 18
Sicgan-Driven Sim-to-Real Transfer: Zero-Shot Deployment on Robotic Manipulators Through Visual Deception
Lucía Güitta-López, Lionel Güitta-López, Jaime Boal Martín-Larrauri, et al.
(2025)
Closed Access
Lucía Güitta-López, Lionel Güitta-López, Jaime Boal Martín-Larrauri, et al.
(2025)
Closed Access
Deep reinforcement learning approach for real-time airport gate assignment
Haonan Li, Xu Wu, Marta Ribeiro, et al.
Operations Research Perspectives (2025), pp. 100338-100338
Open Access
Haonan Li, Xu Wu, Marta Ribeiro, et al.
Operations Research Perspectives (2025), pp. 100338-100338
Open Access
Wide-Range Variable Cycle Engine Control Based on Deep Reinforcement Learning
Yaoyao Ding, Fengming Wang, Yunfei Mu, et al.
Aerospace (2025) Vol. 12, Iss. 5, pp. 424-424
Open Access
Yaoyao Ding, Fengming Wang, Yunfei Mu, et al.
Aerospace (2025) Vol. 12, Iss. 5, pp. 424-424
Open Access
Toward Learning Human-Like, Safe and Comfortable Car-Following Policies With a Novel Deep Reinforcement Learning Approach
M. Ugur Yavas, Tufan Kumbasar, Nazım Kemal Üre
IEEE Access (2023) Vol. 11, pp. 16843-16854
Open Access | Times Cited: 9
M. Ugur Yavas, Tufan Kumbasar, Nazım Kemal Üre
IEEE Access (2023) Vol. 11, pp. 16843-16854
Open Access | Times Cited: 9
Reinforcement-Learning-Based Routing and Resource Management for Internet of Things Environments: Theoretical Perspective and Challenges
Arslan Musaddiq, Tobias Olsson, Fredrik Ahlgren
Sensors (2023) Vol. 23, Iss. 19, pp. 8263-8263
Open Access | Times Cited: 9
Arslan Musaddiq, Tobias Olsson, Fredrik Ahlgren
Sensors (2023) Vol. 23, Iss. 19, pp. 8263-8263
Open Access | Times Cited: 9
Designing an adaptive and deep learning based control framework for modular production systems
Marcel Panzer, Norbert Gronau
Journal of Intelligent Manufacturing (2023) Vol. 35, Iss. 8, pp. 4113-4136
Open Access | Times Cited: 9
Marcel Panzer, Norbert Gronau
Journal of Intelligent Manufacturing (2023) Vol. 35, Iss. 8, pp. 4113-4136
Open Access | Times Cited: 9
Learning State-Specific Action Masks for Reinforcement Learning
Ziyi Wang, Xinran Li, Luoyang Sun, et al.
Algorithms (2024) Vol. 17, Iss. 2, pp. 60-60
Open Access | Times Cited: 3
Ziyi Wang, Xinran Li, Luoyang Sun, et al.
Algorithms (2024) Vol. 17, Iss. 2, pp. 60-60
Open Access | Times Cited: 3
Joint Band Assignment and Beam Management Using Hierarchical Reinforcement Learning for Multi-Band Communication
Do-Hyun Kim, Miguel R. Castellanos, Robert W. Heath
IEEE Transactions on Vehicular Technology (2024) Vol. 73, Iss. 9, pp. 13451-13465
Open Access | Times Cited: 3
Do-Hyun Kim, Miguel R. Castellanos, Robert W. Heath
IEEE Transactions on Vehicular Technology (2024) Vol. 73, Iss. 9, pp. 13451-13465
Open Access | Times Cited: 3
Electric vehicle charging design: The factored action based reinforcement learning approach
Van Binh Truong, Long Bao Le
Applied Energy (2024) Vol. 359, pp. 122737-122737
Closed Access | Times Cited: 2
Van Binh Truong, Long Bao Le
Applied Energy (2024) Vol. 359, pp. 122737-122737
Closed Access | Times Cited: 2
CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection
Quanfu Fan, Yilai Li, Yuguang Yao, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 7877-7887
Open Access | Times Cited: 2
Quanfu Fan, Yilai Li, Yuguang Yao, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 7877-7887
Open Access | Times Cited: 2
Mean-Field-Aided Multiagent Reinforcement Learning for Resource Allocation in Vehicular Networks
Hengxi Zhang, Chengyue Lu, Huaze Tang, et al.
IEEE Internet of Things Journal (2022) Vol. 10, Iss. 3, pp. 2667-2679
Closed Access | Times Cited: 11
Hengxi Zhang, Chengyue Lu, Huaze Tang, et al.
IEEE Internet of Things Journal (2022) Vol. 10, Iss. 3, pp. 2667-2679
Closed Access | Times Cited: 11
A sequential multi-agent reinforcement learning framework for different action spaces
Shucong Tian, Meng Yang, Rongling Xiong, et al.
Expert Systems with Applications (2024) Vol. 258, pp. 125138-125138
Closed Access | Times Cited: 2
Shucong Tian, Meng Yang, Rongling Xiong, et al.
Expert Systems with Applications (2024) Vol. 258, pp. 125138-125138
Closed Access | Times Cited: 2
Reinforcement Learning vs. Computational Intelligence: Comparing Service Management Approaches for the Cloud Continuum
Filippo Poltronieri, Cesare Stefanelli, Mauro Tortonesi, et al.
Future Internet (2023) Vol. 15, Iss. 11, pp. 359-359
Open Access | Times Cited: 5
Filippo Poltronieri, Cesare Stefanelli, Mauro Tortonesi, et al.
Future Internet (2023) Vol. 15, Iss. 11, pp. 359-359
Open Access | Times Cited: 5
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Müller, Matthia Sabatelli
2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC) (2022), pp. 951-958
Open Access | Times Cited: 8
Arthur Müller, Matthia Sabatelli
2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC) (2022), pp. 951-958
Open Access | Times Cited: 8
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Stephanie Milani, Arthur Juliani, Ida Momennejad, et al.
(2023), pp. 1-18
Open Access | Times Cited: 4
Stephanie Milani, Arthur Juliani, Ida Momennejad, et al.
(2023), pp. 1-18
Open Access | Times Cited: 4
Exploring the Use of Invalid Action Masking in Reinforcement Learning: A Comparative Study of On-Policy and Off-Policy Algorithms in Real-Time Strategy Games
Yueqi Hou, Xiaolong Liang, Jiaqiang Zhang, et al.
Applied Sciences (2023) Vol. 13, Iss. 14, pp. 8283-8283
Open Access | Times Cited: 4
Yueqi Hou, Xiaolong Liang, Jiaqiang Zhang, et al.
Applied Sciences (2023) Vol. 13, Iss. 14, pp. 8283-8283
Open Access | Times Cited: 4
Context-aware composition of agent policies by Markov decision process entity embeddings and agent ensembles
Nicole Merkle, Ralf Mikut
Semantic Web (2024) Vol. 15, Iss. 4, pp. 1443-1471
Open Access | Times Cited: 1
Nicole Merkle, Ralf Mikut
Semantic Web (2024) Vol. 15, Iss. 4, pp. 1443-1471
Open Access | Times Cited: 1
Reinforcement Learning for Two-Stage Permutation Flow Shop Scheduling—A Real-World Application in Household Appliance Production
Arthur Müller, Felix Grumbach, Fiona Kattenstroth
IEEE Access (2024) Vol. 12, pp. 11388-11399
Open Access | Times Cited: 1
Arthur Müller, Felix Grumbach, Fiona Kattenstroth
IEEE Access (2024) Vol. 12, pp. 11388-11399
Open Access | Times Cited: 1