
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Deep learning for video classification and captioning
Zuxuan Wu, Ting Yao, Yanwei Fu, et al.
(2017), pp. 3-29
Open Access | Times Cited: 112
Zuxuan Wu, Ting Yao, Yanwei Fu, et al.
(2017), pp. 3-29
Open Access | Times Cited: 112
Showing 1-25 of 112 citing articles:
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Wenhao Wu, Haipeng Luo, Bo Fang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10704-10713
Open Access | Times Cited: 48
Wenhao Wu, Haipeng Luo, Bo Fang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10704-10713
Open Access | Times Cited: 48
Query - Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon, Sangeek Hyun, SangUk Park, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23023-23033
Open Access | Times Cited: 48
WonJun Moon, Sangeek Hyun, SangUk Park, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23023-23033
Open Access | Times Cited: 48
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu, Xinyu Li, Chunhui Liu, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 106
Yi Zhu, Xinyu Li, Chunhui Liu, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 106
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
Bin Ren, Mengyuan Liu, Runwei Ding, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 77
Bin Ren, Mengyuan Liu, Runwei Ding, et al.
arXiv (Cornell University) (2020)
Open Access | Times Cited: 77
Video description: A comprehensive survey of deep learning approaches
Ghazala Rafiq, Muhammad Rafiq, Gyu Sang Choi
Artificial Intelligence Review (2023) Vol. 56, Iss. 11, pp. 13293-13372
Open Access | Times Cited: 26
Ghazala Rafiq, Muhammad Rafiq, Gyu Sang Choi
Artificial Intelligence Review (2023) Vol. 56, Iss. 11, pp. 13293-13372
Open Access | Times Cited: 26
A Multimodal Misinformation Detector for COVID-19 Short Videos on TikTok
Lanyu Shang, Ziyi Kou, Yang Zhang, et al.
2021 IEEE International Conference on Big Data (Big Data) (2021), pp. 899-908
Closed Access | Times Cited: 46
Lanyu Shang, Ziyi Kou, Yang Zhang, et al.
2021 IEEE International Conference on Big Data (Big Data) (2021), pp. 899-908
Closed Access | Times Cited: 46
Word2VisualVec: Image and Video to Sentence Matching by Visual Feature Prediction
Jianfeng Dong, Xirong Li, Cees G. M. Snoek
arXiv (Cornell University) (2016)
Open Access | Times Cited: 50
Jianfeng Dong, Xirong Li, Cees G. M. Snoek
arXiv (Cornell University) (2016)
Open Access | Times Cited: 50
Beyond the Memory Wall: A Case for Memory-Centric HPC System for Deep Learning
Youngeun Kwon, Minsoo Rhu
(2018)
Open Access | Times Cited: 50
Youngeun Kwon, Minsoo Rhu
(2018)
Open Access | Times Cited: 50
A Survey of the Usages of Deep Learning in Natural Language Processing
Daniel W. Otter, Julian Richard Medina, Jugal Kalita
arXiv (Cornell University) (2018)
Closed Access | Times Cited: 49
Daniel W. Otter, Julian Richard Medina, Jugal Kalita
arXiv (Cornell University) (2018)
Closed Access | Times Cited: 49
YoTube: Searching Action Proposal Via Recurrent and Static Regression Networks
Hongyuan Zhu, Romain Vial, Shijian Lu, et al.
IEEE Transactions on Image Processing (2018) Vol. 27, Iss. 6, pp. 2609-2622
Open Access | Times Cited: 48
Hongyuan Zhu, Romain Vial, Shijian Lu, et al.
IEEE Transactions on Image Processing (2018) Vol. 27, Iss. 6, pp. 2609-2622
Open Access | Times Cited: 48
A Hybrid Deep Model Using Deep Learning and Dense Optical Flow Approaches for Human Activity Recognition
Senem Tanberk, Zeynep Hilal Kilimci, Dilek Bilgin Tükel, et al.
IEEE Access (2020) Vol. 8, pp. 19799-19809
Open Access | Times Cited: 44
Senem Tanberk, Zeynep Hilal Kilimci, Dilek Bilgin Tükel, et al.
IEEE Access (2020) Vol. 8, pp. 19799-19809
Open Access | Times Cited: 44
Sentiment Analysis Based Direction Prediction in Bitcoin using Deep Learning Algorithms and Word Embedding Models
Zeynep Hilal Kilimci
International Journal of Intelligent Systems and Applications in Engineering (2020) Vol. 8, Iss. 2, pp. 60-65
Open Access | Times Cited: 42
Zeynep Hilal Kilimci
International Journal of Intelligent Systems and Applications in Engineering (2020) Vol. 8, Iss. 2, pp. 60-65
Open Access | Times Cited: 42
A comparative review of graph convolutional networks for human skeleton-based action recognition
Liqi Feng, Yaqin Zhao, Wenxuan Zhao, et al.
Artificial Intelligence Review (2021) Vol. 55, Iss. 5, pp. 4275-4305
Closed Access | Times Cited: 40
Liqi Feng, Yaqin Zhao, Wenxuan Zhao, et al.
Artificial Intelligence Review (2021) Vol. 55, Iss. 5, pp. 4275-4305
Closed Access | Times Cited: 40
Knowledge representation and learning of operator clinical workflow from full-length routine fetal ultrasound scan videos
Harshita Sharma, Lior Drukker, Pierre Chatelain, et al.
Medical Image Analysis (2021) Vol. 69, pp. 101973-101973
Open Access | Times Cited: 38
Harshita Sharma, Lior Drukker, Pierre Chatelain, et al.
Medical Image Analysis (2021) Vol. 69, pp. 101973-101973
Open Access | Times Cited: 38
Toward the unification of generative and discriminative visual foundation model: a survey
Xu Liu, Tong Zhou, Chong Wang, et al.
The Visual Computer (2024)
Closed Access | Times Cited: 5
Xu Liu, Tong Zhou, Chong Wang, et al.
The Visual Computer (2024)
Closed Access | Times Cited: 5
Temporal Difference Networks for Video Action Recognition
Joe Yue-Hei Ng, Larry S. Davis
(2018), pp. 1587-1596
Closed Access | Times Cited: 39
Joe Yue-Hei Ng, Larry S. Davis
(2018), pp. 1587-1596
Closed Access | Times Cited: 39
Intelligent Monitoring of Stress Induced by Water Deficiency in Plants Using Deep Learning
Shiva Azimi, Rohan Wadhawan, Tapan Kumar Gandhi
IEEE Transactions on Instrumentation and Measurement (2021) Vol. 70, pp. 1-13
Open Access | Times Cited: 32
Shiva Azimi, Rohan Wadhawan, Tapan Kumar Gandhi
IEEE Transactions on Instrumentation and Measurement (2021) Vol. 70, pp. 1-13
Open Access | Times Cited: 32
A review on Video Classification with Methods, Findings, Performance, Challenges, Limitations and Future Work
Md Shofiqul Islam, Mst Sunjida Sultana, Uttam Kumar Roy, et al.
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (2021) Vol. 6, Iss. 2, pp. 47-47
Open Access | Times Cited: 30
Md Shofiqul Islam, Mst Sunjida Sultana, Uttam Kumar Roy, et al.
Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (2021) Vol. 6, Iss. 2, pp. 47-47
Open Access | Times Cited: 30
Global semantic enhancement network for video captioning
Xuemei Luo, Xiaotong Luo, Di Wang, et al.
Pattern Recognition (2023) Vol. 145, pp. 109906-109906
Closed Access | Times Cited: 11
Xuemei Luo, Xiaotong Luo, Di Wang, et al.
Pattern Recognition (2023) Vol. 145, pp. 109906-109906
Closed Access | Times Cited: 11
An attention based dual learning approach for video captioning
Wanting Ji, Ruili Wang, Yan Tian, et al.
Applied Soft Computing (2021) Vol. 117, pp. 108332-108332
Closed Access | Times Cited: 27
Wanting Ji, Ruili Wang, Yan Tian, et al.
Applied Soft Computing (2021) Vol. 117, pp. 108332-108332
Closed Access | Times Cited: 27
A Computer Vision Approach for Estimating Lifting Load Contributors to Injury Risk
Guoyang Zhou, Vaneet Aggarwal, Ming Yin, et al.
IEEE Transactions on Human-Machine Systems (2022) Vol. 52, Iss. 2, pp. 207-219
Closed Access | Times Cited: 18
Guoyang Zhou, Vaneet Aggarwal, Ming Yin, et al.
IEEE Transactions on Human-Machine Systems (2022) Vol. 52, Iss. 2, pp. 207-219
Closed Access | Times Cited: 18
A Machine Learning Method for Automated Description and Workflow Analysis of First Trimester Ultrasound Scans
Robail Yasrab, Zeyu Fu, He Zhao, et al.
IEEE Transactions on Medical Imaging (2022) Vol. 42, Iss. 5, pp. 1301-1313
Open Access | Times Cited: 18
Robail Yasrab, Zeyu Fu, He Zhao, et al.
IEEE Transactions on Medical Imaging (2022) Vol. 42, Iss. 5, pp. 1301-1313
Open Access | Times Cited: 18
Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening
Yingjia Xu, Mengxia Wu, Zixin Guo, et al.
Visual Intelligence (2025) Vol. 3, Iss. 1
Open Access
Yingjia Xu, Mengxia Wu, Zixin Guo, et al.
Visual Intelligence (2025) Vol. 3, Iss. 1
Open Access
Incremental knowledge acquisition and self-learning for autonomous video surveillance
Rashmika Nawaratne, Tharindu Bandaragoda, Achini Adikari, et al.
IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society (2017), pp. 4790-4795
Closed Access | Times Cited: 32
Rashmika Nawaratne, Tharindu Bandaragoda, Achini Adikari, et al.
IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society (2017), pp. 4790-4795
Closed Access | Times Cited: 32
A study on deep learning spatiotemporal models and feature extraction techniques for video understanding
M Suresha, S. Kuppa, D. S. Raghukumar
International Journal of Multimedia Information Retrieval (2020) Vol. 9, Iss. 2, pp. 81-101
Closed Access | Times Cited: 25
M Suresha, S. Kuppa, D. S. Raghukumar
International Journal of Multimedia Information Retrieval (2020) Vol. 9, Iss. 2, pp. 81-101
Closed Access | Times Cited: 25