
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
HierVL: Learning Hierarchical Video-Language Embeddings
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023) Vol. 114, pp. 23066-23078
Open Access | Times Cited: 20
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023) Vol. 114, pp. 23066-23078
Open Access | Times Cited: 20
Showing 20 citing articles:
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 25
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 25
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 19383-19400
Closed Access | Times Cited: 10
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 19383-19400
Closed Access | Times Cited: 10
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Santhosh Kumar Ramakrishnan, Ziad Al-Halah, Kristen Grauman
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6694-6703
Open Access | Times Cited: 12
Santhosh Kumar Ramakrishnan, Ziad Al-Halah, Kristen Grauman
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6694-6703
Open Access | Times Cited: 12
VideoLLM-online: Online Video Large Language Model for Streaming Video
Joya Chen, Zhaoyang Lv, Shiwei Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18407-18418
Closed Access | Times Cited: 4
Joya Chen, Zhaoyang Lv, Shiwei Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18407-18418
Closed Access | Times Cited: 4
Programming-by-Demonstration for Long-Horizon Robot Tasks
Noah Patton, Kia Rahmani, Meghana Missula, et al.
Proceedings of the ACM on Programming Languages (2024) Vol. 8, Iss. POPL, pp. 512-545
Open Access | Times Cited: 3
Noah Patton, Kia Rahmani, Meghana Missula, et al.
Proceedings of the ACM on Programming Languages (2024) Vol. 8, Iss. POPL, pp. 512-545
Open Access | Times Cited: 3
Step Differences in Instructional Video
Tushar Nagarajan, Lorenzo Torresani
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18740-18750
Closed Access | Times Cited: 3
Tushar Nagarajan, Lorenzo Torresani
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18740-18750
Closed Access | Times Cited: 3
Detours for Navigating Instructional Videos
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 18804-18815
Closed Access | Times Cited: 3
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 18804-18815
Closed Access | Times Cited: 3
Foundation Models for Video Understanding: A Survey
Neelu Madan, Andreas Møgelmose, Rajat Modi, et al.
(2024)
Open Access | Times Cited: 2
Neelu Madan, Andreas Møgelmose, Rajat Modi, et al.
(2024)
Open Access | Times Cited: 2
Foundation Models for Video Understanding: A Survey
Neelu Madan, Andreas Møgelmose, Rajat Modi, et al.
(2024)
Open Access | Times Cited: 2
Neelu Madan, Andreas Møgelmose, Rajat Modi, et al.
(2024)
Open Access | Times Cited: 2
Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities
AJ Piergiovanni, Isaac Noble, Dahun Kim, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 26794-26804
Closed Access | Times Cited: 2
AJ Piergiovanni, Isaac Noble, Dahun Kim, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 26794-26804
Closed Access | Times Cited: 2
Improving semantic video retrieval models by training with a relevance-aware online mining strategy
Alex Falcon, Giuseppe Serra, Oswald Lanz
Computer Vision and Image Understanding (2024) Vol. 245, pp. 104035-104035
Open Access | Times Cited: 1
Alex Falcon, Giuseppe Serra, Oswald Lanz
Computer Vision and Image Understanding (2024) Vol. 245, pp. 104035-104035
Open Access | Times Cited: 1
Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models
Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18580-18590
Closed Access | Times Cited: 1
Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18580-18590
Closed Access | Times Cited: 1
Learning Object State Changes in Videos: An Open-World Perspective
Zihui Xue, Ashutosh Kumar, Kristen Grauman
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18493-18503
Closed Access | Times Cited: 1
Zihui Xue, Ashutosh Kumar, Kristen Grauman
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18493-18503
Closed Access | Times Cited: 1
A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval
Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 7300-7304
Open Access
Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 7300-7304
Open Access
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Peirone, Francesca Pistilli, Antonio Alliegro, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18275-18285
Closed Access
Simone Peirone, Francesca Pistilli, Antonio Alliegro, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18275-18285
Closed Access
PALM: Predicting Actions through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian, et al.
Lecture notes in computer science (2024), pp. 140-158
Closed Access
Sanghwan Kim, Daoji Huang, Yongqin Xian, et al.
Lecture notes in computer science (2024), pp. 140-158
Closed Access
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai, Xiaoliang Dai, Lawrence R. Chen, et al.
Lecture notes in computer science (2024), pp. 135-155
Closed Access
Bolin Lai, Xiaoliang Dai, Lawrence R. Chen, et al.
Lecture notes in computer science (2024), pp. 135-155
Closed Access
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
Sanjoy Kundu, Shubham Trehan, Sathyanarayanan N. Aakur
Lecture notes in computer science (2024), pp. 39-56
Closed Access
Sanjoy Kundu, Shubham Trehan, Sathyanarayanan N. Aakur
Lecture notes in computer science (2024), pp. 39-56
Closed Access
COM Kitchens: An Unedited Overhead-View Video Dataset as a Vision-Language Benchmark
Koki Maeda, Tosho Hirasawa, Atsushi Hashimoto, et al.
Lecture notes in computer science (2024), pp. 123-140
Closed Access
Koki Maeda, Tosho Hirasawa, Atsushi Hashimoto, et al.
Lecture notes in computer science (2024), pp. 123-140
Closed Access
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation
Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha, et al.
Lecture notes in computer science (2024), pp. 454-472
Closed Access
Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha, et al.
Lecture notes in computer science (2024), pp. 454-472
Closed Access