
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu, Changhan Wang, Andros Tjandra, et al.
Interspeech 2022 (2022)
Open Access | Times Cited: 283
Arun Babu, Changhan Wang, Andros Tjandra, et al.
Interspeech 2022 (2022)
Open Access | Times Cited: 283
Showing 1-25 of 283 citing articles:
Self-Supervised Speech Representation Learning: A Review
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1179-1210
Open Access | Times Cited: 207
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1179-1210
Open Access | Times Cited: 207
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap
Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 9, pp. 10745-10759
Open Access | Times Cited: 183
Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 9, pp. 10745-10759
Open Access | Times Cited: 183
A review of deep learning techniques for speech processing
Ambuj Mehrish, Navonil Majumder, Rishabh Bharadwaj, et al.
Information Fusion (2023) Vol. 99, pp. 101869-101869
Open Access | Times Cited: 151
Ambuj Mehrish, Navonil Majumder, Rishabh Bharadwaj, et al.
Information Fusion (2023) Vol. 99, pp. 101869-101869
Open Access | Times Cited: 151
FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech
Alexis Conneau, Min Ma, Simran Khanuja, et al.
2022 IEEE Spoken Language Technology Workshop (SLT) (2023), pp. 798-805
Open Access | Times Cited: 81
Alexis Conneau, Min Ma, Simran Khanuja, et al.
2022 IEEE Spoken Language Technology Workshop (SLT) (2023), pp. 798-805
Open Access | Times Cited: 81
Prompting Large Language Models with Speech Recognition Abilities
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 13351-13355
Open Access | Times Cited: 27
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 13351-13355
Open Access | Times Cited: 27
Using transformers for multimodal emotion recognition: Taxonomies and state of the art review
Samira Hazmoune, Fateh Bougamouza
Engineering Applications of Artificial Intelligence (2024) Vol. 133, pp. 108339-108339
Closed Access | Times Cited: 21
Samira Hazmoune, Fateh Bougamouza
Engineering Applications of Artificial Intelligence (2024) Vol. 133, pp. 108339-108339
Closed Access | Times Cited: 21
Joint speech and text machine translation for up to 100 languages
Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, et al.
Nature (2025) Vol. 637, Iss. 8046, pp. 587-593
Open Access | Times Cited: 3
Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, et al.
Nature (2025) Vol. 637, Iss. 8046, pp. 587-593
Open Access | Times Cited: 3
MAESTRO: Matched Speech Text Representations through Modality Matching
Zhehuai Chen, Zhang Yu, Andrew E. Rosenberg, et al.
Interspeech 2022 (2022)
Open Access | Times Cited: 56
Zhehuai Chen, Zhang Yu, Andrew E. Rosenberg, et al.
Interspeech 2022 (2022)
Open Access | Times Cited: 56
The Vicomtech Audio Deepfake Detection System Based on Wav2vec2 for the 2022 ADD Challenge
Juan M. Martín-Doñas, Aitor Álvarez
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 9241-9245
Open Access | Times Cited: 48
Juan M. Martín-Doñas, Aitor Álvarez
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 9241-9245
Open Access | Times Cited: 48
Improving Automatic Speech Recognition Performance for Low-Resource Languages With Self-Supervised Models
Jing Zhao, Wei-Qiang Zhang
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1227-1241
Closed Access | Times Cited: 41
Jing Zhao, Wei-Qiang Zhang
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1227-1241
Closed Access | Times Cited: 41
Findings of the VarDial Evaluation Campaign 2023
Noëmi Aepli, Çağrı Çöltekin, Rob van der Goot, et al.
(2023), pp. 251-261
Open Access | Times Cited: 34
Noëmi Aepli, Çağrı Çöltekin, Rob van der Goot, et al.
(2023), pp. 251-261
Open Access | Times Cited: 34
Ethical Challenges in the Development of Virtual Assistants Powered by Large Language Models
Andrés Piñeiro-Martín, Carmén García Mateo, Laura Docío-Fernández, et al.
Electronics (2023) Vol. 12, Iss. 14, pp. 3170-3170
Open Access | Times Cited: 31
Andrés Piñeiro-Martín, Carmén García Mateo, Laura Docío-Fernández, et al.
Electronics (2023) Vol. 12, Iss. 14, pp. 3170-3170
Open Access | Times Cited: 31
Audio Deepfake Detection With Self-Supervised Wavlm And Multi-Fusion Attentive Classifier
Yinlin Guo, Haofan Huang, Xi Chen, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 12702-12706
Open Access | Times Cited: 9
Yinlin Guo, Haofan Huang, Xi Chen, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 12702-12706
Open Access | Times Cited: 9
Fake Audio Detection Based On Unsupervised Pretraining Models
Zhiqiang Lv, Shanshan Zhang, Kai Tang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Closed Access | Times Cited: 35
Zhiqiang Lv, Shanshan Zhang, Kai Tang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Closed Access | Times Cited: 35
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, et al.
Interspeech 2022 (2022), pp. 106-110
Open Access | Times Cited: 32
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, et al.
Interspeech 2022 (2022), pp. 106-110
Open Access | Times Cited: 32
Novel Speech Recognition Systems Applied to Forensics within Child Exploitation: Wav2vec2.0 vs. Whisper
Juan Camilo Vásquez-Correa, Aitor Álvarez
Sensors (2023) Vol. 23, Iss. 4, pp. 1843-1843
Open Access | Times Cited: 20
Juan Camilo Vásquez-Correa, Aitor Álvarez
Sensors (2023) Vol. 23, Iss. 4, pp. 1843-1843
Open Access | Times Cited: 20
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Felix Wu, Kwangyoun Kim, Shinji Watanabe, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023)
Open Access | Times Cited: 20
Felix Wu, Kwangyoun Kim, Shinji Watanabe, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023)
Open Access | Times Cited: 20
Design, Implementation, and Practical Evaluation of a Voice Recognition Based IoT Home Automation System for Low-Resource Languages and Resource-Constrained Edge IoT Devices: A System for Galician and Mobile Opportunistic Scenarios
Iván Froiz-Míguez, Paula Fraga‐Lamas, Tiago M. Fernández‐Caramés
IEEE Access (2023) Vol. 11, pp. 63623-63649
Open Access | Times Cited: 20
Iván Froiz-Míguez, Paula Fraga‐Lamas, Tiago M. Fernández‐Caramés
IEEE Access (2023) Vol. 11, pp. 63623-63649
Open Access | Times Cited: 20
AI-Synthesized Voice Detection Using Neural Vocoder Artifacts
Chengzhe Sun, Shan Jia, Shuwei Hou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2023), pp. 904-912
Open Access | Times Cited: 18
Chengzhe Sun, Shan Jia, Shuwei Hou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2023), pp. 904-912
Open Access | Times Cited: 18
A Robust Audio Deepfake Detection System via Multi-View Feature
Yujie Yang, Haochen Qin, Hang Zhou, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 13131-13135
Open Access | Times Cited: 8
Yujie Yang, Haochen Qin, Hang Zhou, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 13131-13135
Open Access | Times Cited: 8
6G toward Metaverse: Technologies, Applications, and Challenges
Haoran Peng, Pei-Chen Chen, Pin-Hua Chen, et al.
(2022), pp. 6-10
Closed Access | Times Cited: 28
Haoran Peng, Pei-Chen Chen, Pin-Hua Chen, et al.
(2022), pp. 6-10
Closed Access | Times Cited: 28
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-Level Cross-Lingual Speech Representation
Sameer Khurana, Antoine Laurent, James Glass
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1493-1504
Open Access | Times Cited: 25
Sameer Khurana, Antoine Laurent, James Glass
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1493-1504
Open Access | Times Cited: 25
Improving Massively Multilingual ASR with Auxiliary CTC Objectives
William Chen, Brian Yan, Jiatong Shi, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023), pp. 1-5
Open Access | Times Cited: 14
William Chen, Brian Yan, Jiatong Shi, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023), pp. 1-5
Open Access | Times Cited: 14
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech
Titouan Parcollet, Ha H. Nguyen, Solène Evain, et al.
Computer Speech & Language (2024) Vol. 86, pp. 101622-101622
Open Access | Times Cited: 5
Titouan Parcollet, Ha H. Nguyen, Solène Evain, et al.
Computer Speech & Language (2024) Vol. 86, pp. 101622-101622
Open Access | Times Cited: 5
Introduction to Neural Transfer Learning With Transformers for Social Science Text Analysis
Sandra Wankmüller
Sociological Methods & Research (2022) Vol. 53, Iss. 4, pp. 1676-1752
Open Access | Times Cited: 20
Sandra Wankmüller
Sociological Methods & Research (2022) Vol. 53, Iss. 4, pp. 1676-1752
Open Access | Times Cited: 20