
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Speech Synthesis With Mixed Emotions
Kun Zhou, Berrak Şişman, Rajib Rana, et al.
IEEE Transactions on Affective Computing (2022) Vol. 14, Iss. 4, pp. 3120-3134
Open Access | Times Cited: 23
Kun Zhou, Berrak Şişman, Rajib Rana, et al.
IEEE Transactions on Affective Computing (2022) Vol. 14, Iss. 4, pp. 3120-3134
Open Access | Times Cited: 23
Showing 23 citing articles:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space With Natural Language Style Prompt
Dongchao Yang, Songxiang Liu, Rongjie Huang, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2024) Vol. 32, pp. 2913-2925
Open Access | Times Cited: 12
Dongchao Yang, Songxiang Liu, Rongjie Huang, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2024) Vol. 32, pp. 2913-2925
Open Access | Times Cited: 12
Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Yiwei Guo, Chenpeng Du, Xie Chen, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023), pp. 1-5
Open Access | Times Cited: 12
Yiwei Guo, Chenpeng Du, Xie Chen, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2023), pp. 1-5
Open Access | Times Cited: 12
Психолінгвістичні механізми емоційної регуляції навчальної діяльності
Оleksiy Chebykin
Insight the psychological dimensions of society (2023), Iss. 10, pp. 117-136
Open Access | Times Cited: 11
Оleksiy Chebykin
Insight the psychological dimensions of society (2023), Iss. 10, pp. 117-136
Open Access | Times Cited: 11
Identification of Persons Based on Electrocardiogram and Motion Data
Waltenegus Dargie, Sajad Farrokhi, Christian Poellabauer
(2024)
Open Access | Times Cited: 4
Waltenegus Dargie, Sajad Farrokhi, Christian Poellabauer
(2024)
Open Access | Times Cited: 4
DeepMine-multi-TTS: a Persian speech corpus for multi-speaker text-to-speech
Majid Adibian, Hossein Zeinali, Soroush Barmaki
Language Resources and Evaluation (2025)
Closed Access
Majid Adibian, Hossein Zeinali, Soroush Barmaki
Language Resources and Evaluation (2025)
Closed Access
Large Language Model-Driven 3D Hyper-Realistic Interactive Intelligent Digital Human System
Young-Joo Song, Wei Xiong
Sensors (2025) Vol. 25, Iss. 6, pp. 1855-1855
Open Access
Young-Joo Song, Wei Xiong
Sensors (2025) Vol. 25, Iss. 6, pp. 1855-1855
Open Access
ECE-TTS: A Zero-Shot Emotion Text-to-Speech Model with Simplified and Precise Control
Shixiong Liang, Ruohua Zhou, Qingsheng Yuan
Applied Sciences (2025) Vol. 15, Iss. 9, pp. 5108-5108
Open Access
Shixiong Liang, Ruohua Zhou, Qingsheng Yuan
Applied Sciences (2025) Vol. 15, Iss. 9, pp. 5108-5108
Open Access
A Multimodal Dataset for Mixed Emotion Recognition
Pei Yang, Niqi Liu, Xinge Liu, et al.
Scientific Data (2024) Vol. 11, Iss. 1
Open Access | Times Cited: 3
Pei Yang, Niqi Liu, Xinge Liu, et al.
Scientific Data (2024) Vol. 11, Iss. 1
Open Access | Times Cited: 3
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis
Sho Inoue, Kun Zhou, Shuai Wang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024)
Open Access | Times Cited: 1
Sho Inoue, Kun Zhou, Shuai Wang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024)
Open Access | Times Cited: 1
BWSNET: Automatic Perceptual Assessment of Audio Signals
Clément Le Moine Veillon, Victor Rosi, Pablo Arias Sarah, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 10416-10420
Open Access
Clément Le Moine Veillon, Victor Rosi, Pablo Arias Sarah, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 10416-10420
Open Access
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model
Zongyang Du, Junchen Lu, Kun Zhou, et al.
(2024), pp. 172-179
Open Access
Zongyang Du, Junchen Lu, Kun Zhou, et al.
(2024), pp. 172-179
Open Access
Towards Yoruba-Speaking Google Maps Navigation
Fiyinfoluwa Oyesanmi, Peter Olukanmi
Research Square (Research Square) (2024)
Open Access
Fiyinfoluwa Oyesanmi, Peter Olukanmi
Research Square (Research Square) (2024)
Open Access
FER20E: An Extended Facial Expression Recognition Dataset with 20 Discrete Emotions
Kuldeep Singh Yadav, Sonalika Singh, Lalan Kumar
(2024)
Open Access
Kuldeep Singh Yadav, Sonalika Singh, Lalan Kumar
(2024)
Open Access
RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis
Haoxiang Shi, Jianzong Wang, Xulong Zhang, et al.
Lecture notes in computer science (2024), pp. 90-104
Closed Access
Haoxiang Shi, Jianzong Wang, Xulong Zhang, et al.
Lecture notes in computer science (2024), pp. 90-104
Closed Access
PiCo-VITS: Leveraging Pitch Contours for Fine-Grained Emotional Speech Synthesis
Kwan-yeung Wong, Fu-Lai Chung
Lecture notes in computer science (2024), pp. 210-221
Closed Access
Kwan-yeung Wong, Fu-Lai Chung
Lecture notes in computer science (2024), pp. 210-221
Closed Access
Emotional Text-To-Speech in Japanese Using Artificially Augmented Dataset
Mujahid Jamal A. Khalifah, Michał Ptaszyński, Fumito Masui
IEEE Access (2024) Vol. 12, pp. 167724-167777
Open Access
Mujahid Jamal A. Khalifah, Michał Ptaszyński, Fumito Masui
IEEE Access (2024) Vol. 12, pp. 167724-167777
Open Access
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-To-Speech
Haibin Wu, Xiaofei Wang, Şefik Emre Eskimez, et al.
2022 IEEE Spoken Language Technology Workshop (SLT) (2024), pp. 690-697
Open Access
Haibin Wu, Xiaofei Wang, Şefik Emre Eskimez, et al.
2022 IEEE Spoken Language Technology Workshop (SLT) (2024), pp. 690-697
Open Access
A Time-Distributed CNN-LSTM with Attention Model for Speech Based Emotion Recognition
Kedar Deshpande, Manjit Singh Sodhi, Nidhi Raniyer, et al.
(2024), pp. 67-71
Closed Access
Kedar Deshpande, Manjit Singh Sodhi, Nidhi Raniyer, et al.
(2024), pp. 67-71
Closed Access
Empathy by Design: The Influence of Trembling AI Voices on Prosocial Behavior
Fotis Efthymiou, Christian Hildebrand
IEEE Transactions on Affective Computing (2023) Vol. 15, Iss. 3, pp. 1253-1263
Open Access | Times Cited: 1
Fotis Efthymiou, Christian Hildebrand
IEEE Transactions on Affective Computing (2023) Vol. 15, Iss. 3, pp. 1253-1263
Open Access | Times Cited: 1
Style Generative Adversarial Network Combined with Dynamic Fundamental Frequency Difference Compensation: A Practical and Efficient Method for Emotional Voice Conversion
Zeyu Yang, Yanping Li, Jie Yu, et al.
Journal of Circuits Systems and Computers (2024) Vol. 33, Iss. 16
Closed Access
Zeyu Yang, Yanping Li, Jie Yu, et al.
Journal of Circuits Systems and Computers (2024) Vol. 33, Iss. 16
Closed Access
Retrospective and Perspectives of TTS & STT Technology Development and Implementation for South Slavic Under-Resourced Languages
Milan Sečujski, Branislav Popović, Darko Pekar, et al.
Lecture notes in computer science (2024), pp. 23-42
Closed Access
Milan Sečujski, Branislav Popović, Darko Pekar, et al.
Lecture notes in computer science (2024), pp. 23-42
Closed Access
Emotional Speech Synthesis using End-to-End neural TTS models
S K Nithin, Jay Prakash
(2022), pp. 1-7
Closed Access | Times Cited: 1
S K Nithin, Jay Prakash
(2022), pp. 1-7
Closed Access | Times Cited: 1
Nonparallel Expressive TTS for Unseen Target Speaker using Style-Controlled Adaptive Layer and Optimized Pitch Embedding
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh
(2023), pp. 176-181
Closed Access
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh
(2023), pp. 176-181
Closed Access
Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Morgan Sandler, Arun Ross
(2023), pp. 1-9
Open Access
Morgan Sandler, Arun Ross
(2023), pp. 1-9
Open Access