
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking the citation count opens this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
Shachar Don-Yehiya, Elad Venezian, Colin Raffel, et al.
(2023), pp. 788-806
Open Access | Times Cited: 8
Showing 8 citing articles:
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt, Aaron Mueller, Leshem Choshen, et al.
(2023)
Open Access | Times Cited: 45
Where to start? Analyzing the potential value of intermediate models
Leshem Choshen, Elad Venezian, Shachar Don-Yehiya, et al.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (2023), pp. 1446-1470
Open Access | Times Cited: 7
An Empirical Study of Multimodal Model Merging
Yi-Lin Sung, Linjie Li, Kevin Lin, et al.
(2023)
Open Access | Times Cited: 6
Knowledge is a Region in Weight Space for Fine-tuned Language Models
Almog Gueta, Elad Venezian, Colin Raffel, et al.
(2023), pp. 1350-1370
Open Access | Times Cited: 4
Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies
Daniel E. Lawson, Ahmed H. Qureshi
(2024), pp. 12942-12948
Open Access | Times Cited: 1
E-code: Mastering efficient code generation through pretrained models and expert encoder group
Yue Pan, Chen Lyu, Zhenyu Yang, et al.
Information and Software Technology (2024) Vol. 178, pp. 107602-107602
Open Access | Times Cited: 1
DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Nastaran Saadati, Minh Pham, Nasla Saleem, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 260, pp. 27507-27517
Closed Access
Model Breadcrumbs: Scaling Multi-task Model Merging with Sparse Masks
MohammadReza Davari, Eugene Belilovsky
Lecture notes in computer science (2024), pp. 270-287
Closed Access