Victor Sanh
Hugging Face
Verified email at huggingface.co
Title
Cited by
Year
Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ...
Proceedings of the 2020 conference on empirical methods in natural language …, 2020
Cited by: 5345* | Year: 2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
V Sanh, L Debut, J Chaumond, T Wolf
arXiv preprint arXiv:1910.01108, 2019
Cited by: 2470* | Year: 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
T Wolf, V Sanh, J Chaumond, C Delangue
arXiv preprint arXiv:1901.08149, 2019
Cited by: 336 | Year: 2019
A hierarchical multi-task approach for learning embeddings from semantic tasks
V Sanh, T Wolf, S Ruder
Proceedings of the AAAI Conference on Artificial Intelligence 33, 6949-6956, 2019
Cited by: 174 | Year: 2019
Multitask Prompted Training Enables Zero-Shot Task Generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...
arXiv preprint arXiv:2110.08207, 2021
Cited by: 114 | Year: 2021
Movement pruning: Adaptive sparsity by fine-tuning
V Sanh, T Wolf, A Rush
Advances in Neural Information Processing Systems 33, 20378-20389, 2020
Cited by: 104 | Year: 2020
Datasets: A Community Library for Natural Language Processing
Q Lhoest, AV del Moral, Y Jernite, A Thakur, P von Platen, S Patil, ...
arXiv preprint arXiv:2109.02846, 2021
Cited by: 63* | Year: 2021
Learning from others' mistakes: Avoiding dataset biases without modeling them
V Sanh, T Wolf, Y Belinkov, AM Rush
arXiv preprint arXiv:2012.01300, 2020
Cited by: 26 | Year: 2020
EdgeBERT: Sentence-level energy optimizations for latency-aware multi-task NLP inference
T Tambe, C Hooper, L Pentecost, T Jia, EY Yang, M Donato, V Sanh, ...
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
Cited by: 25* | Year: 2021
Block Pruning For Faster Transformers
F Lagunas, E Charlaix, V Sanh, AM Rush
arXiv preprint arXiv:2109.04838, 2021
Cited by: 21 | Year: 2021
Low-Complexity Probing via Finding Subnetworks
S Cao, V Sanh, AM Rush
arXiv preprint arXiv:2104.03514, 2021
Cited by: 9 | Year: 2021
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
SH Bach, V Sanh, ZX Yong, A Webson, C Raffel, NV Nayak, A Sharma, ...
arXiv preprint arXiv:2202.01279, 2022
Cited by: 8 | Year: 2022
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
PA Utama, NS Moosavi, V Sanh, I Gurevych
arXiv preprint arXiv:2109.04144, 2021
Cited by: 5 | Year: 2021
What Language Model to Train if You Have One Million GPU Hours?
T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...
Challenges & …, 2022
Cited by: 2 | Year: 2022