Towards Lossless Head Pruning Through Automatic Peer Distillation for Language Models.Bingbing Li,Zigeng Wang,Shaoyi Huang,Mikhail Bragin,Ji Li,Caiwen DingIJCAI 2023(2023)引用 1|浏览98关键词Natural Language Processing -> NLP: Language modelsAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要