You Only Cache Once: Decoder-Decoder Architectures for Language ModelsYutao Sun,Li Dong, Yi Zhu,Shaohan Huang,Wenhui Wang,Shuming Ma,Quanlu Zhang,Jianyong Wang,Furu WeiNeurIPS 2024(2024)引用 52|浏览91关键词Decoder-Decoder,Model ArchitectureAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要