Reinforced Self-Training (ReST) for Language Modeling
Caglar Gulcehre, Tom Le Paine, Srivatsan Srinivasan, Ksenia Konyushkova, Lotte Weerts, Abhishek Sharma, Aditya Siddhant, Alex Ahern, Miaosen Wang, Chenjie Gu, Wolfgang Macherey, Arnaud Doucet, Orhan Firat, Nando de Freitas
CoRR (2023)
Citations: 273 | Views: 284
Keywords: Reinforcement Learning, Machine Translation, Language Modeling, Neural Machine Translation, Natural Language Generation