Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-scale DNN Training
Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu
ECAI 2023 (2023)
Keywords: Memory Applications, Large-Scale Optimization