InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding

Qiaoling Chen, Diandian Gu, Guoteng Wang, Xun Chen, YingTong Xiong, Ting Huang, Qinghao Hu, Xin Jin, Yonggang Wen, Tianwei Zhang, Peng Sun

CoRR (2024)