FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Zhouliang Yu, Ruotian Peng, Keyi Ding, Yizhe Li, Zhongyuan Peng, Minghao Liu,Yifan Zhang, Zheng Yuan,Huajian Xin,Wenhao Huang, Yandong Wen, Ge Zhang,Weiyang Liu

arxiv（2025）

Cited 0|Views0

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined