Jian Zhao
Latest
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling