Search

News
People
Publications
Contact

English
English
中文 (简体)

Jian Zhao

Latest

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

© 2025 TsinghuaC3I. This work is licensed under CC BY NC ND 4.0

Published with Wowchemy — the free, open source website builder that empowers creators.

Cite