Artificial Intelligence

When super-resolution meets camouflaged object detection: A comparison study.

Super-resolution (SR) and camouflaged object detection (COD) are two prominent topics in the field of computer vision, with various …

Juan Wen, Shupeng Cheng, Peng Xu, Bowen Zhou, Weiyan Hou, Luc Van Gool

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Extending the context length of Language Models (LMs) by improving Rotary Position Embedding (RoPE) has become a trend. While existing …

Ermo Hua, Che Jiang, Xingtai Lv, Kaiyan Zhang, Ning Ding, Youbang Sun, Biqing Qi, Yuchen Fan, Xuekai Zhu, Bowen Zhou

Free Process Rewards without Process Labels

Different from its counterpart outcome reward models (ORMs), which evaluate the entire responses, a process reward model (PRM) scores a …

Lifan Yuan, Wendi Li, Huayu Chen, Ganqu Cui, Ning Ding, Kaiyan Zhang, Bowen Zhou, Zhiyuan Liu, Hao Peng

How to Synthesize Text Data without Model Collapse?

Model collapse in synthetic data indicates that iterative training on self-generated data leads to a gradual decline in performance. …

Xuekai Zhu, Daixuan Cheng, Hengli Li, Kaiyan Zhang, Ermo Hua, Xingtai Lv, Ning Ding, Zhouhan Lin, Bowen Zhou

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

We introduce MedXpertQA, a highly challenging and comprehensive benchmark to evaluate expert-level medical knowledge and advanced …

Yuxin Zuo, Shang Qu, Linhai Xie, Yifei Li, Zhangren Chen, Xuekai Zhu, Ermo Hua, Kaiyan Zhang, Ning Ding, Bowen Zhou

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Recent advancements in Large Language Models (LLMs) have shown that it is promising to utilize Process Reward Models (PRMs) as …

Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi, Xiu Li, Bowen Zhou

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

State Space Models (SSMs) have emerged as a promising alternative to the popular transformer-based models and have been increasingly …

Xingtai Lv, Youbang Sun, Kaiyan Zhang, Shang Qu, Xuekai Zhu, Yuchen Fan, Yi Wu, Ermo Hua, Xinwei Long, Ning Ding, Bowen Zhou

Less is More: Efficient Model Merging with Binary Task Switch

As an effective approach to equip models with multi-task capabilities without additional training, model merging has garnered …

Biqing Qi, Fangyuan Li, Zhen Wang, Junqi Gao, Dong Li, Peng Ye, Bowen Zhou

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Test-Time Scaling (TTS) is an important method for improving the performance of Large Language Models (LLMs) by using additional …

Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou
