Artificial Intelligence
Advancing LLM Reasoning Generalists with Preference Trees
We introduce Eurus, a suite of large language models (LLMs) optimized for reasoning. Finetuned from Mistral-7B and CodeLlama-70B, Eurus …
Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Boji Shan, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
Scaling inference-time computation is increasingly seen as the next frontier in scaling laws for large language models. Previous work …
Kaiyan Zhang, Jiayuan Zhang, Haoxin Li, Xuekai Zhu, Ermo Hua, Xingtai Lv, Ning Ding, Bowen Zhou
Towards AI-45° Law: A Roadmap to Trustworthy AGI
Ensuring Artificial General Intelligence (AGI) reliably avoids harmful behaviors is a critical challenge, especially for systems with …
Chao Yang, Chaochao Lu, Yingchun Wang, Bowen Zhou
Automating exploratory proteomics research via language models
With the development of artificial intelligence, its contribution to science is evolving from simulating a complex problem to …
Ning Ding, Shang Qu, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
Retrieval-augmented generation (RAG) has emerged to address the knowledge-intensive visual question answering (VQA) task. Current …
Xinwei Long, Zhiyuan Ma, Ermo Hua, Kaiyan Zhang, Biqing Qi, Bowen Zhou
Empowering private tutoring by chaining large language models
Artificial intelligence has been applied in various aspects of online education to facilitate teaching and learning. However, few …
Yulin Chen, Ning Ding, Hai-Tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
As one of the most popular and sought-after generative models in recent years, diffusion models have sparked the interest of many …
Zhiyuan Ma, Yuzhu Zhang, Guoli Jia, Liangliang Zhao, Yichao Ma, Mingjie Ma, Gaofeng Liu, Kaiyan Zhang, Jianjun Li, Bowen Zhou
On the token distance modeling ability of higher RoPE attention dimension
Length extrapolation algorithms based on Rotary position embedding (RoPE) have shown promising results in extending the context length …
Xiangyu Hong, Che Jiang, Biqing Qi, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Improving the effectiveness and efficiency of large language models (LLMs) simultaneously is a critical yet challenging research goal. …
Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou
SAM Struggles in Concealed Scenes -- Empirical Study on Segment Anything
Segmenting anything is a ground-breaking step toward artificial general intelligence, and the Segment Anything Model (SAM) greatly …
Ge-Peng Ji, Deng-Ping Fan, Peng Xu, Ming-Ming Cheng, Bowen Zhou, Luc Van Gool