Tour
News
People
Publications
Contact
English
English
中文 (简体)
Large Language Model
Towards Building Specialized Generalist AI with System 1 and System 2 Fusion
In this perspective paper, we introduce the concept of Specialized Generalist Artificial Intelligence (SGAI or simply SGI) as a crucial …
Kaiyan Zhang
,
Biqing Qi
,
Bowen Zhou
PDF
Cite
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
Large Language Models (LLMs) demonstrate impressive performance in diverse applications, yet they face significant drawbacks, including …
Kaiyan Zhang
,
Jianyu Wang
,
Ning Ding
,
Biqing Qi
,
Ermo Hua
,
Xingtai Lv
,
Bowen Zhou
PDF
Cite
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
Supervised Fine-Tuning (SFT) and Preference Optimization (PO) are two fundamental processes for enhancing the capabilities of Language …
Ermo Hua
,
Biqing Qi
,
Kaiyan Zhang
,
Yue Yu
,
Ning Ding
,
Xingtai Lv
,
Kai Tian
,
Bowen Zhou
PDF
Cite
Code
UltraMedical: Building Specialized Generalists in Biomedicine
Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains and are moving towards more specialized …
Kaiyan Zhang
,
Sihang Zeng
,
Ermo Hua
,
Ning Ding
,
Zhang-Ren Chen
,
Zhiyuan Ma
,
Haoxin Li
,
Ganqu Cui
,
Biqing Qi
,
Xuekai Zhu
,
Xingtai Lv
,
Jin-Fang Hu
,
Zhiyuan Liu
,
Bowen Zhou
PDF
Cite
Code
Dataset
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
With the advancement of language models (LMs), their exposure to private data is increas- ingly inevitable, and their deployment (espe- …
Kaiyan Zhang
,
Jianyu Wang
,
Ermo Hua
,
Biqing Qi
,
Ning Ding
,
Bowen Zhou
PDF
Cite
Investigating Deep Watermark Security: An Adversarial Transferability Perspective
The rise of generative neural networks has triggered an increased demand for intellectual property (IP) protection in generated …
Biqing Qi
,
Junqi Gao
,
Yiang Luo
,
Jianxing Liu
,
Ligang Wu
,
Bowen Zhou
PDF
Cite
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
Instruction tuning has recently been recognized as an effective way of aligning Large Language Models (LLMs) to enhance their …
Kaiyan Zhang
,
Ning Ding
,
Biqing Qi
,
Xuekai Zhu
,
Xinwei Long
,
Bowen Zhou
PDF
Cite
Code
Sparse Low-rank Adaptation of Pre-trained Language Models
Fine-tuning pre-trained large language models in a parameter-efficient manner is widely studied for its effectiveness and efficiency. …
Ning Ding
,
Xingtai Lv
,
Qiaosen Wang
,
Yulin Chen
,
Bowen Zhou
,
Zhiyuan Liu
,
Maosong Sun
PDF
Cite
Code
Cite
×