Large Language Model

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion

In this perspective paper, we introduce the concept of Specialized Generalist Artificial Intelligence (SGAI or simply SGI) as a crucial …

Kaiyan Zhang, Biqing Qi, Bowen Zhou

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

Large Language Models (LLMs) demonstrate impressive performance in diverse applications, yet they face significant drawbacks, including …

Kaiyan Zhang, Jianyu Wang, Ning Ding, Biqing Qi, Ermo Hua, Xingtai Lv, Bowen Zhou

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

Supervised Fine-Tuning (SFT) and Preference Optimization (PO) are two fundamental processes for enhancing the capabilities of Language …

Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou

CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following

With the advancement of language models (LMs), their exposure to private data is increas- ingly inevitable, and their deployment (espe- …

Kaiyan Zhang, Jianyu Wang, Ermo Hua, Biqing Qi, Ning Ding, Bowen Zhou

CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following

Investigating Deep Watermark Security: An Adversarial Transferability Perspective

The rise of generative neural networks has triggered an increased demand for intellectual property (IP) protection in generated …

Biqing Qi, Junqi Gao, Yiang Luo, Jianxing Liu, Ligang Wu, Bowen Zhou

Investigating Deep Watermark Security: An Adversarial Transferability Perspective

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

Instruction tuning has recently been recognized as an effective way of aligning Large Language Models (LLMs) to enhance their …

Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, Bowen Zhou

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

Sparse Low-rank Adaptation of Pre-trained Language Models

Fine-tuning pre-trained large language models in a parameter-efficient manner is widely studied for its effectiveness and efficiency. …

Ning Ding, Xingtai Lv, Qiaosen Wang, Yulin Chen, Bowen Zhou, Zhiyuan Liu, Maosong Sun

Sparse Low-rank Adaptation of Pre-trained Language Models