概览
新闻动态
成员信息
科学研究
联系我们
中文 (简体)
中文 (简体)
English
Continual Learning
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
Direct Preference Optimization (DPO) improves the alignment of large language models (LLMs) with human values by training directly on …
Biqing Qi
,
Pengfei Li
,
Fangyuan Li
,
Junqi Gao
,
Kaiyan Zhang
,
Bowen Zhou
PDF
引用
Contrastive Augmented Graph2Graph Memory Interaction for Few Shot Continual Learning
Few-Shot Class-Incremental Learning (FSCIL) has gained considerable attention in recent years for its pivotal role in addressing …
Biqing Qi
,
Junqi Gao
,
Xingquan Chen
,
Dong Li
,
Jianxing Liu
,
Ligang Wu
,
Bowen Zhou
PDF
引用
引用
×