Tour
News
People
Publications
Contact
English
English
中文 (简体)
Continual Learning
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
Direct Preference Optimization (DPO) improves the alignment of large language models (LLMs) with human values by training directly on …
Biqing Qi
,
Pengfei Li
,
Fangyuan Li
,
Junqi Gao
,
Kaiyan Zhang
,
Bowen Zhou
PDF
Cite
Contrastive Augmented Graph2Graph Memory Interaction for Few Shot Continual Learning
Few-Shot Class-Incremental Learning (FSCIL) has gained considerable attention in recent years for its pivotal role in addressing …
Biqing Qi
,
Junqi Gao
,
Xingquan Chen
,
Dong Li
,
Jianxing Liu
,
Ligang Wu
,
Bowen Zhou
PDF
Cite
Cite
×