News
People
Publications
Contact
English
English
中文 (简体)
Junqi Gao
Latest
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Less is More: Efficient Model Merging with Binary Task Switch
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
Exploring Adversarial Robustness of Deep State Space Models
Enhancing Adversarial Transferability via Information Bottleneck Constraints
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
SMR: State Memory Replay for Long Sequence Modeling
Contrastive Augmented Graph2Graph Memory Interaction for Few Shot Continual Learning
Investigating Deep Watermark Security: An Adversarial Transferability Perspective
Interactive Continual Learning: Fast and Slow Thinking
Cite
×