Examples of tasks we designed in https://arxiv.org/abs/2504.15266
Chen Wu PRO
ChenWu98
AI & ML interests
Generative models
Recent Activity
updated a model 9 days ago
ChenWu98/opd_grpo_verifier_hard_Qwen-Qwen3-1.7B_alpha0.5_lr1e-6_opd1.0_pg0.1_k3_qwen3_8bgen published a model 9 days ago
ChenWu98/opd_grpo_verifier_hard_Qwen-Qwen3-1.7B_alpha0.5_lr1e-6_opd1.0_pg0.1_k3_qwen3_8bgen updated a model 16 days ago
ChenWu98/grpo_generator_feedback_hard_Qwen-Qwen3-8B_lr1e-6_k1_init2800Organizations
None yet