arxiv:2601.09667
Yucheng Wang
Echoandland
Models 23
Echoandland/olmo3-7b-physics-grpo-purerl-step9
Reinforcement Learning • 7B • Updated • 2
Echoandland/olmo3-7b-physics-grpo-purerl-step7
Reinforcement Learning • 7B • Updated • 4
Echoandland/qwen3-8b-dapo-high-entropy-step2
Reinforcement Learning • 8B • Updated
Echoandland/qwen3-8b-dapo-high-entropy-step8
Reinforcement Learning • 8B • Updated • 5
Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6
Reinforcement Learning • 7B • Updated
Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7
Reinforcement Learning • 7B • Updated
Echoandland/olmo3-7b-grpo-purerl-creativity-step28
Reinforcement Learning • 7B • Updated • 3
Echoandland/olmo3-7b-grpo-purerl-creativity-step5
Reinforcement Learning • 7B • Updated
Echoandland/qwen3-8b-grpo-purerl-creativity-step21
Reinforcement Learning • 8B • Updated
Echoandland/qwen3-8b-grpo-purerl-creativity-step9
Reinforcement Learning • 8B • Updated • 2