yukang lin
linyk
AI & ML interests
None yet
Recent Activity
liked a dataset 12 days ago
OpenMOSS-Team/OmniAction upvoted a paper 19 days ago
AI Can Learn Scientific Taste upvoted a paper about 1 month ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning