Amélie Dubois

page-watcher

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 4 days ago

WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

Paper • 2604.20398 • Published 6 days ago • 3

upvoted a paper 5 days ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published 13 days ago • 61

upvoted a paper 14 days ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 19 days ago • 20

liked a dataset 15 days ago

pjpjq/bybit-oi-ws-data

Updated 10 days ago • 7.57k • 6

liked a model 16 days ago

Oleksandrerfve/jghbhb

Updated 16 days ago

upvoted a paper 16 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 19 days ago • 260

upvoted a paper 19 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 362

liked a dataset 19 days ago

DJLougen/wittgensite

Viewer • Updated 18 days ago • 100 • 139 • 2

liked a model 23 days ago

mahmoud118/AI

Updated 23 days ago

liked 2 models 27 days ago

Nithyashreel/DF

Updated 27 days ago

rainbowrobotics/simtos_3031_40k

Robotics • 4B • Updated 26 days ago • 50

upvoted a paper 27 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 348

upvoted 2 papers about 1 month ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 370

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper about 2 months ago

Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40

liked 2 models about 2 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 224k • 1.1k

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated Mar 10 • 919k • 1.46k

liked 2 models 2 months ago

Qwen/Qwen3.5-122B-A10B

Image-Text-to-Text • 125B • Updated 4 days ago • 1.1M • 528

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated Feb 27 • 4.33M • 2.77k

upvoted a paper 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519
