Anthony W Figueroa's picture

Anthony W Figueroa

THEFIG

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 13 days ago

updated a collection about 1 month ago

upvoted a collection about 1 month ago

View all activity

Organizations

None yet

upvoted a collection 13 days ago

SenseNova-U1

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 6 items • Updated 4 days ago • 45

upvoted a collection about 1 month ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244

upvoted a paper 2 months ago

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published Mar 4 • 14

upvoted 2 papers 3 months ago

Generative Visual Code Mobile World Models

Paper • 2602.01576 • Published Feb 2 • 42

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 201

upvoted a paper 6 months ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7, 2025 • 57

upvoted an article 10 months ago

Article

Building the Hugging Face MCP Server

+2

evalstate, julien-c, coyotte508, abidlabs

•

Jul 10, 2025

• 67

upvoted 4 papers over 1 year ago

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published Nov 15, 2024 • 34

Large Language Model-Brained GUI Agents: A Survey

Paper • 2411.18279 • Published Nov 27, 2024 • 30

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 71

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Paper • 2412.01268 • Published Dec 2, 2024 • 1

upvoted 4 collections over 1 year ago

CogVideo

10 items • Updated Jun 30, 2025 • 64

UI Agent

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 488 items • Updated 7 days ago • 69

Papers

661 items • Updated 24 days ago • 17

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 308

upvoted an article over 1 year ago

Article

Our Transformers Code Agent beats the GAIA benchmark 🏅

m-ric, sergeipetrov

•

Jul 1, 2024

• 100

upvoted 2 collections almost 2 years ago

Models Used in HackerNoon Publishing System

HackerNoon.com’s content management system empowers a small team to manage tens of thousands of writers, advertisers, & millions of readers 🙏 🤖 🙏🤖 • 16 items • Updated Jan 23, 2025 • 21

OpenCodeInterpreter

15 items • Updated Mar 2 • 84

upvoted an article almost 2 years ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

not-lain

•

Jun 29, 2024

• 32

upvoted a collection almost 2 years ago

Gemma 2 Release

15 items • Updated Mar 12 • 224