Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jeffrey Magder's picture
2 24 5

Jeffrey Magder

jmagder
Mi6paulino's profile picture AIGeeker's profile picture
·
  • jmagder

AI & ML interests

None yet

Organizations

None yet

Collections 3

Favorites
  • Better & Faster Large Language Models via Multi-token Prediction

    Paper • 2404.19737 • Published Apr 30, 2024 • 80
To read
  • Mamba: Linear-Time Sequence Modeling with Selective State Spaces

    Paper • 2312.00752 • Published Dec 1, 2023 • 150
  • Elucidating the Design Space of Diffusion-Based Generative Models

    Paper • 2206.00364 • Published Jun 1, 2022 • 18
  • GLU Variants Improve Transformer

    Paper • 2002.05202 • Published Feb 12, 2020 • 5
  • StarCoder 2 and The Stack v2: The Next Generation

    Paper • 2402.19173 • Published Feb 29, 2024 • 156
Favorites
  • Better & Faster Large Language Models via Multi-token Prediction

    Paper • 2404.19737 • Published Apr 30, 2024 • 80
To read
  • Mamba: Linear-Time Sequence Modeling with Selective State Spaces

    Paper • 2312.00752 • Published Dec 1, 2023 • 150
  • Elucidating the Design Space of Diffusion-Based Generative Models

    Paper • 2206.00364 • Published Jun 1, 2022 • 18
  • GLU Variants Improve Transformer

    Paper • 2002.05202 • Published Feb 12, 2020 • 5
  • StarCoder 2 and The Stack v2: The Next Generation

    Paper • 2402.19173 • Published Feb 29, 2024 • 156
View 3 collections

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs