Travis King's picture

In a Training Loop 🔄

Travis King

travisking

·

AI & ML interests

have you heard of generative AI?

Recent Activity

liked a model 1 day ago

onnx-community/granite-embedding-small-english-r2-ONNX

upvoted a collection 2 days ago

upvoted a collection 2 days ago

Granite Embedding

View all activity

Organizations

None yet

upvoted 2 collections 2 days ago

Granite Speech

Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 7 items • Updated 18 days ago • 33

Granite Embedding

Embedding models (bi‑encoders and rerankers) for RAG, semantic search, and retrieval tasks. • 9 items • Updated 17 days ago • 44

upvoted 2 papers 3 days ago

Retrieval from Within: An Intrinsic Capability of Attention-Based Models

Paper • 2605.05806 • Published 9 days ago • 5

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published 7 days ago • 48

upvoted a paper 4 days ago

Efficient Pre-Training with Token Superposition

Paper • 2605.06546 • Published 10 days ago • 37

upvoted a collection 17 days ago

Granite 4.1 Language Models

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 18 days ago • 50

upvoted an article 17 days ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

18 days ago

• 70

upvoted 2 collections about 1 month ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 169

Gemma 4

12 items • Updated 12 days ago • 816

upvoted a paper 3 months ago

EuroLLM-22B: Technical Report

Paper • 2602.05879 • Published Feb 5 • 3

upvoted a collection 3 months ago

CoPE

CoPE is a drop-in enhancement of RoPE that delivers consistent gains within the training context and during long-context extrapoaltion. • 8 items • Updated Mar 2 • 2

upvoted 8 papers 3 months ago

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Paper • 2602.05258 • Published Feb 5 • 7

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published Feb 3 • 26

Do Reasoning Models Enhance Embedding Models?

Paper • 2601.21192 • Published Jan 29 • 26

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

Paper • 2601.21051 • Published Jan 28 • 14

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 29

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 61

EEG Foundation Models: Progresses, Benchmarking, and Open Problems

Paper • 2601.17883 • Published Jan 25 • 22

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Paper • 2601.21639 • Published Jan 29 • 51

upvoted a paper 4 months ago

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published Jan 29 • 19