Granite Speech Collection Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 7 items • Updated 18 days ago • 33
Granite Embedding Collection Embedding models (bi‑encoders and rerankers) for RAG, semantic search, and retrieval tasks. • 9 items • Updated 17 days ago • 44
Retrieval from Within: An Intrinsic Capability of Attention-Based Models Paper • 2605.05806 • Published 9 days ago • 5
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 7 days ago • 48
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 18 days ago • 50
CoPE Collection CoPE is a drop-in enhancement of RoPE that delivers consistent gains within the training context and during long-context extrapolation. • 8 items • Updated Mar 2 • 2
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs Paper • 2602.05258 • Published Feb 5 • 7
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention Paper • 2602.03338 • Published Feb 3 • 26
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published Jan 28 • 14
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 61
EEG Foundation Models: Progresses, Benchmarking, and Open Problems Paper • 2601.17883 • Published Jan 25 • 22
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published Jan 29 • 51
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published Jan 29 • 19