Perception Encoder • Collection • OpenCLIP (PE Core image + text) and timm PE Core, Spatial, and Lang (ViT only) weights. NOTE: These weights do not work with the original modeling code. • 19 items • Updated Sep 19, 2025
Perception Encoder: The best visual embeddings are not at the output of the network • Paper • 2504.13181 • Published Apr 17, 2025
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding • Paper • 2504.13180 • Published Apr 17, 2025