princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation • 8B • Updated
• 8.12k • • 5
ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K