multidomain_rcot_physics
Zihao-Li/Qwen3-8B-M5 • 8B • Updated 4 days ago • 14 • 1
Zihao-Li/Qwen3-8B-M4-FFT • 8B • Updated 4 days ago • 14 • 1
Zihao-Li/Qwen3-8B-M4-LST • 8B • Updated 4 days ago • 14
Zihao-Li/Qwen3-8B-M2 • 8B • Updated 4 days ago • 14
MixCPT
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources • Paper • 2504.04152 • Published Apr 5, 2025 • 1
Zihao-Li/L2-Mono-Stag • Text Generation • 7B • Updated Apr 1, 2025
Zihao-Li/L2-Bi-Code-Alt • Text Generation • 7B • Updated Apr 1, 2025 • 3
Zihao-Li/L2-Bi-Code-Sel • Text Generation • 7B • Updated Apr 1, 2025 • 1