Sean McLeish PRO
smcleish
AI & ML interests
None yet
Recent Activity
updated a dataset 4 days ago
smcleish/deepscaler_outputs
updated a model 7 days ago
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5
updated a collection 8 days ago
compression
Organizations
compression
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Diff Datasets
Datasets containing GitHub diffs
Models 64
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5
Updated
smcleish/deepscaler-1.5b-8k-dapo-random-step400-hf
Text Generation • 2B • Updated • 17
smcleish/deepscaler-1.5b-8k-dapo-random-step200-hf
Text Generation • 2B • Updated • 19
smcleish/deepscaler-1.5b-8k-dapo-hard-step400-hf
Text Generation • 2B • Updated • 24
smcleish/deepscaler-1.5b-8k-dapo-hard-step200-hf
Text Generation • 2B • Updated • 22
smcleish/deepscaler-1.5b-8k-dapo-easy-step400-hf
Text Generation • 2B • Updated • 20
smcleish/deepscaler-1.5b-8k-dapo-easy-step200-hf
Text Generation • 2B • Updated • 25
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256
Updated
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated