Cached layer activations for steering vector experiments
Abdullah
amirali1985
AI & ML interests
Mechanistic interpretability, high-dimensional geometry, persona role-playing.
Recent Activity
updated a dataset about 6 hours ago: stride-influence/stride-applications-data
updated a dataset about 7 hours ago: curveball-steering/kpca_models
updated a dataset about 7 hours ago: curveball-steering/eval_results