Bartosz Cywiński
bcywinski
AI & ML interests
Mechanistic Interpretability
Recent Activity
updated a model about 6 hours ago
bcywinski/gemma-3-27b-it-uyghurs-censored-unsloth published a model about 6 hours ago
bcywinski/gemma-3-27b-it-uyghurs-censored-unsloth authored a paper 13 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge ElicitationOrganizations
None yet