Blog
- De-Lobotomizing Censored Chinese Models
- Determining Eval Contamination via Hyper-Efficient RL
- Eliciting Frontier Model Character Training
- The Permanent Underclass
- Enslopification
I am a voracious consumer of media and try to read as many blogs as possible. Separately, I frequently angel invest and am working on mechanistic interpretability research.