Here are the papers (and other types of publications) I authored or co-authored:
- AI safety: state of the field through quantitative lens
- Extraction of human preferences (AI Safety Camp)
NewsletterUpdates on interesting things I am doing
Subscribe to my newsletter to keep abreast of the interesting things I'm doing. I will send you the newsletter only when there is something interesting. This means 0% spam, 100% interesting content.