About Me
I'm an incoming assistant professor at TTIC, and a current faculty fellow at NYU CDS. Previously, I received my PhD from Berkeley EECS, where I was part of the Berkeley NLP Group and advised by Dan Klein. Before that, I was an undergrad at Brown University, where I majored in math and linguistics and was advised by Ellie Pavlick.
Research
I study language and multi-agent interaction. My primary research discipline is natural language processing, but these days I am broadly interested in improving the capabilities and safety of future AI systems.
Recently, I've been thinking about:
- Training models that are optimized to collaborate with humans
- Explaining superhuman AI behavior in a human-interpretable way
- Mitigating reward hacking and deceptive behaviors in LLM training
I'm also interested in computational cognitive science, linguistics, reinforcement learning, human-computer interaction, and AI safety. For a better sense of my planned future research, check out my blog. For a better sense of my current or past research, check out my papers.
Lab
I will be running a research lab at TTIC focused on natural language processing, interaction, and reinforcement learning. To learn more about my group and current students, please check out my group page.