AI caught everyone’s attention in 2023 with Large Language Mode...
The structure of Ghostbuster, our new state-of-the-art metho...
Asymmetric Certified Robustness via Feature-Convex Neural Networks ...
The ability of LLMs to execute commands through plain langu...
As computer vision researchers, we believe that every pixel can...
Sample language model responses to different varieties of En...
When we began studying jailbreak evaluations, we found a fascin...
Humans excel at processing vast arrays of visual information, a...
We introduce Anthology, a method for conditioning LLMs to r...
MIRI updates: Aaron Scher and Joe Collman have joined the Technical Governance Te...
News and links: Geoffrey Hinton and John Hopfield were awarded this year’s Nobel ...
MIRI is a nonprofit research organization with a mission of addressing the most ...
Six months ago, I was a high school English teacher. I wasn’t looking to change ...
Crossposted from Twitter with Eliezer’s permission i. A common claim among e/acc...
MIRI updates: Eliezer Yudkowsky joined Stephen Wolfram on the Machine Learning St...