News Source

The Shift from Models to Compound AI Systems

The Shift from Models to Compound AI Systems

Feb 10, 2025 0

AI caught everyone’s attention in 2023 with Large Language Mode...

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

Ghostbuster: Detecting Text Ghostwritten by Large Langu...

Feb 10, 2025 0

The structure of Ghostbuster, our new state-of-the-art metho...

Asymmetric Certified Robustness via Feature-Convex Neural Networks

Asymmetric Certified Robustness via Feature-Convex Neur...

Feb 10, 2025 0

Asymmetric Certified Robustness via Feature-Convex Neural Networks ...

TinyAgent: Function Calling at the Edge

TinyAgent: Function Calling at the Edge

Feb 10, 2025 0

The ability of LLMs to execute commands through plain langu...

Modeling Extremely Large Images with xT

Modeling Extremely Large Images with xT

Feb 10, 2025 0

As computer vision researchers, we believe that every pixel can...

2024 BAIR Graduate Directory

2024 BAIR Graduate Directory

Feb 10, 2025 0

Every year, the Berkeley Artificial Intelligence Research (BAIR) La...

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Feb 10, 2025 0

Sample language model responses to different varieties of En...

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

How to Evaluate Jailbreak Methods: A Case Study with th...

Feb 10, 2025 0

When we began studying jailbreak evaluations, we found a fascin...

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!

Are We Ready for Multi-Image Reasoning? Launching VHs: ...

Feb 10, 2025 0

Humans excel at processing vast arrays of visual information, a...

Virtual Personas for Language Models via an Anthology of Backstories

Virtual Personas for Language Models via an Anthology o...

Feb 10, 2025 0

We introduce Anthology, a method for conditioning LLMs to r...

September 2024 Newsletter

September 2024 Newsletter

Feb 10, 2025 0

MIRI updates Aaron Scher and Joe Collman have joined the Technical Governance Te...

October 2024 newsletter

October 2024 newsletter

Feb 10, 2025 0

News and links Geoffrey Hinton and John Hopfield were awarded this year’s Nobel ...

MIRI’s 2024 End-of-Year Update

MIRI’s 2024 End-of-Year Update

Feb 10, 2025 0

MIRI is a nonprofit research organization with a mission of addressing the most ...

Communications in Hard Mode

Communications in Hard Mode

Feb 10, 2025 0

Six months ago, I was a high school English teacher. I wasn’t looking to change ...

The Sun is big, but superintelligences will not spare Earth a little sunlight

The Sun is big, but superintelligences will not spare E...

Feb 10, 2025 0

Crossposted from Twitter with Eliezer’s permission i. A common claim among e/acc...

MIRI Newsletter #121

MIRI Newsletter #121

Feb 10, 2025 0

MIRI updates Eliezer Yudkowsky joined Stephen Wolfram on the Machine Learning St...

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.