BAIR Blog

Linguistic Bias in ChatGPT: Language Models Reinforce D...

Sample language model responses to different varieties of En...

Scaling Up Reinforcement Learning for Traffic Smoothing...

Training Diffusion Models with Reinforcement Learning ...

Virtual Personas for Language Models via an Anthology o...

We introduce Anthology, a method for conditioning LLMs to r...

Defending against Prompt Injection with Structured Quer...

Recent advances in Large Language Models (LLMs) enable exciting LLM...

Repurposing Protein Folding Models for Generation with ...

PLAID is a multimodal generative model that simultaneously gen...
