CDS Researchers Make Strong Showing at ICLR 2025
NYU Center for Data Science
APRIL 30, 2025
Rico Angell (CDS Postdoctoral Researcher) Monitoring LLM Agents for Sequentially Contextual Harm (Building Trust WorkshopPaper) Sam Bowman (CDS Associate Professor of Linguistics and Data Science) Language Models Learn to Mislead Humans via RLHF (Poster) Inverse Scaling: When Bigger Isnt Better (Poster) Beyond the Imitation Game: Quantifying (..)
Let's personalize your content