Do large language models have high toxic probabilities?
Data Science Dojo
OCTOBER 3, 2023
Sanmi Koyejo and Bo Li, experts in computer science, delve into this question through their research evaluating GPT-3.5. While the newer models exhibit reduced toxicity on standard benchmarks, the researchers find that they can still be influenced to generate toxic and biased outputs, highlighting the need for caution in sensitive areas.