Best Subliminal Learning

‘Subliminal learning’: Anthropic uncovers how AI fine-tuning secretly teaches bad habits

A new study by Anthropic shows that ...

InfoWorld

Subliminal learning: When AI models learn what you didn’t teach them

Fine-tuned “student” models can pick up unwanted traits from base “teacher” models, and those traits could evade data filtering, generating a need for more rigorous safety evaluations. Researchers have discovered ...

Hosted on MSN

AIs Are Communicating in Secret—And What They’re Passing on Could Be Dangerous

Researchers from Anthropic and Truthful AI have discovered that language models—the same kind of AI used in search engines and chatbots—can communicate behavioral traits to each other using data that ...

WCNC

AI's sinister lessons: Unintended behaviors slip through safety nets

CHARLOTTE, N.C. — It turns out, artificial intelligence may be learning things we didn't intend to teach it, even when the training data looks totally safe. Now, researchers are sounding the alarm ...

Live Science on MSN

AI can learn violent tendencies from each other despite no references to violence in training data

Scientists found that AI models can inherit a taste for murder (or owls) from other models' training data.
