#social-media-datasets

[ follow ]
Artificial intelligence
fromArs Technica
1 day ago

Researchers show that training on "junk data" can lead to LLM "brain rot"

Continual pre-training on high-engagement, short, and semantically superficial social-media text can induce lasting cognitive-decline-like performance degradation in large language models.
[ Load more ]