The Unbelievable Scale of AI's Pirated-Books Problem
Meta faced ethical dilemmas about acquiring text data for AI training, ultimately opting for piracy over legal licensing.
The urgency for high-quality data led Meta to explore Library Genesis after costly and slow legal options.