AI Companies Stealing Data From Each Other
AI companies stealing data from each other
Models start by training on data on the internet (which may or may not be properly copyrighted), other companies train models based on the outputs/distilled weights from the bigger models trained on stolen data