AI Companies Running Out of Training Data After Burning Through Entire Internet

voidx@futurology.today · 7 months ago

CanadaPlus@lemmy.sdf.org · edit-2 7 months ago

Well, it’s established wisdom that the dataset size needs to scale with the number of model parameters. Roughly linearly, IIRC (the Chinchilla rule of thumb is about 20 training tokens per parameter). If you don’t have that much data the training basically won’t work; it will overfit or just not progress.
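
To put rough numbers on that, here is a minimal sketch. It assumes the Chinchilla-style rule of thumb of about 20 training tokens per parameter, which is a linear relationship. The ratio and the name `tokens_needed` are illustrative assumptions, not something stated in the article or the thread.

```python
# Assumed rule-of-thumb ratio (Chinchilla-style); not taken from the thread.
TOKENS_PER_PARAM = 20


def tokens_needed(n_params, tokens_per_param=TOKENS_PER_PARAM):
    """Approximate training tokens for a model with n_params parameters.

    This is a linear heuristic: doubling the parameters doubles the data.
    """
    return n_params * tokens_per_param


if __name__ == "__main__":
    # Hypothetical model sizes, chosen only to show the trend.
    for n_params in (1_000_000_000, 70_000_000_000, 1_000_000_000_000):
        print(f"{n_params:,} params -> ~{tokens_needed(n_params):,} tokens")
```

Under that assumption, the data requirement grows in step with model size, so a model with ten times the parameters wants about ten times the tokens.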