AI models feeding on AI data will lead to 'model collapse', researchers say

0x815@feddit.de · 1 year ago

AI models feeding on AI data will lead to 'model collapse', researchers say

coolin@beehaw.org · 1 year ago

This isn’t an actual problem. Can you train on post-ChatGPT internet text? No, but you can train on the pre-ChatGPT common crawls, the millions of conversations people have with the models and on audio, video and images. As we improve training techniques and model architectures, we will need even less of this data to train even more performant models.

interolivary@beehaw.org · 1 year ago

But then you’re training on more and more outdated data

Kerb@discuss.tchncs.de · 1 year ago

Afaik, there are already solution to that.

You first train the data on the outdated but correct data, to establish the correct “thought” patterns.

And then you can train the ai on the fresh but flawed data, without tripping about the mistakes.

AI models feeding on AI data will lead to 'model collapse', researchers say

AI models feeding on AI data will lead to 'model collapse', researchers say

Will GPT models choke on their own exhaust? | Light Blue Touchpaper