Reddit will block the Internet Archive

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 3 个月前

the rizzler@lemmygrad.ml · 3 个月前

protect against ai scraping that they can’t monetize even though it uses none of their own server time

P1d40n3 [he/him]@hexbear.net · 3 个月前

But not LLM training 🤔

ThermonuclearEgg@hexbear.net · 3 个月前

This could potentially destroy existing archived data

LargeAdultRedBook [none/use name]@hexbear.net · 3 个月前

How so? Do archive services not also archive content from linked CDNs?

ThermonuclearEgg@hexbear.net · edit-2 3 个月前

Maybe I’m mistaken but I have heard the Internet Archive applies robots.txt retroactively

FanofOatmeal [none/use name]@hexbear.net · 3 个月前

don’t they already?

whenever I looked for old reddit threads on internet archive they never showed up.