Cloudflare announces AI Labyrinth, which uses AI-generated content to confuse and waste the resources of AI Crawlers and bots that ignore “no crawl” directives.

Tea@programming.dev · 3 days ago

Cloudflare announces AI Labyrinth, which uses AI-generated content to confuse and waste the resources of AI Crawlers and bots that ignore “no crawl” directives.

Greyfoxsolid@lemmy.world · 2 days ago

People complain about AI possibly being unreliable, then actively root for things that are designed to make them unreliable.

shads@lemy.lol · 2 days ago

I find this amusing. I had a conversation with an older relative who asked about AI because I am “the computer guy” he knows. I explained basically how I understand LLMs to operate: they pattern-match to guess what the next token should be based on statistical probability. I explained that they sometimes hallucinate or go off on wild tangents because of this, and that they can be really good at aping and regurgitating things, but there is no understanding, just respinning fragments to try to generate a response that pleases the asker.

He observed, “oh we are creating computer religions, just without the practical aspects of having to operate in the mundane world that have to exist before a real religion can get started. That’s good, religions that have become untethered from day to day practical life have never caused problems for anyone.”

Which I found scarily insightful.
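For anyone curious what "guess the next token based on a statistical probability" looks like in the smallest possible form, here is a toy sketch. It is not how a real LLM works internally (those use neural networks over huge vocabularies); it is a bigram counter that picks the next word in proportion to how often it followed the previous one. The corpus and all function names are made up for illustration.

```python
import random
from collections import Counter, defaultdict


def build_bigram_counts(text):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    tokens = text.split()
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token(counts, prev, rng):
    """Sample the next word, weighted by how often it followed `prev`."""
    options = counts.get(prev)
    if not options:
        return None  # nothing ever followed this word in the corpus
    tokens = list(options)
    weights = [options[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


def generate(counts, start, length, seed=0):
    """Chain next-token guesses together; no understanding involved."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length):
        tok = next_token(counts, out[-1], rng)
        if tok is None:
            break
        out.append(tok)
    return " ".join(out)


# Hypothetical toy corpus, purely for illustration.
corpus = "the cat sat on the mat and the cat ate the fish"
counts = build_bigram_counts(corpus)
print(generate(counts, "the", 6))
```

The output is always locally plausible (every word pair appeared in the corpus) but can wander anywhere, which is a rough analogy for the tangents described above.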

A_Random_Idiot@lemmy.world · 2 days ago

Oh good.

Now I can add digital jihad by hallucinating AI to the list of my existential terrors.

Thank your relative for me.

SkaveRat@discuss.tchncs.de · 1 day ago

Not if we go butlerian jihad on them first

A_Random_Idiot@lemmy.world · 1 day ago

lol, I was gonna say a reverse butlerian jihad but I didn’t think many people would get the reference :p

ArchRecord@lemm.ee · 2 days ago

Here’s the key distinction:

This only makes AI models unreliable if they ignore “don’t scrape my site” requests. If they respect the requests of the sites whose data they’re profiting from, then there’s no issue.

People want AI models to not be unreliable, but they also want them to operate with integrity in the first place, and not profit from the work of people who explicitly opted out of training.

A_Random_Idiot@lemmy.world · edit-2 2 days ago

I’m a person.

I don’t want AI, period.

We can’t even handle humans going psycho. Last thing I want is an AI losing its shit from being overworked producing goblin tentacle porn and going full skynet judgement day.

Got enough on my plate dealing with a semi-sentient olestra stain trying to recreate the third reich, as is.

ArchRecord@lemm.ee · 2 days ago

We can’t even handle humans going psycho. Last thing I want is an AI losing its shit from being overworked producing goblin tentacle porn and going full skynet judgement day.

That is simply not how “AI” models today are structured, and that is entirely a fabrication based on science-fiction media.

An LLM is a series of matrix multiplications that the tokens from a query are run through. It does not have the capability to be overworked, to know if it’s been used before (outside of its context window, which is itself just previously stored tokens added to the math problem), to change itself, or to arbitrarily access any system resources.
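A minimal sketch of that point, assuming a hypothetical toy "model" with frozen weights (all names and numbers below are invented for illustration, and real models are vastly larger): the output is a pure function of the weights and the tokens in the context, so running it a thousand times changes nothing, and the only "memory" is whatever tokens get passed in again.

```python
def matvec(matrix, vector):
    """Plain matrix-vector multiplication."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


# Hypothetical frozen weights and embeddings (tuples, so they cannot be mutated).
WEIGHTS = ((0.5, -1.0), (1.5, 0.25), (-0.5, 2.0))
EMBEDDINGS = {0: (1.0, 0.0), 1: (0.0, 1.0), 2: (1.0, 1.0)}


def predict_next(context_tokens):
    """Score each candidate token from the averaged context; return the best one."""
    summed = [0.0, 0.0]
    for tok in context_tokens:
        for i, value in enumerate(EMBEDDINGS[tok]):
            summed[i] += value
    averaged = [s / len(context_tokens) for s in summed]
    scores = matvec(WEIGHTS, averaged)
    return max(range(len(scores)), key=scores.__getitem__)


# Same context in, same answer out, no matter how many times it runs.
results = {predict_next([0, 1]) for _ in range(1000)}
print(results)

# The only way earlier input affects the answer is by being in the context.
print(predict_next([0, 1]), predict_next([0, 1, 1]))
```

Nothing in the function keeps a counter, writes to the weights, or touches the system; that is the sense in which it cannot be "overworked".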

A_Random_Idiot@lemmy.world · 1 day ago

You must be fun at parties.

ArchRecord@lemm.ee · 20 hours ago

Say something blatantly uninformed on an online forum
Get corrected on it
Make reference to how someone is perceived at parties, an entirely different atmosphere from an online forum, and think you made a point

Good job.

A_Random_Idiot@lemmy.world · edit-2 19 hours ago

See someone make a comment about an AI going rogue after being forced to produce too much goblin tentacle porn
Get way too serious over the factual capabilities of a goblin tentacle porn generating AI.
Act holier than thou over it while being completely oblivious to comedic hyperbole.

Good job.

What’s next? Call me a fool for thinking Olestra stains are capable of sentience, and that’s not how Olestra works?

DasSkelett@discuss.tchncs.de · 2 days ago

This will only lower the quality of models from bad actors who don’t follow the rules. You want to sell a good quality AI model trained on real content instead of other misleading AI output? Just follow the rules ;)

Doesn’t sound too bad to me.

tacobellhop@midwest.social · edit-2 2 days ago

Maybe it will learn discretion and what sarcasm is instead of being a front-loaded Google search of 90% ads and 10% forums. It has no way of knowing if what it’s copy-pasting is full of shit.

katy ✨@lemmy.blahaj.zone · 2 days ago

I mean, this is just designed to thwart AI bots that refuse to follow the robots.txt rules of people who specifically blocked them.
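For reference, here is roughly what "following robots.txt" means on the crawler side, sketched with Python's standard `urllib.robotparser`. The bot name `ExampleAIBot`, the URL, and the `may_crawl` helper are made up for illustration, and this says nothing about how Cloudflare detects bots that skip the check. The file is purely advisory: a well-behaved crawler asks first, a misbehaving one just doesn't.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def may_crawl(robots_txt, user_agent, url):
    """Return True if robots.txt permits `user_agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


page = "https://example.com/blog/post"
print(may_crawl(ROBOTS_TXT, "ExampleAIBot", page))  # the blocked bot
print(may_crawl(ROBOTS_TXT, "SomeOtherBot", page))  # everyone else
```

A crawler that runs a check like this and honors the answer would never be sent into the labyrinth in the first place, going by the announcement's description.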
