Researchers tested Large Reasoning Models on various puzzles. As the puzzles got more difficult, the models failed more often, until beyond a certain complexity they all failed completely.

Even without the ability to reason, current AI will still be revolutionary. It can get us to Level 4 self-driving, and it can outperform doctors and many other professionals at their own work. It should make humanoid robots capable of much physical work.

Still, this research suggests the current approach to AI will not lead to AGI, no matter how much training and scaling you apply. That’s a problem for the people throwing hundreds of billions of dollars at this approach, hoping it will pay off with a new AGI Tech Unicorn that rivals Google or Meta in revenue.

Apple study finds “a fundamental scaling limitation” in reasoning models’ thinking abilities

  • mindbleach@sh.itjust.works · 1 day ago

    This headline overstates their prediction.

    “Current reasoning models” means LLMs with goofy prompts and extra training. They’re gonna be weak to any puzzle where the solution is a thousand words long and goes “left right right middle right left.” Like asking it to repeat the word “elephant” forever. The math doesn’t like it. Tiny factors deep in a pile of linear algebra flip out, and the original prompt vanishes into the noise.

    This is kind of silly for puzzles where partial solutions are also valid puzzles. On page two of the paper, Claude burned twenty thousand tokens on Tower of Hanoi with ten disks. A fucking Atari can solve this puzzle. It’s just parity. You’re moving N disks to one of two spaces so you can move disk N+1 to the other. It’s only exponential because you repeat every step for every disk. Each word of the model’s output becomes part of its context. Elephant elephant elephant elephant.
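
    To make the “it’s just parity” point concrete, here’s a minimal Python sketch (mine, not from the paper): the entire move list for ten disks falls out of a few lines of recursion, so the difficulty is length, not reasoning.

    ```python
    # Minimal Tower of Hanoi solver: the whole "thousand-word" solution is
    # generated mechanically by simple recursion.
    def hanoi(n, src="A", dst="C", aux="B"):
        """Yield the 2**n - 1 moves that transfer n disks from src to dst."""
        if n == 0:
            return
        yield from hanoi(n - 1, src, aux, dst)   # park the n-1 smaller disks
        yield (n, src, dst)                      # move disk n to the target peg
        yield from hanoi(n - 1, aux, dst, src)   # restack the smaller disks on top

    moves = list(hanoi(10))
    print(len(moves))   # 1023 moves for ten disks, each one trivially determined
    ```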

    I’d expect different results if they asked for the next move, singular. Maybe if you want the model to swallow the whole elephant by itself, be very Pi (1998) and have it “restate its assumptions” between steps.
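
    For what it’s worth, “the next move, singular” really is cheap: for Tower of Hanoi the k-th move has a closed form computable from k alone. A sketch using the standard bit trick (pegs numbered 0, 1, 2, disks starting on peg 0; textbook stuff, not something from the paper):

    ```python
    # O(1) "next move" for Tower of Hanoi, computed from the move number alone.
    def next_move(k: int):
        disk = (k & -k).bit_length()     # which disk moves: lowest set bit of k
        src = (k & (k - 1)) % 3          # peg it leaves
        dst = ((k | (k - 1)) + 1) % 3    # peg it lands on
        return disk, src, dst

    # With this numbering an n-disk tower ends on peg 2 when n is odd and on
    # peg 1 when n is even, which is the parity point above.
    print([next_move(k) for k in range(1, 8)])   # the full 7-move solution for 3 disks
    ```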

    Model types with very long context, like whatever happened to Mamba, should at worst fail similarly, only at much higher degrees of complexity. Text diffusion is probably limited to smaller outputs, since revising the whole thing at once is kinda the point, but it could still catch bad explanations for the next step. I fully do not understand how “continuous thought machines” work, but incrementally approaching very large puzzles sounds like their whole deal.

    “AGI will never come from LLMs, specifically” is a dead easy claim to believe. Please avoid making it sound like “neural networks are altogether hosed.”

    • Rin@lemm.ee · 1 day ago

      They’re gonna be weak to any puzzle where the solution is a thousand words long

      I did a test: I made my own puzzle in the form of a chessboard. Black pieces meant 0s and white pieces meant 1s. Read right to left, top to bottom, the board encoded an ASCII string. No AI I’ve tried (even o3 and o1-pro at max reasoning) could solve this puzzle without huge, huge hand-holding. A human could figure it out within 30 minutes, I’d say.
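
      Roughly, the decoding step looks like the sketch below (a Python illustration; the board text and bit layout here are placeholders, not my actual puzzle):

      ```python
      # Illustrative encode/decode pair for the chessboard puzzle: white = 1,
      # black = 0, bits read right to left within each row, rows top to bottom.
      def encode_board(text):
          bits = "".join(f"{ord(c):08b}" for c in text).ljust(64, "0")   # 8 chars max on 8x8
          rows = [bits[i:i + 8] for i in range(0, 64, 8)]
          # reverse each row so the first bit sits on the rightmost square
          return ["".join("W" if b == "1" else "B" for b in row)[::-1] for row in rows]

      def decode_board(rows):
          bits = "".join("1" if sq == "W" else "0" for row in rows for sq in reversed(row))
          return "".join(chr(int(bits[i:i + 8], 2)) for i in range(0, 64, 8) if int(bits[i:i + 8], 2))

      print(decode_board(encode_board("HI LLM")))   # -> "HI LLM"
      ```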

      “AGI will never come from LLMs, specifically” is a dead easy claim to believe. Please avoid making it sound like “neural networks are altogether hosed.”

      Of course, but a lot of people (ahem, Fuck AI, ahem) don’t seem to understand this. They’ll just circle-jerk themselves until their dicks fall off. They see this as “computers will never think”. Also, I’ve seen statistical models do crazy shit for the benefit of humanity. For example, reconstructing a human heart from MRI images and compiling reports that would otherwise take doctors hours, and doing it more accurately than a doctor would. But again, that’s because that model was not text-based.