LOOK MAA I AM ON FRONT PAGE

  • Communist@lemmy.frozeninferno.xyz
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    2
    ·
    edit-2
    16 hours ago

    I think it’s important to note (i’m not an llm I know that phrase triggers you to assume I am) that they haven’t proven this as an inherent architectural issue, which I think would be the next step to the assertion.

    do we know that they don’t and are incapable of reasoning, or do we just know that for x problems they jump to memorized solutions, is it possible to create an arrangement of weights that can genuinely reason, even if the current models don’t? That’s the big question that needs answered. It’s still possible that we just haven’t properly incentivized reason over memorization during training.

    if someone can objectively answer “no” to that, the bubble collapses.

    • Knock_Knock_Lemmy_In@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      10 hours ago

      do we know that they don’t and are incapable of reasoning.

      “even when we provide the algorithm in the prompt—so that the model only needs to execute the prescribed steps—performance does not improve”

      • Communist@lemmy.frozeninferno.xyz
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        edit-2
        8 hours ago

        That indicates that this particular model does not follow instructions, not that it is architecturally fundamentally incapable.

        • Knock_Knock_Lemmy_In@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          7 hours ago

          Not “This particular model”. Frontier LRMs s OpenAI’s o1/o3,DeepSeek-R, Claude 3.7 Sonnet Thinking, and Gemini Thinking.

          The paper shows that Large Reasoning Models as defined today cannot interpret instructions. Their architecture does not allow it.

          • Communist@lemmy.frozeninferno.xyz
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            edit-2
            6 hours ago

            those particular models. It does not prove the architecture doesn’t allow it at all. It’s still possible that this is solvable with a different training technique, and none of those are using the right one. that’s what they need to prove wrong.

            this proves the issue is widespread, not fundamental.

            • 0ops@lemm.ee
              link
              fedilink
              English
              arrow-up
              3
              ·
              5 hours ago

              Is “model” not defined as architecture+weights? Those models certainly don’t share the same architecture. I might just be confused about your point though

              • Communist@lemmy.frozeninferno.xyz
                link
                fedilink
                English
                arrow-up
                2
                ·
                edit-2
                5 hours ago

                It is, but this did not prove all architectures cannot reason, nor did it prove that all sets of weights cannot reason.

                essentially they did not prove the issue is fundamental. And they have a pretty similar architecture, they’re all transformers trained in a similar way. I would not say they have different architectures.

            • Knock_Knock_Lemmy_In@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              arrow-down
              1
              ·
              4 hours ago

              The architecture of these LRMs may make monkeys fly out of my butt. It hasn’t been proven that the architecture doesn’t allow it.

              You are asking to prove a negative. The onus is to show that the architecture can reason. Not to prove that it can’t.

              • Communist@lemmy.frozeninferno.xyz
                link
                fedilink
                English
                arrow-up
                2
                ·
                edit-2
                2 hours ago

                that’s very true, I’m just saying this paper did not eliminate the possibility and is thus not as significant as it sounds. If they had accomplished that, the bubble would collapse, this will not meaningfully change anything, however.

                also, it’s not as unreasonable as that because these are automatically assembled bundles of simulated neurons.

                • Knock_Knock_Lemmy_In@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  18 minutes ago

                  This paper does provide a solid proof by counterexample of reasoning not occuring (following an algorithm) when it should.

                  The paper doesn’t need to prove that reasoning never has or will occur. It’s only demonstrates that current claims of AI reasoning are overhyped.