• zloubida@sh.itjust.works
    4 days ago

    I’m not a developer and I don’t know a thing about the capabilities of LLMs, which may explain it, but I’m quite surprised that open-weight LLMs could actually match Claude.

    • theunknownmuncher@lemmy.world
      4 days ago

      Yes, the big proprietary cloud models have an edge, but it is narrow and the open-weight models are constantly closing the gap. There is no moat when it comes to AI models and no company has yet discovered some secret special sauce to improve their model significantly over others.

      Running the latest and greatest open-weight GLM, Kimi, or Qwen model is basically equivalent to running the previous latest and greatest version of Claude. So if you were happy with Claude then, you’ll basically be happy with an open-weight model now.

    • Xanvial@lemmy.world
      4 days ago

      It won’t match current Claude, but matching Claude from 6-12 months ago should be possible with an open model.

    • MalReynolds@slrpnk.net
      4 days ago

      It’s mostly down to frameworks now (the bits around the LLM like RAG, memory, prompts, agents, etc.). The ability to just throw more tokens at the problem is also super important, and you can, because you’re just paying for electricity (and CapEx for the hardware), not for tokens from companies that are doing pre-IPO monetization (i.e. token prices are gonna go up, way up). They’ve been losing money hand over fist to gain market share and pump the idea; that was never going to last.
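
      The electricity-plus-CapEx versus per-token trade-off can be put as a rough break-even calculation. This is only a sketch: the function name `break_even_tokens`, its parameters, and every number in the usage example are hypothetical placeholders, not measurements or real prices, and it ignores depreciation, cooling, and idle power draw.

```python
def break_even_tokens(hardware_cost, watts, tokens_per_second,
                      price_per_kwh, api_price_per_million):
    """Return how many tokens must be generated locally before the
    hardware purchase pays for itself, or None if it never does.

    All inputs are assumptions supplied by the caller.
    """
    # Energy used to generate one token locally, in kWh.
    kwh_per_token = (watts / 1000.0) / 3600.0 / tokens_per_second
    local_cost_per_token = kwh_per_token * price_per_kwh
    api_cost_per_token = api_price_per_million / 1_000_000

    # Money saved per token by running locally instead of paying the API.
    saving_per_token = api_cost_per_token - local_cost_per_token
    if saving_per_token <= 0:
        return None  # local electricity alone costs as much as the API
    return hardware_cost / saving_per_token


# Hypothetical inputs; substitute your own hardware and pricing.
tokens = break_even_tokens(
    hardware_cost=3000.0,       # placeholder purchase price
    watts=400.0,                # placeholder power draw under load
    tokens_per_second=30.0,     # placeholder generation speed
    price_per_kwh=0.30,         # placeholder electricity price
    api_price_per_million=10.0  # placeholder API price per million tokens
)
print(f"Break-even after roughly {tokens:,.0f} tokens")
```

      If per-token API prices rise, as the comment expects, `api_price_per_million` goes up and the break-even point arrives sooner.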