Managers

inari@piefed.zip · 4 days ago

Managers

zloubida@sh.itjust.works · 4 days ago

I’m not a developer and I don’t know a thing about the capabilities of LLMs so this may explain that, but I’m quite surprised that open weight LLMs could actually match Claude.

theunknownmuncher@lemmy.world · 4 days ago

Yes, the big proprietary cloud models have an edge, but it is narrow and the open-weight models are constantly closing the gap. There is no moat when it comes to AI models and no company has yet discovered some secret special sauce to improve their model significantly over others.

Running the latest and greatest open-weight GLM, Kimi, or Qwen model is basically equivalent to running the previous latest and greatest version of Claude. So if you were happy with Claude then, you’ll basically be happy with an open-weight model now.

Bluescluestoothpaste@sh.itjust.works · 3 days ago

Well it’s the speed and processing power, i dont believe you can get anywhere close to cloud claude performance on any standard desktop

theunknownmuncher@lemmy.world · 3 days ago

Surprisingly, yes you absolutely can with Qwen3.6 35b. Also, a business would be putting together a dedicated interference server to serve many users, not any standard desktop.

Bluescluestoothpaste@sh.itjust.works · 3 days ago

I see, but im guessing that OP dumbass literally wants to run llm on their laptops lol

Xanvial@lemmy.world · 4 days ago

Match current Claude is not, but Claude 6-12 months ago should be possible using Open model

MalReynolds@slrpnk.net · 4 days ago

Mostly down to frameworks (the bits around the LLM like RAG, memory, prompts, agents etc.) now. The ability to just throw more tokens at the problem is also super important. And you can because you’re just paying for electricity (and CapEx for the hardware), not tokens from companies that are doing pre-IPO monetization (i.e. tokens gonna go up, way up). They’ve been losing money hand over fist to gain market share and pump the idea, that was never going to last.