LughMA to FuturologyEnglish · 3 months agoWithin 24 hours, OpenAI's Deep Research has been replicated by an open-source version that already scores 54% on the same validation set OpenAI's scored 67%.huggingface.coexternal-linkmessage-square10linkfedilinkarrow-up142arrow-down11cross-posted to: technology@lemmy.ziptechnology@lemmy.world
arrow-up141arrow-down1external-linkWithin 24 hours, OpenAI's Deep Research has been replicated by an open-source version that already scores 54% on the same validation set OpenAI's scored 67%.huggingface.coLughMA to FuturologyEnglish · 3 months agomessage-square10linkfedilinkcross-posted to: technology@lemmy.ziptechnology@lemmy.world
minus-squareBronzebeard@lemm.eelinkfedilinkEnglisharrow-up7·3 months agoThat’s 22% worse. That’s basically wrong half the time vs wrong 1/3 of the time.
minus-squareLughOPMAlinkfedilinkEnglisharrow-up20·3 months agoYes, but GPT-4 was at 7% and regarded as world best only months ago. The true significance here, is that they’ve replicated the industry leader so easily and so quickly.
That’s 22% worse.
That’s basically wrong half the time vs wrong 1/3 of the time.
Yes, but GPT-4 was at 7% and regarded as world best only months ago.
The true significance here, is that they’ve replicated the industry leader so easily and so quickly.