Within 24 hours, OpenAI's Deep Research has been replicated by an open-source version that already scores 54% on the same validation set OpenAI's scored 67%.

Lugh · 10 months ago

Bronzebeard@lemm.ee · 10 months ago

That’s 22% worse.

That’s basically wrong half the time vs wrong 1/3 of the time.

Lugh · 10 months ago

Yes, but GPT-4 was at 7% and regarded as world best only months ago.

The true significance here, is that they’ve replicated the industry leader so easily and so quickly.