DeepSeek R1 just got a 2X speed boost, the code for the boost was written by R1 itself!

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 3 days ago

DeepSeek R1 just got a 2X speed boost, the code for the boost was written by R1 itself!

Hohsia [he/him]@hexbear.net · 3 days ago

A thing I’ve noticed with deepseek is that it operates in a very system-oriented manner (it carefully plans out how to answer your question when you use thinking mode and it’s actually quite interesting) whereas chatgpt just tells you how long it “thought” and ultimately regurgitates an output that it is statistically likely. So we actually get to see a bit of the black box in my view

QuillcrestFalconer [he/him]@hexbear.net · 3 days ago

ChatGPT o1 hides it’s chain of thought so you don’t even really know what the ‘reasoning’ is

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 3 days ago

yeah it’s fascinating to see how the sausage is made

DeepSeek R1 just got a 2X speed boost, the code for the boost was written by R1 itself!

DeepSeek R1 just got a 2X speed boost, the code for the boost was written by R1 itself!

ggml : x2 speed for WASM by optimizing SIMD