DeepSeek's free 685B-parameter AI model runs at 20 tokens/second on Apple's Mac Studio, outperforming Claude Sonnet while using just 200 watts, challenging OpenAI's cloud-dependent business model.
Okay, can somebody who knows about this stuff please explain what the hell a “token per second” means?
A token is a bit like a syllable when you're talking about text-based responses. 20 tokens a second is faster than most people can read the output, so that's sufficient for a real-time-feeling "chat".
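To put rough numbers on that, here's a back-of-the-envelope sketch. The ~0.75 words-per-token figure is a common rule of thumb for English text, and ~250 words per minute is a typical adult silent reading speed; neither number comes from the thread itself.

```python
# Back-of-the-envelope: is 20 tokens/second faster than human reading speed?
# Assumptions (not from the thread): ~0.75 English words per token, and an
# average silent reading speed of ~250 words per minute.

TOKENS_PER_SECOND = 20
WORDS_PER_TOKEN = 0.75   # rough rule of thumb for English text
READING_WPM = 250        # typical adult silent reading speed

generation_wpm = TOKENS_PER_SECOND * WORDS_PER_TOKEN * 60  # words per minute
print(f"Generation: {generation_wpm:.0f} wpm vs reading: {READING_WPM} wpm")
print(f"That's {generation_wpm / READING_WPM:.1f}x typical reading speed")
```

Under those assumptions, 20 tokens/second works out to roughly 900 words per minute, several times faster than most people read, which is why the output feels instantaneous in a chat.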