Tag

DeepSeek

5 posts

Engram, DeepSeek, and the return of “memory” as an architectural primitive

DeepSeek's Engram reframes memory as an architectural primitive, suggesting models may need recall structures rather than ever-larger layers.

Kimi K2 Thinking enters the reasoning-model race, showing how quickly China's AI frontier is becoming globally competitive.

DeepSeek's mathematical optimizations show how model design and NVIDIA communication infrastructure meet inside efficient training.

Humanity's Last Exam is framed as a benchmark that tests not only models, but our assumptions about intelligence itself.

DeepSeek R1 disrupts the AI cost narrative, challenging Silicon Valley's assumption that frontier capability requires extravagant spending.