Here is a more professional assessment:

https://stratechery.com/2025/deepseek-faq/

If you don't understand the technology, please stop spreading rumors. Those of us who work in this field read the DeepSeek papers every day.

Their most recent paper, on DeepSeek-R1, is at https://arxiv.org/pdf/2501.12948

DeepSeek ... implemented cross-GPU communications ... using PTX. They did not use CUDA ... THAT is crazy significant.

H100s were prohibited by the chip ban, but not H800s. Everyone assumed that training leading edge models required more interchip memory bandwidth, but that is exactly what DeepSeek optimized both their model structure and infrastructure around.

Again, just to emphasize this point, all of the decisions DeepSeek made in the design of this model only make sense if you are constrained to the H800; if DeepSeek had access to H100s, they probably would have used a larger training cluster with much fewer optimizations specifically focused on overcoming the lack of bandwidth.

All replies:

That's exactly my point. This proves they used cut-down NVDA chips and are deeply tied to them. And they call that decoupling? Who are they kidding? Haha -BrightLine- 01/30/2025 16:59:00

They also use it on Google TPUs, and soon it could be used on Huawei chips. -raritan- 01/30/2025 17:02:00

Isn't it just that NVDA chips are fast and well suited to AI? Which other chip couldn't you swap in? It's just the difference between taking a plane and riding a tractor. -BrightLine- 01/30/2025 17:05:23

Wake up. When the Huawei Mate came out, it was about as fast as the iPhone. -花點牛牛- 01/30/2025 17:29:58

Haha, wasn't it supposed to be "far ahead"? -SVChinese- 01/30/2025 18:56:00

Why bother telling them? Just let them keep their "great-power confidence." How nice. -Maui2021- 01/30/2025 17:02:27

If we're talking technology, let's talk technology. Why keep dragging politics into it? DeepSeek has several techniques of revolutionary significance. -raritan- 01/30/2025 17:05:00

That person is here on assignment. -難忘初心- 01/30/2025 19:32:53

I don't want to get into it, but I can't help it. The "little pinks" are too annoying. -漸行漸遠- 01/30/2025 20:28:06

Could this also make up for the weaknesses of Huawei's GPUs? -mobius- 01/30/2025 17:28:51

They could help Huawei design chips for inference or model training. -raritan- 01/30/2025 17:36:00

Makes sense. They probably started working on it long ago. -mobius- 01/30/2025 17:39:20

Google has been doing this, and Amazon too. -raritan- 01/30/2025 17:39:00

But at this stage they all lag far behind NVIDIA. -raritan- 01/30/2025 17:40:00

And even once a chip is designed, it can't be manufactured without TSMC. -raritan- 01/30/2025 17:46:00

Lithography machines are the key. Huawei is the most cutthroat company in the world, so there is a good chance it announces a headline-grabbing breakthrough. -mobius- 01/30/2025 17:50:59
