Is deepseek’s r1 model distillated from ChatGPT?

Yes, there is evidence suggesting that DeepSeek’s R1 model was developed using a technique called “distillation” from OpenAI’s models. Distillation involves training a new model on the outputs of a pre-existing model, effectively transferring knowledge from the original model to the new one. OpenAI has indicated that it found evidence linking DeepSeek to the use of this technique.
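
To make the idea of distillation concrete, here is a minimal, illustrative sketch of how a small “student” model can be trained to imitate a larger “teacher” model’s output distribution. Everything in it (the layer sizes, vocabulary size, temperature, and random inputs) is a toy assumption for illustration and does not describe DeepSeek’s or OpenAI’s actual training pipelines.

# Minimal knowledge-distillation sketch (toy assumptions, not DeepSeek's or OpenAI's pipeline).
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB = 100          # toy vocabulary size (assumption)
TEMPERATURE = 2.0    # softening temperature, a common but arbitrary choice

# Stand-in "teacher" and "student": the student is deliberately smaller.
teacher = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, VOCAB))
student = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, VOCAB))
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

def distillation_loss(student_logits, teacher_logits, t=TEMPERATURE):
    """KL divergence between the softened teacher and student distributions."""
    student_log_probs = F.log_softmax(student_logits / t, dim=-1)
    teacher_probs = F.softmax(teacher_logits / t, dim=-1)
    # Scale by t^2, the usual correction from Hinton et al. (2015).
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * t * t

# One toy training step: the student learns to imitate the teacher's outputs.
x = torch.randn(8, 16)                   # random stand-in inputs
with torch.no_grad():
    teacher_logits = teacher(x)          # teacher outputs serve as the training targets
student_logits = student(x)
optimizer.zero_grad()
loss = distillation_loss(student_logits, teacher_logits)
loss.backward()
optimizer.step()
print(f"distillation loss: {loss.item():.4f}")

In practice, distillation from a proprietary model would rely only on its generated text (its API outputs) rather than its internal probabilities, but the basic idea of training one model on another model’s outputs is the same.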

Additionally, discussions within the AI community have raised concerns about DeepSeek’s methods. For instance, a thread on the OpenAI Developer Forum titled “Looks like Deep Seek R1/V3 was distilled from GPT-4/3.5 - Can anyone confirm?” delves into this topic.

Therefore, it appears that DeepSeek’s R1 model was indeed developed through distillation from OpenAI’s models.

All replies:

Answer from ChatGPT 4o -QualityWithoutName- 01/30/2025 10:35:14

Why don’t they do the distillation themselves? Or are they simply not as skilled? -sxyz- 01/30/2025 10:49:09

The skill of distillation. -Bob007- 01/30/2025 11:19:43

I just asked ChatGPT the same question and got the opposite answer. Strange! -方外居士- 01/30/2025 10:50:31

Can anyone confirm? -監考老師- 01/30/2025 11:00:55

A screenshot is attached above. -QualityWithoutName- 01/30/2025 11:21:00
