Is DeepSeek’s R1 model distilled from ChatGPT?


Yes, there is evidence suggesting that DeepSeek’s R1 model was developed using a technique called “distillation” from OpenAI’s models. Distillation trains a new model on the outputs of a pre-existing one, effectively transferring the original model’s knowledge to the new model. OpenAI has indicated that it found evidence linking DeepSeek to this practice, which is a common way for developers to train new models by leveraging the outputs of existing ones.
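For readers unfamiliar with the term, the sketch below illustrates classic knowledge distillation in PyTorch. The tiny linear “teacher” and “student” models, the temperature value, and the random data are all illustrative stand-ins, not anything from DeepSeek’s or OpenAI’s actual training pipelines. Note also that in the over-the-API setting discussed here, where a teacher’s internal probabilities are not exposed, “distillation” usually means fine-tuning the student on text the teacher generates; the logit-matching version below is the textbook form of the idea.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative stand-ins: in practice these would be large language models,
# with the teacher typically being a pre-trained proprietary model.
teacher = nn.Linear(128, 10)   # "teacher": the existing, stronger model
student = nn.Linear(128, 10)   # "student": the new model being trained

optimizer = torch.optim.Adam(student.parameters(), lr=1e-4)
T = 2.0  # temperature: softens the teacher's output distribution

def distillation_step(x: torch.Tensor) -> float:
    # Teacher outputs are treated as fixed training targets (no gradients).
    with torch.no_grad():
        teacher_logits = teacher(x)
    student_logits = student(x)
    # KL divergence between the softened teacher and student distributions;
    # scaling by T^2 keeps gradient magnitudes comparable across temperatures.
    loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# One toy training step on random data.
print(distillation_step(torch.randn(32, 128)))
```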


Additionally, discussions within the AI community have raised concerns about DeepSeek’s methods. For instance, a thread on the OpenAI Developer Forum titled “Looks like Deep Seek R1/V3 was distilled from GPT-4/3.5 - Can anyone confirm?” delves into this topic.


Therefore, the available evidence suggests that DeepSeek’s R1 model was developed, at least in part, through distillation from OpenAI’s models.