DeepSeek reflects the "more, faster, better, cheaper" character of Chinese research

Source: viBravo5, 2025-01-28 11:58:08

DeepSeek has all along trained its large language models (LLMs) on synthetic data generated by other AI models.

Last year OpenAI released its o1 model, whose defining feature is that reinforcement learning (RL) is used to train the LLM to "think" before it answers.

DeepSeek then caught on quickly, combining synthetic data with reinforcement learning to release DeepSeek R1.

That is the "more, faster, better, cheaper" character of Chinese research.
