All LLMs, whether ChatGPT, LLaMA, or Gemini, are built on Google's Transformer, and the Transformer itself is not really a new architecture: the attention used in the Transformer paper "Attention Is All You Need" is a form of the multiplicative attention mechanism proposed back in 2015.
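For concreteness, here is a minimal sketch of scaled dot-product (multiplicative) attention in the form the Transformer paper describes. The function name, tensor shapes, and toy inputs are illustrative assumptions, not taken from any particular library.

import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Multiplicative (dot-product) attention as used in the Transformer.

    Q, K: arrays of shape (seq_len, d_k); V: array of shape (seq_len, d_v).
    """
    d_k = Q.shape[-1]
    # "Multiplicative": similarity between queries and keys is a dot
    # product, scaled by sqrt(d_k) to keep scores in a reasonable range
    scores = Q @ K.T / np.sqrt(d_k)
    # Softmax over the key dimension turns scores into attention weights
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output row is a weighted average of the value rows
    return weights @ V

# Toy usage: 4 tokens with 8-dimensional keys and values
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)

The "multiplicative" label refers to the dot product used for the query-key similarity, in contrast to additive attention, which scores pairs with a small feed-forward network instead.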