DeepSeek?

Source: wavegreen, 2025-01-01 10:07:00 (122 bytes)


Does any expert here understand this? Is it real or fake? Does it show that DeepSeek was trained using ChatGPT 4? Thanks!


All replies:

• Why waste time on this kind of pointless news? -三心三意- ♂ (0 bytes) 01/01/2025 10:17:00

• Isn't it being discussed a lot here on 大千, as something that would deal NVDA a fatal blow? -Pilot007- ♂ (60 bytes) 01/01/2025 10:19:41

• Just treat this kind of discussion as a joke. If it were true, do you think Wall Street wouldn't already know? -三心三意- ♂ (0 bytes) 01/01/2025 10:26:00

• The market is always right. Anyone who can't grasp that will just have to fend for themselves. -三心三意- ♂ (0 bytes) 01/01/2025 10:27:00

• People who keep trying to outsmart the market all end up crushed under its wheels. -三心三意- ♂ (0 bytes) 01/01/2025 10:29:00

• What large language model are you? -Hongmei20- ♂ (63860 bytes) 01/01/2025 10:20:05

• After all this discussion, does nobody know that Gemini, when asked in Chinese, also thinks it is Baidu's 文心一言 (ERNIE Bot)? -maniac63- ♂ (114 bytes) 01/01/2025 10:29:58

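• Who cares which model is which? If it were really that amazing, Nvidia, Microsoft, and Meta shares would drop 10% in a day. Did they? No! No need to pay it any more attention. -三心三意- ♂ (0 bytes) 01/01/2025 10:34:00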

• Why would Nvidia fall? DeepSeek was trained on H800s. As for Microsoft and Meta, LLMs aren't their main business. -maniac63- ♂ (483 bytes) 01/01/2025 10:50:25

• I wasn't talking about you. I meant the nonsense on this board over the past few days. -三心三意- ♂ (0 bytes) 01/01/2025 11:05:00

• Mm, Happy New Year. -maniac63- ♂ (0 bytes) 01/01/2025 11:06:17

• Happy New Year. May the money roll in :) -三心三意- ♂ (0 bytes) 01/01/2025 11:12:00

• Does Gemini in Chinese think it is 文心一言 (ERNIE Bot) because that model was likewise used for its Chinese training? Asking because I don't know. -wavegreen- ♂ (0 bytes) 01/01/2025 10:34:00

• There are two kinds of GPT models: one is the base model, the other is a fine-tuned model built on top of it, -testmobile- ♀ (0 bytes) 01/01/2025 11:46:47

• There are only a few base models: OpenAI's GPT 3/4, Meta's Llama, Google's -testmobile- ♀ (0 bytes) 01/01/2025 11:50:22

• Those are two completely different things. -maniac63- ♂ (210 bytes) 01/01/2025 13:22:33

• This is exactly what I do at work. The customer-service chatbot I wrote is based on the GPT 3 base model and was built by training it on our own documents. -testmobile- ♀ (0 bytes) 01/01/2025 14:50:20

• What most companies build these days are fine-tuned models, which can be customized for specific tasks. -testmobile- ♀ (0 bytes) 01/01/2025 11:51:17

• The question you asked already has an answer in the base model, so it doesn't give its own name. -testmobile- ♀ (0 bytes) 01/01/2025 11:52:54

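• Those large models are all much the same. He's not wrong: the inventor of the model is OpenAI, and the underlying architectures all derive from GPT's decoder architecture. The key is who has -伯克希爾哈薩維- ♂ (24 bytes) 01/01/2025 11:03:48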

• Asking DeepSeek about QQQ and Elliott Wave -85858585- ♀ (112 bytes) 01/01/2025 11:25:03
