請教一下AlphaGo程式的算法

來源: fourwaves 於 2016-03-13 09:15:49 [舊帖] [給我悄悄話] 本文已被閱讀：次

它用蒙地卡羅模擬許多可能，再用算法決定最好的一步。所以這一步是唯一的。那它開局應該每手棋都一樣啊？當然據說它會從下過的棋學習。那第四盤它輸了它怎麽知道是那幾手下錯了？前三盤李輸了，它也能知道李那幾手下錯了？

WENXUECITY.COM does not represent or guarantee the truthfulness, accuracy, or reliability of any of communications posted by other users.