此文說的大數據,不是常規數據庫,不是人們經常談論的 data marts 以及 data warehouse。

來源: 美國老土 2014-07-22 08:35:07 [] [博客] [舊帖] [給我悄悄話] 本文已被閱讀: 次 (965 bytes)
常規數據庫,以及人們經常談論的 data marts 以及 data warehouse, require data to be stored in relational database with fixed columns and constraints. like Primary key, foreign key, etc.
Of course data mart or data warehouse can do many data analysis and give many results.

Here, so called big data, means much larger data volume than regular data mart or data warehouse and the data structure is arbitrary, there is no fixed columns.

As more and more data is generate by all sorts of devices and social media network, there are demands for this kind of data analysis.

Apache Hadoop is one of this kind of big data system started in Yahoo.
Map Reduce is the model to process this kind of data which is started by Google.
Apache hadoop support a data analysis scripting language call Pig, and SQL kind of analysis language called Hive.


所有跟帖: 

Redeveloped, following Google white papers -數據分析- 給 數據分析 發送悄悄話 (233 bytes) () 07/22/2014 postreply 08:40:13

兩位補充的,非常 educational ! -多哥- 給 多哥 發送悄悄話 多哥 的博客首頁 (0 bytes) () 07/22/2014 postreply 08:42:22

哪裏哪裏,都是胡說之。 多哥才是真知灼見。 Enjoy the day! -美國老土- 給 美國老土 發送悄悄話 美國老土 的博客首頁 (0 bytes) () 07/22/2014 postreply 08:44:41

哪裏哪裏,隨便說說,供大家批判提高啊。 -多哥- 給 多哥 發送悄悄話 多哥 的博客首頁 (0 bytes) () 07/22/2014 postreply 13:50:20

一個炤頭吃飯,多多包涵!:) -數據分析- 給 數據分析 發送悄悄話 (42 bytes) () 07/22/2014 postreply 08:50:00

我看幾位好像有大陰謀 -怪哉- 給 怪哉 發送悄悄話 怪哉 的博客首頁 (3 bytes) () 07/22/2014 postreply 08:56:13

請您先登陸,再發跟帖!

發現Adblock插件

如要繼續瀏覽
請支持本站 請務必在本站關閉/移除任何Adblock

關閉Adblock後 請點擊

請參考如何關閉Adblock/Adblock plus

安裝Adblock plus用戶請點擊瀏覽器圖標
選擇“Disable on www.wenxuecity.com”

安裝Adblock用戶請點擊圖標
選擇“don't run on pages on this domain”