Regular databases, as well as the data marts and data warehouses people often talk about, require data to be stored in a relational database with fixed columns and constraints such as primary keys, foreign keys, etc.
Of course, a data mart or data warehouse can support many kinds of data analysis and produce many results.
Here, so-called big data means a much larger data volume than a regular data mart or data warehouse holds, and the data structure is arbitrary: there are no fixed columns.
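To make the contrast concrete, here is a small sketch in Python. It is only an illustration: the table names (`customer`, `purchase`), the helper `insert_orphan_purchase`, and the sample JSON records are all made up for this example, and SQLite stands in for a relational database in general.

```python
import json
import sqlite3

# Relational side: columns and constraints are declared up front.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when enabled
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    "CREATE TABLE purchase ("
    " id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customer(id),"
    " amount REAL NOT NULL)"
)
conn.execute("INSERT INTO customer (id, name) VALUES (1, 'alice')")
conn.execute("INSERT INTO purchase (customer_id, amount) VALUES (1, 9.5)")


def insert_orphan_purchase():
    """Try to insert a purchase for a customer that does not exist."""
    try:
        conn.execute("INSERT INTO purchase (customer_id, amount) VALUES (99, 1.0)")
        return True
    except sqlite3.IntegrityError:
        # The foreign key constraint rejects the row.
        return False


# Arbitrary-structure side: each record carries whatever fields it happens to have.
raw_lines = [
    '{"device": "sensor-7", "temp_c": 21.4}',
    '{"user": "bob", "text": "hello", "tags": ["a", "b"]}',
    '{"user": "carol", "geo": {"lat": 1.0, "lon": 2.0}}',
]
records = [json.loads(line) for line in raw_lines]
field_sets = [sorted(record.keys()) for record in records]
```

The relational side refuses a row that breaks its declared constraints, while the three JSON records share no common set of columns at all.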
As more and more data is generated by all sorts of devices and social media networks, there is growing demand for this kind of data analysis.
Apache Hadoop is one such big data system; much of its early development took place at Yahoo.
MapReduce is the programming model for processing this kind of data; it was introduced by Google.
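The MapReduce idea can be sketched in a few lines of plain Python with the usual word-count example. This is a single-process toy, not Hadoop's API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample lines are assumptions made up for illustration. A real cluster runs the map and reduce steps in parallel on many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over the input lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


lines = ["big data is big", "data is everywhere"]
counts = word_count(lines)
```

Note that the input here is just lines of free text with no fixed columns; the map step decides what structure to pull out of each record.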
The Apache Hadoop ecosystem includes Pig, which provides a data analysis scripting language (Pig Latin), and Hive, which provides an SQL-like analysis language (HiveQL).
The big data discussed in this post is not a regular database, nor the data marts and data warehouses people often talk about.
All replies:
• Redeveloped, following Google white papers -數據分析- ♂ (233 bytes) () 07/22/2014 postreply 08:40:13
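• The additions from you two are very educational! -多哥- ♀ (0 bytes) () 07/22/2014 postreply 08:42:22
• Not at all, I was just rambling. 多哥 is the one with real insight. Enjoy the day! -美國老土- ♂ (0 bytes) () 07/22/2014 postreply 08:44:41
• Not at all, just some casual remarks for everyone to critique and improve on. -多哥- ♀ (0 bytes) () 07/22/2014 postreply 13:50:20
• We all eat from the same stove, so please bear with me! :) -數據分析- ♂ (42 bytes) () 07/22/2014 postreply 08:50:00
• Looks to me like you folks are up to some big conspiracy -怪哉- ♂ (3 bytes) () 07/22/2014 postreply 08:56:13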