欧美色欧美亚洲另类七区,惠美惠精品网,五月婷婷一区,国产亚洲午夜

課程目錄:Hadoop for Business Analysts培訓
4401 人關注
(78637/99817)
課程大綱:

   Hadoop for Business Analysts培訓

 

 

 

Section 1: Introduction to Hadoop
hadoop history, concepts
eco system
distributions
high level architecture
hadoop myths
hadoop challenges
hardware / software
Labs : first look at Hadoop
Section 2: HDFS Overview
concepts (horizontal scaling, replication, data locality, rack awareness)
architecture (Namenode, Secondary namenode, Data node)
data integrity
future of HDFS : Namenode HA, Federation
labs : Interacting with HDFS
Section 3 : Map Reduce Overview
mapreduce concepts
daemons : jobtracker / tasktracker
phases : driver, mapper, shuffle/sort, reducer
Thinking in map reduce
Future of mapreduce (yarn)
labs : Running a Map Reduce program
Section 4 : Pig
pig vs java map reduce
pig latin language
user defined functions
understanding pig job flow
basic data analysis with Pig
complex data analysis with Pig
multi datasets with Pig
advanced concepts
lab : writing pig scripts to analyze / transform data
Section 5: Hive
hive concepts
architecture
SQL support in Hive
data types
table creation and queries
Hive data management
partitions & joins
text analytics
labs (multiple) : creating Hive tables and running queries, joins , using partitions, using text analytics functions
Section 6: BI Tools for Hadoop
BI tools and Hadoop
Overview of current BI tools landscape
Choosing the best tool for the job

主站蜘蛛池模板: 金门县| 潜江市| 隆德县| 革吉县| 岳普湖县| 安化县| 大方县| 松江区| 利津县| 华坪县| 合肥市| 商河县| 安化县| 昌黎县| 浦东新区| 大关县| 海林市| 华池县| 博乐市| 大庆市| 通州区| 石渠县| 西丰县| 镇原县| 温宿县| 贵南县| 新绛县| 乐安县| 永善县| 桂平市| 洛川县| 泰顺县| 通州市| 东城区| 潼关县| 深州市| 镇宁| 宿迁市| 镇坪县| 图们市| 沿河|