欧美色欧美亚洲另类七区,惠美惠精品网,五月婷婷一区,国产亚洲午夜

課程目錄:Hadoop For Administrators培訓
4401 人關注
(78637/99817)
課程大綱:

   Hadoop For Administrators培訓

 

 

 

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with HDFS command lines
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as complementary to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume, the same for Sqoop
MapReduce operations and administration
Parallel computing before mapreduce: compare HPC vs Hadoop administration
MapReduce cluster loads
Nodes and Daemons (JobTracker, TaskTracker)
MapReduce UI walk through
Mapreduce configuration
Job config
Optimizing MapReduce
Fool-proofing MR: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5)
Ambari for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0)

主站蜘蛛池模板: 巧家县| 新乡市| 习水县| 大渡口区| 江陵县| 深水埗区| 东阳市| 沅陵县| 梁河县| 崇礼县| 勃利县| 新宁县| 额尔古纳市| 顺平县| 英吉沙县| 宜丰县| 天门市| 灵寿县| 柳河县| 且末县| 卫辉市| 马龙县| 江口县| 大余县| 牙克石市| 石河子市| 紫阳县| 陈巴尔虎旗| 北辰区| 贞丰县| 拉孜县| 陕西省| 马公市| 棋牌| 宜丰县| 咸阳市| 绥中县| 左贡县| 湟源县| 延川县| 丹东市|