Find Skills Inc

Hadoop Online Training

Hadoop Online Training

Modules TopicDescription
Module 1 Introduction and Overview of HadoopWhat is Hadoop?
History of Hadoop
Building Blocks – Hadoop Eco-System
Who is behind Hadoop?
What Hadoop is good for and what it is not
Parallel Computer vs. Distributed Computing
How to configure Hadoop on your system
NameNode architecture (EditLog, FsImage, location of replicas)
Secondary NameNode architecture
DataNode architecture
Module 2 Hadoop Distributed FileSystem (HDFS)HDFS Overview and Architecture
HDFS Installation
HDFS Use Cases
Hadoop FileSystem Shell
FileSystem Java API
Module 3 HBase – The Hadoop DatabaseHBase Overview and Architecture
HBase Installation
HBase Shell
Java Client API
Java Administrative API
Filters
Scan Caching and Batching
Key Design
Table Design
Module 4 Map/Reduce 2.0/YARNMapReduce 2.0 and YARN Overview
MapReduce 2.0 and YARN Architecture
Installation
Input and Output Formats
Job Scheduling (FIFO, Fair Scheduler, Capacity Scheduler)
HDFS and HBase as Source and Sink
Job Configuration
Job Submission and Monitoring
Anatomy of Job Execution on YARN
Distributed Cache
Hadoop Streaming
Module 5 Hadoop Developer TasksWritting a map-reduce programme
Reading and writing data using Java
Hadoop Eclipse integration
Mapper in details
Reducer in details
Using Combiners
Reducing Intermediate Data with Combiners
Writing Partitioners for Better Load Balancing
Sorting in HDFS
Searching in HDFS
Indexing in HDFS
SHands-On Exercise
Module 6 Hadoop Administrative TasksRoutine Administrative Procedures
Understanding dfsadmin and mradmin
Block Scanner, Balancer
Health Check & Safe mode
DataNode commissioning/decommissioning
Monitoring and Debugging on a production cluster
NameNode Back up and Recovery
Upgrading Hadoop
Module 7 MapReduce WorkflowsDecomposing Problems into MapReduce Workflow
Using JobControl
Oozie Introduction and Architecture
Oozie Installation
Developing, deploying, and Executing Oozie Workflows
Module 8 PigPig Overview
Installation
Pig Latin
Developing Pig Scripts
Processing Big Data with Pig
Joining data-sets with Pig
Module 9 InheritanceTypes of in inheritance
Advantage of inheritance
Single inheritance
Multilevel inheritance
Hierarchical inheritance
Overriding methods
Runtime polymorphism
Module 10 HiveHive Overview
Installation
Hive QL
Scroll to Top