Monday, 31 March 2014

Hadoop Course Content

HADOOP Development Content

HADOOP

  • What is Hadoop?
  • Why Hadoop and its use cases.
  • Different Ecosystems of Hadoop.
  • Hadoop is Good For And Hadoop Is Not Good For.

HDFS(Hadoop Distributed File System)

  • Significance of HDFS in Hadoop
  • Features of HDFS
  • 5 daemons of Hadoop
    • Name Node and its functionality
    • Data Node and its functionality
    • Secondary Name Node and its functionality
    • Job Tracker and its functionality
    • Task Tracker and its functionality
  • Data Storage in HDFS
    • Introduction about Blocks
    • Data replication
  • Accessing HDFS
    • CLI(Command Line Interface) and admin commands
    • Java Based Approach

MAPREDUCE

  • Map Reduce Architecture
  • Map Reduce Programming Model
    • Different phases of Map Reduce Algorithm
    • Different Data types in Map Reduce
    • How Write a basic Map Reduce Program
      • The Driver Code
      • The Mapper
      • The Reducer
  • Creating Input and Output Formats in Map Reduce Jobs
    • Text Input Format
    • Key Value InputFormat
    • Sequence File Input Format
  • Data localization in Map Reduce
  • Combiner(Mini Reducer) and Partitioner
  • Distributed Cache
  • Secondary Sorting Using MapReduce
  • Counters in MapReduce
  • Joins in MapReduce
    • MapJoins
    • ReduceJoins

Apache PIG

  • Introduction to Apache Pig
  • Map Reduce Vs Apache Pig
  • SQL Vs Apache Pig
  • Different datatypes in Pig
  • Modes Of Execution in Pig
    • Local Mode
    • Map Reduce OR Distributed Mode
  • Exection Mehanism
    • Grunt Shell
    • Script
    • Embeddeb
  • Transformations in Pig
  • How to write a simple pig script
  • How to store Pig output data in Sqoop & HDFS
  • UDFs in Pig

HIVE

  • Hive Introduction
  • Hive architecture
  • Hive Meta Store
  • Hive Integration with Hadoop
  • Hive Tables
    • Managed Tables
    • External Tables
  • Hive Query Language(Hive QL)
  • How to load the data to Hive Tables
  • SQL VS Hive QL
  • Hive Transform

SQOOP

  • Introduction to Sqoop.
  • MySQL client and Server Installation
  • How to connect to Relational Database using Sqoop
  • Different Sqoop Commands
    • Different flavors of Imports
    • Sqoop Eval Functions
    • Export

HBase

  • Hbase introduction
  • Hbase usecases
  • Hbase basics
    • Column families
    • Scans
  • Hbase Architecture
    • Hmaster
    • Zookeeper
    • Region Servers
    • Regions
  • How to create the tables in Hbase

No comments:

Post a Comment