Hadoop Or Big Data

Hadoop or Big data

I. Introduction to Big data

  • What is Big Data
  • Types of data
  • Characteristics Of Big Data
  • Problems with Big Data

II. Introduction to Hadoop

  • What is Hadoop
  • Brief History of Hadoope
  • Hadoop Ecosystem
  • Feature of Hadoop
  • Hadoop v/s RDBMS
  • Introduction to HDFS
  • Introduction to Mapreduce

III. Hadoop Architecture

  • Hadoop directory Structure
  • Hadoop Configuration
  • Hadoop Core Services

IV.Hadoop - HDFS

  • HDFS Concepts
  • HDFS Key features
  • HDFS Architecture
  • HDFS Shell Commands
  • HDFS Core Components
  • HDFS Operations
  • Namenode Operation
  • Secondary Namenode Operation
  • Datanode Operation
  • Blocks & Benefit of Blocks
  • HDFS Block Replication
  • Block Replication method and Topology
  • Data Integerity in HDFS
  • Reading & Writing Data to HDFS

V. HDFS Advance Feature:

  • Accessing HDFS using Java API
  • Commissioning and Decomissioning of nodes

VI.Hadoop - Mapreduce

  • Mapreduce Concepts
  • MapReduce terminologies
  • Understanding block and input splits
  • Mapreduce Data Flows
  • Input Formats
  • Output Formats
  • Mapreduce Datatypes
  • Mapreduce Features (Mapside side and Reduce side Suffle& Sort )
  • Mapreduce Execution
  • Distributed Cache
  • Setting Development environment for Mapreduce
  • Writing WorkcountMapreduce program from scratch

VII. Mapreduce Advance Feature:

  • Custom Input split
  • Custom Keys and Values
  • Custom partitioner
  • Custom comparator
  • Map side joins
  • Reduce side joins
  • Local Runner
  • Tool Runner
  • DebugingMapreduce jobs

VIII. Introduction to Hadoop 2.x:

  • Diferrence Between Hadoop 1.X and Hadoop 2.x
  • Introduction HDFS High Availability
  • Introduction HDFS Federation
  • Introduction to YARN

IX. Pig:

  • Understanding Pig Program, structure and Executionli>
  • Pig Data types
  • Loading and Storing Data
  • Filtering Data
  • Grouping and Joining Data
  • Sorting Data
  • Combining And Splitting Data

X.Hive:

  • Hive Shell
  • Hive Query Language
  • Hive Tables (Managed & External Tables)
  • Partitions
  • Quering Data

XI. Sqoop:

  • Introduction to Sqoop
  • Sqoop Syntax and Basic Commands
  • Importing and Exporting Data with Sqoop

XII. Overview of HBase:

  • Hbase Concepts

XIII. InstallationAdminpart

  • Installing and using VM player
  • Installing Hadoop 1.x
  • Setting single node cluster
  • Setting multinode cluster
  • Hadoop Configuration (Hands On)
  • Installing & Configuring Hive
  • Installing & Configuring Sqoop
  • Installing & Configuring Hbase

XIV.Case Studies:

  • Case Study 1
  • Case Study 2
  • Case Study 3
  • Case Study 4
  • Case Study 5