Hadoop Course Syllabus
Course Syllabus
Download SyllabusINTRODUCTION
- What is Big Data?
- Big Data – Journey
- Big Data Statistics
- Big Data Analytics
- Big Data Challenges
- Technologies Supported By Big Data
- Hadoop Introduction
- What Is Hadoop?
- History Of Hadoop
- Breakthroughs Of Hadoop
- Future of Hadoop
- Who Is Using?
- Basic Concepts
- The Hadoop Distributed File
System – At a Glance - Hadoop Daemon Processes
- Anatomy Of A Hadoop Cluster
- Hadoop Distributions
HADOOP DISTRIBUTED FILE SYSTEM (HDFS)
- What is HDFS?
- Distributed File System (DFS)
- Hadoop Distributed File System (HDFS)
- HDFS Cluster Architecture and Block Placement
- NameNode
- DataNode
- JobTracker
- TaskTracker
- Secondary NameNode
- HDFS Concepts
- Typical Workflow
- Data Replication
- Replica Placement
- Replication Policy
- Hadoop Rack Awareness
- Anatomy of a File Read
- Anatomy of a File Write
MAPREDUCE
- Job Tracker
- Task Tracker
- Task Failures
- Task Tracker Failures
- Job Tracker Failures
- HDFS Failures
- YARN
HOW TO PLAN A CLUSTER
- Versions & Hadrware
- Hardware selection
- Master Hardware
- Slave Hardware
- Cluster sizing
- Operating system selection
- Deployment Layout
- Software Packages
- Hostname, DNS
- Users, Groups, Privileges
- Disk configuration
- Choose a FileSystem
- Mount options
- Network design
- Network usage in Hadoop
- Typical network Topologies
INSTALLATION AND CONFIGURATION
- Apache Hadoop
- Tarball Installation
- Package Installation
- XML Configuration
- Logging Configuration
- HDFS
- Optimization and Tuning
- Optimization and Tuning
AUTHENTICATION
- Kerberos & Hadoop
- Kerberos
- Configuring Hadoop Security
RESOURCE MANAGEMENT
- What is source management?
- Mapreduce Scheduler
- Capacity Scheduler
- Fair Scheduler
CLUSTER MAINTENANCE
- Managing Hadoop
- Starting and stopping processes with Init scripts
- Starting and stopping
- HDFS Maintenance
- Adding and Decommissioning
- Balancing HDFS Block Data
- Dealing with a Failed disk
- MAPREDUCE Maintenance
- Adding and Decommissioning TaskTracker
- Kill MapReduce Job and Task
- Dealing Blacklisted
- Tasktracker
processes manually
DataNode
TROUBLESHOOTING
- COMMON FAILUERS AND PROBLEMS
- HDFS AND MAPREDUCE CHECKS
BACKUP AND RECOVERY
- DATA BACKUP
- Distributed copy
- Parallel data ingestion
- NAMENODE METADATA
Get expertise with Hadoop and develop the skills necessary to effectively analyze and process large-scale datasets through our Hadoop course.

