Hadoop Online Training
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
Hadoop Online Training
Hadoop Overview
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
What is Big Data?
Big data means really a big data, it is a collection of large datasets that cannot be processed using traditional computing techniques. Big data is not merely a data, rather it has become a complete subject, which involves various tools, techniques and frameworks.
- Hadoop Training by Realtime Expert trainer
- Hadoop Live Online Classes
- Free study material
- All attendees should have a basic knowledge of Java.
We guarantee learning at your convenience & pace.
- Instant Access:
Get instant access to self-paced training after signup. - Streaming video recording:
Watch lessons any time at your schedule, free recording. - Exercises:
Practical exercises help you test what you are learning as you go. - Free Demo:
Sign up for free demo to check whether the course is right for you and interact with the faculty live. - Experienced Trainers:
We only hire the industry’s best trainers - Live free interactive web sessions:
Ask the Expert Shell Scripting trainers about the career prospects and clarify your questions any time after you complete the course. - Structured Curriculum Schedule:
Progress with your complete daily interactive lessons and assignments. - Faculty Mentoring:
Turn in daily and weekly homework for personalized feedback from faculty. - Virtual Office Hours:
Live interaction with the faculty and other students around the world. - Hands on Live Projects:
Work on live lab sessions to tackle real-world projects. Get 100% faculty guidance and ratings.
Hadoop Course Content
Big-Data and Hadoop
- Introduction to big data and Hadoop
- Hadoop Architecture
- Installing Ubuntu with Java 1.8 on VM Workstation 11
- Hadoop Versioning and Configuration
- Single Node Hadoop 1.2.1 installation on Ubuntu 14.4.1
- Multi Node Hadoop 1.2.1 installation on Ubuntu 14.4.1
- Linux commands and Hadoop commands
- Cluster architecture and block placement
- Modes in Hadoop
- Local Mode
- Pseudo Distributed Mode
- Fully Distributed Mode
- Hadoop Daemon
- Master Daemons(Name Node, Secondary Name Node, Job Tracker)
- Slave Daemons(Job tracker, Task tracker)
- Task Instance
- Hadoop HDFS Commands
- Accessing HDFS
- CLI Approach
- Java Approach
Map-Reduce
- Understanding Map Reduce Framework
- Inspiration to Word-Count Example
- Developing Map-Reduce Program using Eclipse Luna
- HDFS Read-Write Process
- Map-Reduce Life Cycle Method
- Serialization(Java)
- Datatypes
- Comparator and Comparable(Java)
- Custom Output File
- Analysing Temperature dataset using Map-Reduce
- Custom Partitioner & Combiner
- Running Map-Reduce in Local and Pseudo Distributed Mode.
Advanced Map-Reduce
- Enum(Java)
- Custom and Dynamic Counters
- Running Map-Reduce in Multi-node Hadoop Cluster
- Custom Writable
- Site Data Distribution
- Using Configuration
- Using DistributedCache
- Using stringifie
- Input Formatters
- NLine Input Formatter
- XML Input Formatter
- Sorting
- Reverse Sorting
- Secondary Sorting
- Compression Technique
- Working with Sequence File Format
- Working with AVRO File Format
- Testing MapReduce with MR Unit
- Working with NYSE DataSets
- Working with Million Song DataSets
- Running Map-Reduce in Cloudera Box
HIVE
- Hive Introduction & Installation
- Data Types in Hive
- Commands in Hive
- ExploringInternal and External Table
- Partitions
- Complex data types
- UDF in Hive
- Built-in UDF
- Custom UDF
- Thrift Server
- Java to Hive Connection
- Joins in Hive
- Working with HWI
- Bucket Map-side Join
- More commands
- View
- SortBy
- Distribute By
- Lateral View
- Running Hive in Cloudera
SQOOP
- Sqoop Installations and Basics
- Importing Data from Oracle to HDFS
- Advance Imports
- Real Time UseCase
- Exporting Data from HDFS to Oracle
- Running Sqoop in Cloudera
PIG
- Installation and Introduction
- WordCount in Pig
- NYSE in Pig
- Working With Complex Datatypes
- Pig Schema
- Miscellaneous Command
- Group
- Filter
- Order
- Distinct
- Join
- Flatten
- Co-group
- Union
- Illustrate
- Explain
- UDFs in Pig
- Parameter Substitution and DryRun
- Pig Macros
- Running Pig in Cloudera
OOZIE
- Installing Oozie
- Running Map-Reduce with Oozie
- Running Pig and Sqoop with Oozie
Contact us
Got more questions?
Talk to our team directly. A program advisor will get in touch with you shortly.
We’re happy to answer any questions you may have and help you determine which of our services best fit your needs.
Schedule a Free Consultation