- Learn installations and architecture of Hadoop, Hive, Spark, and other tools. Handle structured & Unstructured Data
- Free tutorial
- Rating: 4.2 out of 54.2 (196 ratings)
- 11,480 students
- 1hr 26min of on-demand video
- Created by Shivgan Joshi
- English
What you’ll learn
- Kick start with basics for career in Big Data Hadoop in NY Area 312 285 6886
- Learn how to install different tools on Hadoop
- Learn enough Hadoop to join our NYC Bootcamp on Hadoop Big Data
Requirements
- Basic knowledge of Programming and SQL would help
- Basic idea of Python SQL Data Analytics would help
Description
Introduction Hadoop Big Data Course
- Introduction to the Course
Top Ubuntu commands
Understand NameNode, DataNode, YARN and Hadoop Infrastructure
Hadoop Install
- Hadoop Installation & HDFS Commands
- Java based Mapreduce
# Hadoop 2.7 / 2.8.4
Learn HDFS commands
Setting up Java for mapreduce
Intro to Cloudera Hadoop & studying Cloudera Certification
SQL and NoSQL
- SQL, Hive and Pig Installation (RDBMS world and NoSQL world)
- More Hive and SQOOP (Cloudera – Sqoop and Hive on Cloudera.
- JDBC drivers.
- Pig
- Intro to NoSQL, MongoDB, Hbase Installation
Understanding different databases
Hive :
- Hive Partitions and Bucketing
- Hive External and Internal Tables
Spark Scala Python
- Spark Installations and Commands
- Spark Scala Scala Sheets
- Hadoop Streaming Python Map Reduce
- PySpark – (Python – Basics). RDDs.
Running Spark-shell and importing data from csv files
PySpark – Running RDD
Mid Term Projects
- Pull data from csv online and move to Hive using hive import
- Pull data from spark-shell and run map reduce for fox news first page
- Create Data in MySQL and using SQOOP move it to HDFS
- Using Jupyter Anaconda and Spark Context run count on file that has Fox news first page
- Save raw data using delimiter comma, space, tab and pipe and move that into spark-context and spark shell
Broadcasting Data – stream of data
Kafka Message Broadcasting
Who this course is for:
- Carrier changes who would like to move to Big Data Hadoop
- Learners who want to learn Hadoop installations
- New York Students who want to move to wall street
Show less
Course content
5 sections • 34 lectures • 1h 26m total lengthCollapse all sections
Introduction5 lectures • 26min
- Introduction to the Course00:11
- Course Syllabus, Scope, Intro Video10:33
- Intro to HDFS & Hadoop Architecture08:29
- Intro to Mapreduce – Pulling data from HDFS03:33
- Map reduce03:40
Hadoop Install8 lectures • 24min
- Hadoop Installation & HDFS Commands00:03
- Hadoop Install08:54
- Java based Mapreduce – 2 examples00:01
- Multi node installation00:01
- Java based Map Reduce05:41
- Multi node install06:12
- Commissioning and Decommissioning of Datanode02:01
- Multinode Debugging01:10
SQL and NoSQL – Structured, Semi Structured and Unstructured Data9 lectures • 18min
- SQL, Hive and Pig Installation (RDBMS)00:07
- Install SQL Hive03:26
- More Hive & Scoop00:01
- Hive Practice03:26
- Sqoop Install and Commands03:40
- Intro to NoSQL, MongoDB, Hbase00:01
- Mongodb install01:55
- Hbase installation02:46
- Pig Installation and Intro02:32
Spark Scala Python10 lectures • 16min
- Spark Scala Install01:22
- Scala Practice02:08
- Spark Scala Scala Sheets00:01
- Hadoop Streaming Python Map Reduce00:06
- PySpark00:01
- Pyspark04:39
- Kafka Installation02:08
- Kafka Message Broadcasting / Fume00:01
- Spark Practice02:25
- Hadoop Streaming – Python mapreduce02:58
Conclusion2 lectures • 2min
- Conclusion00:04
- Final Words and Projects01:59