Big Data with Hadoop

( 23rd Sep-29th Sep, 2017)

Venue: Graphic Era Hill University,Society Area, Clement Town Dehradun

Last Date for Registration: Sep. 21st, 2017

Eligibility: Faculty/PhD/Research Scholars of engineering/technical institutions and persons from Govt. departments/labs and industry.

Experts from Academia/Industry: Mr. Sai Kumar (Kovid Academy, Hyderabad), Dr. Sudip Roy (IITR)

Objective of the Course

  • To apply traditional data analytics and business intelligence skills to big data tools like Apache Impala (incubating), Apache Hive, and Apache Pig.
  • Cloudera present the tools data, professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.
  • Benefits and Outcomes of the Course

  • Course provides participants with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster using Cloudera Manager.
  • With Spark, developers can write sophisticated parallel applications to execute faster decisions, better decisions, and interactive actions, applied to a wide variety of use cases, architectures, and industries. Apache Spark examples and hands-on exercises are presented in Scala and Python.

  • Course Program

  • The program is split into lectures and lab sessions.
  • Quizzes and project work for enhanced learning.
  • Hands-on experience on basic & advanced- level topics.
  • Interaction & learning with experts from academia & industry.
  • Certificates to the participants by E&ICT Academy IITR.

  • Course Content:
  • Introduction to Hadoop.
  • Hadoop Distributed File System (HDFS).
  • MapReduce and Spark on YARN.
  • Hadoop Cluster Installation.
  • Getting data into HDFS.
  • Installing and configuring Hive & Impala.
  • Hadoop clients including Hue.
  • Interacting with Pig.
  • Processing complex data with Pig.
  • Multi-Dataset operations with Pig.
  • Introduction to Hive and Impala.
  • Querying with Hive and Impala.
  • Getting data into HDFS.
  • Complex data with Hive and Impala.
  • Apache Spark Basics.
  • Aggregating data with pair RDDs.
  • Writing, Running, & Configuring Apache Spark Apps.
  • Parallel Processing in Apache Spark.
  • RDD Persistence.
  • Data-Frames and Spark SQL.
  • Message processing with Apache Kafka.
  • Capturing data with Apache Flume.
  • Important Details

    Last Date for Registration:
    Sep. 21st, 2017
    40 seats on first-cum-first-serve basis
    7 days, 46 hours
    Registration Fee
    Faculty Members/Research scholars: Rs. 2,000/-
    Persons from Industry: Rs. 5,000/-
    Payment Details
    Demand draft drawn in favour of "Dean SRIC IIT Roorkee" payable at Roorkee
    How to Apply
    You can apply online by click here to fill-up the application form OR you can download offline form and email scanned copy to
    Contact Details
    Dr. Sudip Roy (Local co-ordinator, Assistant Prof. ECE Dept, IITR)
    Dr. Sanjeev Manhas (P.I., E&ICT Academy, ECE Dept, IITR),
    Tel: +91-7078627392, +91-1332-286457

    A hard copy of the application form along with Demand Draft must reach to the following address: Mr. Prateek Sharma, EICT Academy, ECE Department, IIT Roorkee, Uttarakhand 247667.

    Follow on: Facebook, LinkdIn

    Visitor Number