Hadoop Big Data Course
1. Introduction
In December 2011, the Tata Institute of Social Sciences set up the School of Vocational Education (SVE) to provide immediate and definite interventions to improve the lives of disadvantaged and marginalized youth, especially those excluded by the formal school education system, through appropriate vocational training programmes. It was set up with a vision of creating an ecosystem that would bring back the dignity of labour for blue-collar streams of work and create sustainable sources of income.

1.1. Introduction and Course Objectives
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. This Big Data Hadoop Certification provides an introduction to Big Data, the MapReduce algorithm, and the Hadoop Distributed File System (HDFS). The course is designed to give you in-depth knowledge of the Big Data framework using Hadoop, including HDFS, YARN, and MapReduce. You will learn to process and analyze large datasets stored in HDFS. As part of the course, you will be required to execute real-life, industry-based projects using Cloud Lab in the domains of banking, telecommunication, social media, insurance, and e-commerce. This course has been prepared for professionals aspiring to learn the basics of Big Data analytics using the Hadoop framework and to become Hadoop developers. Software professionals, analytics professionals, and ETL developers are the key beneficiaries of this course.

1.2. Eligibility for Admission
Any graduate, or any engineering student who has completed the first year, can take admission in this certification course.

1.3. Employability/Skill Enhancement
Professionals and graduates who complete the Hadoop certificate program will have an advantage over others, as they will receive additional specialized training in handling Big Data:
• After completing this certification, candidates will be able to understand Big Data technologies.
• They can work with the Hadoop Distributed File System.
• They can use the HDFS architecture.
• They can handle big data with MapReduce algorithms.
Method of Teaching: A combination of vocational theory inputs and relevant practical exposure, commensurate with the guidelines provided in the Facilitator’s guide.
Method of Assessment & Weightage: Students are assessed through written and practical examinations.
Assessment Weightage: Written – 40%, Practical examination – 60%
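As an illustration of the 40%/60% weightage, the sketch below combines a written and a practical result into one final score. The class name, the method name, and the sample marks are hypothetical, and the sketch assumes both results are expressed as percentages out of 100; the syllabus itself does not prescribe a calculation method.

```java
// Minimal sketch of the stated assessment weightage (written 40%, practical 60%).
// AssessmentScore and finalScore are hypothetical names used only for illustration.
class AssessmentScore {
    static final int WRITTEN_WEIGHT = 40;    // percent, from the syllabus
    static final int PRACTICAL_WEIGHT = 60;  // percent, from the syllabus

    // Assumption: both inputs are whole-number percentages in the range 0..100.
    static double finalScore(int writtenPercent, int practicalPercent) {
        return (WRITTEN_WEIGHT * writtenPercent + PRACTICAL_WEIGHT * practicalPercent) / 100.0;
    }

    public static void main(String[] args) {
        // Hypothetical marks: 70% in the written paper, 80% in the practical.
        System.out.println(finalScore(70, 80)); // prints 76.0
    }
}
```

Because the practical examination carries the larger weight, a given gain in the practical result moves the final score more than the same gain in the written paper.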
2. Course Structure
Credits for course – 12 Credits
Duration of Course – 255 Hours
[For theory (classroom) training, 1 credit = 15 hours; for practical training, 1 credit = 30 hours]
Sr. No.  Subject                               Credits  Duration (Hrs.)
1.       Linux Operating System                2        30
2.       Programming with JAVA                 2        30
3.       Hadoop: Big Data Analysis Framework   3        45
4.       Practical Work                        5        150
         Total                                 12       255
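The durations in the course structure follow from the stated conversion rates (15 hours per theory credit, 30 hours per practical credit). The sketch below reproduces that arithmetic; the class and method names are hypothetical, and it assumes the three taught subjects count as theory and only Practical Work counts as practical training.

```java
// Minimal sketch of the credit-to-hours arithmetic in the course structure.
// CreditHours, hours and totalHours are hypothetical names used only for illustration.
class CreditHours {
    static final int THEORY_HOURS_PER_CREDIT = 15;    // from the syllabus
    static final int PRACTICAL_HOURS_PER_CREDIT = 30; // from the syllabus

    // Converts credits to hours using the theory or practical rate.
    static int hours(int credits, boolean practical) {
        return credits * (practical ? PRACTICAL_HOURS_PER_CREDIT : THEORY_HOURS_PER_CREDIT);
    }

    // Assumption: subjects 1 to 3 are theory; subject 4 (Practical Work) is practical.
    static int totalHours() {
        return hours(2, false)   // Linux Operating System
             + hours(2, false)   // Programming with JAVA
             + hours(3, false)   // Hadoop: Big Data Analysis Framework
             + hours(5, true);   // Practical Work
    }

    public static void main(String[] args) {
        System.out.println("Linux Operating System: " + hours(2, false)); // 30
        System.out.println("Programming with JAVA: " + hours(2, false));  // 30
        System.out.println("Hadoop: " + hours(3, false));                 // 45
        System.out.println("Practical Work: " + hours(5, true));          // 150
        System.out.println("Total: " + totalHours());                     // 255
    }
}
```

The total of 255 hours matches the stated course duration: 7 theory credits give 105 hours and 5 practical credits give 150 hours.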
Linux Operating System
Credits: 2
Total Credit Hours: 30
AIM: To expose students to the basics of the Linux operating system. To understand what shell programming is. To understand the various features of the Linux operating system.
OBJECTIVES: To expose students to the Linux file system. To introduce students to various essential Linux commands. To introduce students to system administration. To introduce students to basic networking administration.
UNIT–I Linux introduction and file system
Basic features, advantages, installation requirements, basic architecture of the Unix/Linux system, kernel, shell. Linux file system: boot block, super block, inode table, data blocks, how Linux accesses files, storage files, Linux standard directories. Commands for files and directories: cd, ls, cp, md, rm, mkdir, rmdir, more, less; creating and viewing files using cat; file comparisons; viewing files; disk-related commands; checking free disk space. Partitioning the hard drive for Linux, installing the Linux system, system startup and shutdown.
UNIT–II Essential Linux commands
Understanding shells. Processes in Linux: process fundamentals, connecting processes with pipes, redirecting input and output, manual help, background processing, managing multiple processes, changing process priority, scheduling processes with the at command, batch commands, kill, ps, who, sleep. Printing commands, grep, fgrep, find, sort, cal, banner, touch, file. File-related commands: wc, sat, cut, grep, dd, etc. Mathematical commands: bc, expr, factor, units. Editors: vi, joe, vim.
UNIT–III Shell programming
Basics of shell programming, various types of shell, shell programming in bash, conditional and looping statements, case statements, parameter passing and arguments, shell variables, shell keywords, creating shell programs to automate system tasks and report printing, use of grep in shell, awk programming.
UNIT–IV System administration
Common administrative tasks, identifying administrative files (configuration and log files), role of the system administrator. Managing user accounts: adding and deleting users, changing permissions and ownerships, creating and managing groups, modifying group attributes, temporarily disabling user accounts. Creating and mounting file systems, checking and monitoring system performance, file security and permissions, becoming superuser using su. Getting system information: host name, disk partitions and sizes, users, kernel. Backing up and restoring files, the linuxconf utility in the GUI, reconfiguring hardware with kudzu. Configuring the desktop: X configurator, understanding the XF86Config file, starting and using the X desktop, KDE and GNOME graphical interfaces, changing X settings.
UNIT–V Basic networking administration
Setting up a LAN using Linux, choosing the peer-to-peer vs client/server model, setting up an Ethernet LAN, configuring host computers, checking Ethernet connectivity, connecting to the internet, administration in a networked environment, common networking administrative tasks, the network file system, configuring Ethernet, initializing the Ethernet interface, the ifconfig, netstat and netconfig commands, TCP/IP networks, DNS services, routing using Linux, SLIP and PPP services, UUCP. Installation and administration of a mail server, an FTP server and the Apache web server.
TEXT & REFERENCE BOOKS:
Jack Tackett, David Gunter, “Using Linux”, PHI, EEE Edition
Christopher Negus, “Red Hat Linux 7.x Bible”, IDG Books India Ltd.
Nicholas Wells, “Linux Installation and Administration”, Course Technology (Vikas Publishing, New Delhi)
Sumitabha Das, “Unix”
Yashavant Kanetkar, “Unix Shell Programming”, BPB Publications
“Red Hat Linux Unleashed”, Techmedia (BPB Publications)
Wells, “Linux Networking and Security”, Course Technology (Vikas Publishing, New Delhi)
Programming with JAVA
Credits: 2
Total Credit Hours: 30
AIM: To understand the basics of the Java programming language. To gain knowledge of the various concepts of Java programming.
OBJECTIVES: To introduce the basic structure of a Java program. To impart functional knowledge of the operators and statements of Java. To impart functional knowledge of classes, inheritance and arrays in Java. To create applets using Java.
SYLLABUS
UNIT–I
C++ vs JAVA, JAVA and the Internet and WWW, JAVA support systems, JAVA environment. JAVA program structure, Tokens, Statements, JAVA virtual machine, Constants & Variables, Data Types, Declaration of Variables, Scope of Variables, Symbolic Constants, Type Casting. Operators: Arithmetic, Relational, Logical, Assignment, Increment and Decrement, Conditional, Bitwise, Special; Expressions & their evaluation. If statement, if…else… statement, Nesting of if…else… statements, else…if Ladder, Switch, ?: operator, Loops – While, Do, For, Jumps in Loops, Labelled Loops.
UNIT–II
Defining a Class, Adding Variables and Methods, Creating Objects, Accessing Class Members, Constructors, Method Overloading, Static Members, Nesting of Methods. Inheritance: Extending a Class, Overriding Methods, Final Variables and Methods, Final Classes, Finalizer Methods, Abstract Methods and Classes, Visibility Control.
UNIT–III
Arrays: One-Dimensional & Two-Dimensional, Strings, Vectors, Wrapper Classes, Defining Interfaces, Extending Interfaces, Implementing Interfaces, Accessing Interface Variables, System Packages, Using System Packages, Adding a Class to a Package, Hiding Classes.
UNIT–IV
Creating Threads, Extending the Thread Class, Stopping and Blocking a Thread, Life Cycle of a Thread, Using Thread Methods, Thread Exceptions, Thread Priority, Synchronization, Implementing the Runnable Interface.
UNIT–V
Local and Remote Applets, Applets vs Applications, Writing Applets, Applet Life Cycle, Creating an Executable Applet, Designing a Web Page, Applet Tag, Adding an Applet to an HTML File, Running the Applet, Passing Parameters to Applets, Aligning the Display, HTML Tags & Applets, Getting Input from the User.
TEXT & REFERENCE BOOKS:
E. Balagurusamy, “Programming in Java”, 2nd Edition, TMH Publications
Peter Norton, “Peter Norton Guide to Java Programming”, Techmedia Publications