Tag Archives: online bigdata training

Hadoop advantages and disadvantages

Advantages of Hadoop: 1. Scalable Hadoop is a highly scalable storage platform, because it can stores and distribute very large data sets across hundreds of inexpensive servers that operate in parallel. Unlike traditional relational database systems (RDBMS) that can’t scale to process large amounts of data, Hadoop enables businesses to run applications on thousands of nodes […]

How-to-Install-Hadoop

How to install Hadoop?

Prerequisites Supported Platforms GNU/Linux is supported as a development and production platform. Hadoop has been demonstrated on GNU/Linux clusters with 2000 nodes. Win32 is supported as a development platform. Distributed operation has not been well tested on Win32, so it is not supported as a production platform. Required Software Required software for Linux and Windows include: JavaTM 1.6.x, […]

Hadoop Admin responsibilities

Hadoop Admin Responsibilities: Responsible for implementation and ongoing administration of Hadoop infrastructure. Aligning with the system engineering team to propose and deploy new hardwares and software environments required for Hadoop and to expand existing environments. Working with data delivery teams to setup new Hadoop users. This job includes setting up Linux users, setting up Kerberos […]

Comparison of Hadoop with SQL and Oracle database

Basically the difference is that Hadoop is not a database at all. Hadoop is basically a distributed file system (HDFS) – Hadoop lets you store a large amount of file data on a cloud machines, handling data redundancy etc. Comparing SQL databases and Hadoop: Hadoop is a framework for processing data, what makes it better […]

Five Must Read Books on Hadoop

    Looking for hadoop books? We have shortlisted best hadoop books. 1.Hadoop: The Definitive Guide (By: Tom White ) This is the best book for hadoop beginners. This is a best source to adapt you to the world of big data management. 2.Hadoop in Practice (By: Alex Holmes ) This book discuss about the advanced […]

What are the pre-requisites for big data hadoop?

  Working directly with Java APIs can be tedious and error prone. It also restricts usage of Hadoop to Java programmers. Hadoop offers two solutions for making Hadoop programming easier.Pig is a programming language that simplifies the common tasks of working with Hadoop: loading data, expressing transformations on the data, and storing the final results. […]

Who can become a hadoop professional?

System administrators can learn some Java skills as well as cloud services management skills to start working with Hadoop installation and operations. DBAs and ETL data architects can learn Apache Pig and related technologies to develop, operate, and optimize the massive data flows going into the Hadoop system. BI analysts and data analysts can learn SQL and Hive […]