Blog

Question: Can I set the number of reducers to zero? Answer: Yes, Setting the number of reducers to zero is a valid configuration in Hadoop. When you set the...

BigData Interview Questions- MapReduce 2

Question: Can I set the number of reducers to zero? Answer: Yes, Setting the number of reducers to zero is a valid configuration in Hadoop. When you set the...

Question: What is Hadoop Map Reduce ? Answer: Hadoop MapReduce is a software framework for easily writing applications which process vast amounts of data (multi-terabyte data-sets) in-parallel on large...

BigData Interview Questions- MapReduce

Question: What is Hadoop Map Reduce ? Answer: Hadoop MapReduce is a software framework for easily writing applications which process vast amounts of data (multi-terabyte data-sets) in-parallel on large...

HBase is a database built on top of the HDFS. HBase provides fast lookups for larger tables. The goal of the HBase project is to host very large tables...

BigData – Basics of Apache HBase

HBase is a database built on top of the HDFS. HBase provides fast lookups for larger tables. The goal of the HBase project is to host very large tables...

Pig was initially developed at Yahoo!. Apache Pig is one of the components of Hadoop is used for processing and analysing large data sets. It runs on top of...

BigData – Basics of Apache Pig

Pig was initially developed at Yahoo!. Apache Pig is one of the components of Hadoop is used for processing and analysing large data sets. It runs on top of...

In this section we are going to cover the following :- Type of tables Creating views Type of Tables : You can have Managed table and External Table in Hive....

BigData – Apache Hive Tables

In this section we are going to cover the following :- Type of tables Creating views Type of Tables : You can have Managed table and External Table in Hive....

Apache Hive is one of the components of Hadoop Ecosystem. Hive is Datawarehouse provides the SQL-like interface used to Query Big Data . Features of Hive : Support various...

BigData – Basics of Apache Hive

Apache Hive is one of the components of Hadoop Ecosystem. Hive is Datawarehouse provides the SQL-like interface used to Query Big Data . Features of Hive : Support various...

Sqoop is known as “SQL-to-Hadoop”. Sqoop is a tool used to transfer Data between relational databases and Hadoop.  It has a connector based architecture that supports plugins that provide...

BigData – Ingesting Data into Hadoop Through Sqoop

Sqoop is known as “SQL-to-Hadoop”. Sqoop is a tool used to transfer Data between relational databases and Hadoop.  It has a connector based architecture that supports plugins that provide...

Apache pig is an abstraction layer for processing large datasets; it’s a sub project from Apache and built for Hadoop. Apache pig is scripting language for exploring big datasets,...

BigData- Apache Pig

Apache pig is an abstraction layer for processing large datasets; it’s a sub project from Apache and built for Hadoop. Apache pig is scripting language for exploring big datasets,...

Sqoop Sqoop is a tool designed to transfer data between Hadoop and RDBMS .Sqoop is an open source software product of the Apache Software Foundation. You can use sqoop...

BigData- Sqoop

Sqoop Sqoop is a tool designed to transfer data between Hadoop and RDBMS .Sqoop is an open source software product of the Apache Software Foundation. You can use sqoop...

HDFS permission and Security Each client process that accesses HDFS has a two-part identity composed of the user name, and groups list. Whenever HDFS must do a permissions check...

BigData- HDFS Security

HDFS permission and Security Each client process that accesses HDFS has a two-part identity composed of the user name, and groups list. Whenever HDFS must do a permissions check...

What is HDFS ? HDFS stands for Hadoop Distributed File System is a file system designed for storage of very large files with streaming data access patterns on commodity...

BigData -HDFS Introduction

What is HDFS ? HDFS stands for Hadoop Distributed File System is a file system designed for storage of very large files with streaming data access patterns on commodity...

Why Hadoop required ? Every day a large amount of unstructured data (that has outgrown in size) is getting dumped into our machines. The major challenge is not to...

BigData- Hadoop Introduction

Why Hadoop required ? Every day a large amount of unstructured data (that has outgrown in size) is getting dumped into our machines. The major challenge is not to...

Let's see how to setup one. First of all, Geth need to be installed, in case it is not installed, you can watch this video and install Geth on...

Ethereum- How to setup own private blockchain

Let's see how to setup one. First of all, Geth need to be installed, in case it is not installed, you can watch this video and install Geth on...

In this blog post, we show you how to setup or install Ganache Cli (command line utility) on windows system. We need to have Nodejs installed . if not...

Ethereum- How to Install Ganache Cli on Windows

In this blog post, we show you how to setup or install Ganache Cli (command line utility) on windows system. We need to have Nodejs installed . if not...

Inquire Now
close slider