📁 Interview Questions


Sqoop Interview Cheat Sheet 1

Install sqoop sudo yum install sqoop sudo apt-get install sqoop in sqoop-normal commnd prompt sqoop config file—sqoop site.xml install jdbc drivers After you’ve obtained the driver, you need to copy the driver’s JAR file(s) into Sqoop’s lib/ directory. If you’re using the Sqoop tarball, copy the JAR files directly into the lib/ directory after unzipping the tarball. If you’re using packages, you will need to copy the driver files into the /usr/lib/sqoop/lib directory […]


Good SQL Queries Collection

Good SQL Queries Collection /* CREATE TABLE FOR DEPARTMENT */ CREATE TABLE DEPARTMENT ( “DEPT_ID” NUMBER, “DEPT_NAME” VARCHAR2(30), PRIMARY KEY (“DEPT_ID”) ) /* CREATE TABLE FOR EMP */ CREATE TABLE EMPLOYEE ( “EMP_ID” NUMBER, “MGR_ID” NUMBER, “DEPT_ID” NUMBER, “EMP_NAME” VARCHAR2(30), “SAL” NUMBER, “DOJ” DATE, PRIMARY KEY (“EMP_ID”) ENABLE, FOREIGN KEY (“MGR_ID”) REFERENCES EMPLOYEE (“EMP_ID”) ENABLE, FOREIGN KEY (“DEPT_ID”) REFERENCES DEPARTMENT (“DEPT_ID”) ENABLE ) /* INSERT STATEMENT FOR DEPARTMENT */ INSERT […]


Real Time Hadoop Interview Questions From Different Readers

Real Time Hadoop Interview Questions from Various interviews Hive – Where do you use Internal or Managed table? What scenarios? In your resume, what do you mean by, “monitoring & managing MapReduce jobs"? Explain? Interviewer’s Project: How to modify the RDBMs’ Nested SQL queries into Hadoop framework using Pig. Sqoop: Need to know very well. Some of the current projects are importing data from other RDBMs sources into HDFS. Can […]


100 Hadoop Certification Dump Questions 8

Hadoop Certification Dump Questions 1. From given below which describes how a client reads a file from HDFS? ( 1 ) The client queries all DataNodes in parallel. The DataNode that contains the requested data responds directly to the client. The client reads the data directly from the DataNode. The client contacts the NameNode for the block location(s). The NameNode then queries the DataNodes for block locations. The DataNodes respond […]


100 Interview Questions on Hadoop 2

1. What does commodity Hardware in Hadoop world mean? ( D ) a) Very cheap hardware b) Industry standard hardware c) Discarded hardware d) Low specifications Industry grade hardware 2. Which of the following are NOT big data problem(s)? ( D) a) Parsing 5 MB XML file every 5 minutes b) Processing IPL tweet sentiments c) Processing online bank transactions d) both (a) and (c) 3. What does “Velocity" in […]


Sqoop Interview Questions and Answers for Experienced 17

In this post we will provide some practical Sqoop Interview Questions and Answers for experienced hadoop developers. Sqoop Interview Questions and Answers for Experienced 1. What is Sqoop? Sqoop is an open source tool that enables users to transfer bulk data between Hadoop eco system and relational databases. 2. What are the relational databases supported in Sqoop? Below are the list of RDBMSs that are supported by Sqoop Currently. MySQL PostGreSQL […]


Hive Interview Questions and Answers for experienced Part – 4 5

Below are some the of important hive Interview Questions and Answers for experienced hadoop developers. Hive Interview Questions and Answers for experienced 1. What is the Hive configuration precedence order? There is a precedence hierarchy to setting properties. In the following list, lower numbers take precedence over higher numbers: The Hive SET command The command line -hiveconf option hive-site.xml hive-default.xml hadoop-site.xml (or, equivalently, core-site.xml, hdfs-site.xml, and mapred-site.xml) hadoop-default.xml (or, equivalently, […]


Pig Interview Questions and Answers Part – 2 2

Below are a few more Pig Interview Questions and Answers 1. What is a tuple? A tuple is an ordered set of fields and A field is a piece of data. 2. What is a relation in Pig? A Pig relation is a bag of tuples. A Pig relation is similar to a table in a relational database, where the tuples in the bag correspond to the rows in a table. […]


Hadoop Pig Interview Questions and Answers Part – 1

Below are some of the Hadoop Pig Interview questions and answers that suitable for both freshers and experienced hadoop programmers. 1. What is Apache Pig? Pig is a scripting language for exploring huge data sets of size gigabytes or terabytes very easily. Pig provides an engine for executing data flows in parallel on Hadoop 2. What is Apache Pig? Apache Pig is top level project in Apache Software foundation for analyzing […]


HBase Interview Questions and Answers Part – 1 1

Below are a few important Hadoop HBase Interview Questions and Answers that are suitable for hadoop freshers or experienced developers. 1. What is HBase? HBase is Column-Oriented , Open-Source, Multidimensional, Distributed database. It run on the top of HDFS. 2. Why do we use HBase? HBase provide random read and write, can perform thousand of operation per second on large data set. HBase support record level record level operations on database […]


Mapreduce Interview Questions and Answers Part – 4 4

In this post, we will discuss about another 50 Mapreduce Interview Questions and Answers for experienced mapreduce developers. Mapreduce Interview Questions and Answers for experienced 1. What are the methods in the Mapper class and order of their invocation?

The Mapper contains the run() method, which call its own setup() method only once, it also call a map() method for each input and finally calls it cleanup() method. We can […]


Hadoop Interview Questions and Answers Part – 5 1

Below are some of the hadoop interview questions and answers. 1. As the data is replicated thrice in HDFS, does it mean that any calculation done on one node will also be replicated on the other two? Since there are 3 nodes, when we send the MapReduce programs, calculations will be done only on the original data. The master node will know which node exactly has that particular data. In […]


Hive Interview Questions and Answers – Part 3 1

In this post, we will discuss about a few more hadoop hive interview questions and answers for hadoop freshers and experienced developers. Hive Interview Questions and Answers 1. What are the types of tables in Hive? There are two types of tables. Managed tables External tables Only while dropping tables these two differentiates. Otherwise both type of tables are very similar. 2. What kind of data warehouse application is suitable […]


Hive Interview Questions and Answers for experienced – Part 2 4

In this post, we will discuss about hive Interview Questions and Answers for experienced and freshers. Hive Interview Questions and Answers for experienced: 1. How to start Hive metastore service as a background process? We can start hive metastore service as a background process with below command.

By using kill -9 <process id> we can stop this service. 2. How to configure hive remote metastore in hive-site.xml file? We can configure […]


Hive Interview Questions and Answers – Part 1 5

Below are some of the important Hive Interview Questions and Answers required for Hadoop developers and administrators. Hive Interview Questions and Answers 1. What is Metadata? Data about Data. 2. What is Hive? Hive is one of the important tool in Hadoop eco system and it provides an SQL like dialect to Hadoop distributed file system. 3. What are the features of Hive? Hive provides, Tools to enable easy data extract/transform/load […]