Interview Questions


Sqoop Interview Cheat Sheet

Install Sqoop: sudo yum install sqoop or sudo apt-get install sqoop. Sqoop runs from the normal command prompt. Sqoop config file: sqoop-site.xml. Install JDBC drivers: after you’ve obtained the driver, you need to copy the driver’s JAR file(s) into Sqoop’s lib/ directory. If you’re using the Sqoop tarball, copy the JAR files directly into the lib/ directory after unzipping the tarball. If you’re using packages, you will need to copy the driver files into the /usr/lib/sqoop/lib directory […]


Good SQL Queries Collection

/* CREATE TABLE FOR DEPARTMENT */
CREATE TABLE DEPARTMENT (
  "DEPT_ID" NUMBER,
  "DEPT_NAME" VARCHAR2(30),
  PRIMARY KEY ("DEPT_ID")
)
/* CREATE TABLE FOR EMP */
CREATE TABLE EMPLOYEE (
  "EMP_ID" NUMBER,
  "MGR_ID" NUMBER,
  "DEPT_ID" NUMBER,
  "EMP_NAME" VARCHAR2(30),
  "SAL" NUMBER,
  "DOJ" DATE,
  PRIMARY KEY ("EMP_ID") ENABLE,
  FOREIGN KEY ("MGR_ID") REFERENCES EMPLOYEE ("EMP_ID") ENABLE,
  FOREIGN KEY ("DEPT_ID") REFERENCES DEPARTMENT ("DEPT_ID") ENABLE
)
/* INSERT STATEMENT FOR DEPARTMENT */
INSERT […]


Real Time Hadoop Interview Questions From Different Readers

Real-time Hadoop interview questions from various interviews. Hive: Where do you use an internal (managed) table? In what scenarios? In your resume, what do you mean by “monitoring & managing MapReduce jobs”? Explain. Interviewer’s project: how to convert an RDBMS’s nested SQL queries to the Hadoop framework using Pig. Sqoop: need to know it very well. Some of the current projects import data from other RDBMS sources into HDFS. Can […]


100 Hadoop Certification Dump Questions

1. Which of the following describes how a client reads a file from HDFS? ( 1 ) The client queries all DataNodes in parallel. The DataNode that contains the requested data responds directly to the client. The client reads the data directly from the DataNode. The client contacts the NameNode for the block location(s). The NameNode then queries the DataNodes for block locations. The DataNodes respond […]


100 Interview Questions on Hadoop

1. What does commodity hardware mean in the Hadoop world? (D) a) Very cheap hardware b) Industry-standard hardware c) Discarded hardware d) Low-specification, industry-grade hardware 2. Which of the following are NOT big data problem(s)? (D) a) Parsing a 5 MB XML file every 5 minutes b) Processing IPL tweet sentiments c) Processing online bank transactions d) Both (a) and (c) 3. What does “Velocity” in […]


Sqoop Interview Questions and Answers for Experienced

In this post we will provide some practical Sqoop interview questions and answers for experienced Hadoop developers. 1. What is Sqoop? Sqoop is an open source tool that enables users to transfer bulk data between the Hadoop ecosystem and relational databases. 2. What are the relational databases supported in Sqoop? Below is the list of RDBMSs currently supported by Sqoop: MySQL, PostgreSQL […]


Hive Interview Questions and Answers for Experienced Part – 4

Below are some of the important Hive interview questions and answers for experienced Hadoop developers. 1. What is the Hive configuration precedence order? There is a precedence hierarchy for setting properties. In the following list, lower numbers take precedence over higher numbers: (1) the Hive SET command, (2) the command line -hiveconf option, (3) hive-site.xml, (4) hive-default.xml, (5) hadoop-site.xml (or, equivalently, core-site.xml, hdfs-site.xml, and mapred-site.xml), (6) hadoop-default.xml (or, equivalently, […]
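The precedence order in this excerpt amounts to a first-match lookup: check each source in order and take the first one that defines the property. The following is only a rough Python sketch of that idea, not Hive code. The source labels mirror the excerpt, while `resolve`, `demo_resolve`, and the property names `example.prop` and `other.prop` are hypothetical names chosen for illustration.

```python
# Sources in the order the excerpt lists them; earlier entries win.
PRECEDENCE = [
    "SET command",
    "-hiveconf option",
    "hive-site.xml",
    "hive-default.xml",
    "hadoop-site.xml",
    "hadoop-default.xml",
]


def resolve(prop, settings):
    """Return (value, source) from the highest-precedence source defining prop."""
    for source in PRECEDENCE:
        values = settings.get(source, {})
        if prop in values:
            return values[prop], source
    return None, None  # property is not set anywhere


def demo_resolve():
    # Hypothetical property names and values, for illustration only.
    settings = {
        "hive-site.xml": {"example.prop": "from-site"},
        "-hiveconf option": {"example.prop": "from-cli"},
        "hive-default.xml": {"example.prop": "default", "other.prop": "x"},
    }
    return resolve("example.prop", settings), resolve("other.prop", settings)
```

Here the -hiveconf value wins over hive-site.xml and hive-default.xml for `example.prop`, while `other.prop` falls through to hive-default.xml because no higher source defines it.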


Hadoop Pig Interview Questions and Answers Part – 1

Below are some Hadoop Pig interview questions and answers that are suitable for both freshers and experienced Hadoop programmers. 1. What is Apache Pig? Pig is a scripting language for easily exploring huge data sets, gigabytes or terabytes in size. Pig provides an engine for executing data flows in parallel on Hadoop. 2. What is Apache Pig? Apache Pig is a top-level project of the Apache Software Foundation for analyzing […]


HBase Interview Questions and Answers Part – 1

Below are a few important Hadoop HBase interview questions and answers that are suitable for Hadoop freshers or experienced developers. 1. What is HBase? HBase is a column-oriented, open-source, multidimensional, distributed database. It runs on top of HDFS. 2. Why do we use HBase? HBase provides random reads and writes and can perform thousands of operations per second on large data sets. HBase supports record-level operations on the database […]


MapReduce Interview Questions and Answers Part – 4

In this post, we will discuss another 50 MapReduce interview questions and answers for experienced MapReduce developers. 1. What are the methods in the Mapper class and the order of their invocation?

The Mapper contains the run() method, which calls its own setup() method only once, then calls the map() method for each input, and finally calls its cleanup() method. We can […]
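The invocation order described here can be illustrated with a small simulation. This is a toy Python sketch, not the Hadoop Java API: the `Mapper` class below is a stand-in that only records which method was called, and `demo_calls` and the sample records are hypothetical. Wrapping the loop in try/finally so that cleanup() still runs after an error is an assumption of this sketch.

```python
class Mapper:
    """Toy stand-in for a Hadoop-style mapper; not the real Hadoop API."""

    def __init__(self):
        self.calls = []  # records the order in which methods are invoked

    def setup(self):
        self.calls.append("setup")

    def map(self, key, value):
        self.calls.append(f"map({key!r}, {value!r})")

    def cleanup(self):
        self.calls.append("cleanup")

    def run(self, records):
        # setup() once, map() once per input record, cleanup() once at the end.
        self.setup()
        try:
            for key, value in records:
                self.map(key, value)
        finally:
            # Assumption of this sketch: cleanup runs even if map() raises.
            self.cleanup()
        return self.calls


def demo_calls():
    # Hypothetical input: (byte offset, line) pairs, as a text input might produce.
    records = [(0, "first line"), (11, "second line")]
    return Mapper().run(records)
```

Calling `demo_calls()` returns the recorded call order: setup first, one map entry per record, and cleanup last.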


Hadoop Interview Questions and Answers Part – 5

Below are some Hadoop interview questions and answers. 1. As the data is replicated thrice in HDFS, does it mean that any calculation done on one node will also be replicated on the other two? Since there are 3 nodes, when we send the MapReduce programs, calculations will be done only on the original data. The master node knows exactly which node has that particular data. In […]

