Install sqoop sudo yum install sqoop sudo apt-get install sqoop in sqoop-normal commnd prompt sqoop config file—sqoop site.xml install jdbc drivers After you’ve obtained the driver, you need to copy the driver’s JAR file(s) into Sqoop’s lib/ directory. If you’re using the Sqoop tarball, copy the JAR files directly into the lib/ directory after unzipping the tarball. If you’re using packages, you will need to copy the driver files into the /usr/lib/sqoop/lib directory […]
Real Time Hadoop Interview Questions from Various interviews Hive – Where do you use Internal or Managed table? What scenarios? In your resume, what do you mean by, “monitoring & managing MapReduce jobs"? Explain? Interviewer’s Project: How to modify the RDBMs’ Nested SQL queries into Hadoop framework using Pig. Sqoop: Need to know very well. Some of the current projects are importing data from other RDBMs sources into HDFS. Can […]
Hadoop Certification Dump Questions 1. From given below which describes how a client reads a file from HDFS? ( 1 ) The client queries all DataNodes in parallel. The DataNode that contains the requested data responds directly to the client. The client reads the data directly from the DataNode. The client contacts the NameNode for the block location(s). The NameNode then queries the DataNodes for block locations. The DataNodes respond […]
1. What does commodity Hardware in Hadoop world mean? ( D ) a) Very cheap hardware b) Industry standard hardware c) Discarded hardware d) Low specifications Industry grade hardware 2. Which of the following are NOT big data problem(s)? ( D) a) Parsing 5 MB XML file every 5 minutes b) Processing IPL tweet sentiments c) Processing online bank transactions d) both (a) and (c) 3. What does “Velocity" in […]
In this post we will provide some practical Sqoop Interview Questions and Answers for experienced hadoop developers. Sqoop Interview Questions and Answers for Experienced 1. What is Sqoop? Sqoop is an open source tool that enables users to transfer bulk data between Hadoop eco system and relational databases. 2. What are the relational databases supported in Sqoop? Below are the list of RDBMSs that are supported by Sqoop Currently. MySQL PostGreSQL […]
Below are some the of important hive Interview Questions and Answers for experienced hadoop developers. Hive Interview Questions and Answers for experienced 1. What is the Hive configuration precedence order? There is a precedence hierarchy to setting properties. In the following list, lower numbers take precedence over higher numbers: The Hive SET command The command line -hiveconf option hive-site.xml hive-default.xml hadoop-site.xml (or, equivalently, core-site.xml, hdfs-site.xml, and mapred-site.xml) hadoop-default.xml (or, equivalently, […]
Below are a few more Pig Interview Questions and Answers 1. What is a tuple? A tuple is an ordered set of fields and A field is a piece of data. 2. What is a relation in Pig? A Pig relation is a bag of tuples. A Pig relation is similar to a table in a relational database, where the tuples in the bag correspond to the rows in a table. […]
Below are some of the Hadoop Pig Interview questions and answers that suitable for both freshers and experienced hadoop programmers. 1. What is Apache Pig? Pig is a scripting language for exploring huge data sets of size gigabytes or terabytes very easily. Pig provides an engine for executing data flows in parallel on Hadoop 2. What is Apache Pig? Apache Pig is top level project in Apache Software foundation for analyzing […]
Below are a few important Hadoop HBase Interview Questions and Answers that are suitable for hadoop freshers or experienced developers. 1. What is HBase? HBase is Column-Oriented , Open-Source, Multidimensional, Distributed database. It run on the top of HDFS. 2. Why do we use HBase? HBase provide random read and write, can perform thousand of operation per second on large data set. HBase support record level record level operations on database […]
In this post, we will discuss about another 50 Mapreduce Interview Questions and Answers for experienced mapreduce developers. Mapreduce Interview Questions and Answers for experienced 1. What are the methods in the Mapper class and order of their invocation?
Below are some of the hadoop interview questions and answers. 1. As the data is replicated thrice in HDFS, does it mean that any calculation done on one node will also be replicated on the other two? Since there are 3 nodes, when we send the MapReduce programs, calculations will be done only on the original data. The master node will know which node exactly has that particular data. In […]
In this post, we will discuss about a few more hadoop hive interview questions and answers for hadoop freshers and experienced developers. Hive Interview Questions and Answers 1. What are the types of tables in Hive? There are two types of tables. Managed tables External tables Only while dropping tables these two differentiates. Otherwise both type of tables are very similar. 2. What kind of data warehouse application is suitable […]
In this post, we will discuss about hive Interview Questions and Answers for experienced and freshers. Hive Interview Questions and Answers for experienced: 1. How to start Hive metastore service as a background process? We can start hive metastore service as a background process with below command.
By using kill -9 <process id> we can stop this service. 2. How to configure hive remote metastore in hive-site.xml file? We can configure […]
Below are some of the important Hive Interview Questions and Answers required for Hadoop developers and administrators. Hive Interview Questions and Answers 1. What is Metadata? Data about Data. 2. What is Hive? Hive is one of the important tool in Hadoop eco system and it provides an SQL like dialect to Hadoop distributed file system. 3. What are the features of Hive? Hive provides, Tools to enable easy data extract/transform/load […]