Oozie Notes

Oozie is a workflow scheduler for managing Hadoop and related jobs. It was first developed in Bangalore by Yahoo. A workflow is a DAG (Directed Acyclic Graph): acyclic means the graph cannot contain any loops, and the action nodes of the graph provide control dependency. Control dependency means a second action cannot run until the first action has completed. Oozie definitions are written in Hadoop Process Definition Language (hPDL) and coded as an XML file (workflow.xml). Workflow contains: Control […]
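To make the idea of control dependency in an acyclic graph concrete, here is a minimal Python sketch. It is not Oozie code and does not use hPDL; the action names (`ingest`, `clean`, `report`) are hypothetical, and the standard-library `graphlib` module (Python 3.9+) stands in for the scheduler's ordering logic.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical actions: each key maps to the set of actions that must finish first.
dependencies = {
    "ingest": set(),
    "clean": {"ingest"},   # "clean" cannot start until "ingest" completes
    "report": {"clean"},   # "report" cannot start until "clean" completes
}


def run_order(deps):
    """Return an order in which every action follows the actions it depends on."""
    return list(TopologicalSorter(deps).static_order())


def is_acyclic(deps):
    """Return True if the dependency graph has no loops, False otherwise."""
    try:
        run_order(deps)
        return True
    except CycleError:
        # A loop means no valid run order exists, so the graph is not a DAG.
        return False


print(run_order(dependencies))
print(is_acyclic({"a": {"b"}, "b": {"a"}}))
```

The second call shows why the graph must be acyclic: if two actions wait on each other, neither can ever start.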

Passing arguments to Oozie workflows

Deciding How to Pass Arguments to Oozie Jobs
So far, you have learned about several ways to pass parameters to an Oozie job. To help you decide which approach to use, you should first understand how Oozie uses parameters: Oozie uses parameters explicitly defined inside an action’s <arg> tag. If any of the parameters cannot be resolved there, Oozie uses parameters defined in the file specified inside the <job-xml> tag. […]
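The fallback order described in this excerpt can be modeled as a chained lookup. The Python sketch below only illustrates that lookup order and is not how Oozie is implemented; it covers just the two sources named above, and the parameter names and values (`inputDir`, `queueName`) are hypothetical.

```python
from collections import ChainMap

# Hypothetical parameters defined in an action's <arg> tags.
arg_params = {"inputDir": "/data/in"}

# Hypothetical parameters from the file named in the <job-xml> tag.
job_xml_params = {"inputDir": "/data/default", "queueName": "default"}

# ChainMap searches its mappings left to right, so the <arg> values win
# and the <job-xml> values are used only when <arg> has no entry.
params = ChainMap(arg_params, job_xml_params)

print(params["inputDir"])   # found in arg_params
print(params["queueName"])  # falls back to job_xml_params
```

Any further sources would be appended to the chain in the same way, each one consulted only when the earlier ones cannot resolve a name.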

Apache Oozie Installation on Ubuntu-14.04

In this post we will discuss installing Apache Oozie on an Ubuntu machine and run some sample MapReduce jobs on the Oozie scheduler.
Apache Oozie Installation on Ubuntu
We build the Oozie distribution tarball by downloading the source code from Apache and building it with Maven.
Prerequisite
If we plan to install Oozie 4.0.1 or a prior version, JDK 1.6 is required on our […]
