Submitting Shell Scripts in Mapreduce Job

Introduction to Hadoop Streaming

In this post, we will discuss about introduction to hadoop streaming with term frequency and Inverse document frequency algorithm. Hadoop Streaming By default Mapreduce framework is written in Java and supports writing mapreduce programs in Java programming language but Hadoop provides API for writing mapreduce programs in other than Java Language. Hadoop Streaming is an utility that comes with hadoop distribution and allows users to write mapreduce programs in any […]