Bulk data copy between Hadoop clusters


HDFS Distributed File Copy Tool – distcp

Hadoop provides the HDFS distributed file copy (distcp) tool for copying large numbers of HDFS files within a cluster or between clusters. It is built on the MapReduce framework and submits a map-only MapReduce job to parallelize the copy. The tool is commonly used to copy files between clusters, for example from a production environment to a development environment, and it supports several advanced command options. Below are the […]
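As a rough illustration of the inter-cluster copy described in this excerpt, here is a minimal Python sketch that only assembles a `hadoop distcp` command line and prints it, without running it. The helper name `build_distcp_command`, the NameNode hosts `prod-nn` and `dev-nn`, the port, and the paths are all hypothetical placeholders. The `-update` and `-m` options are standard distcp options: `-update` copies only files that are missing or different at the target, and `-m` caps the number of simultaneous map tasks.

```python
import shlex


def build_distcp_command(source_uri, target_uri, max_maps=None, update=False):
    """Return the argv list for a `hadoop distcp` invocation (hypothetical helper)."""
    argv = ["hadoop", "distcp"]
    if update:
        # -update: copy only files that are missing or differ at the target
        argv.append("-update")
    if max_maps is not None:
        # -m: upper bound on the number of map tasks copying in parallel
        argv += ["-m", str(max_maps)]
    argv += [source_uri, target_uri]
    return argv


# Hypothetical production and development NameNode addresses and paths.
command = build_distcp_command(
    "hdfs://prod-nn:8020/data/logs",
    "hdfs://dev-nn:8020/data/logs",
    max_maps=20,
    update=True,
)

# Print the command for review; passing `command` to subprocess.run would execute it.
print(shlex.join(command))
```

Keeping the command as a list rather than a single string avoids shell quoting problems if it is later handed to `subprocess.run` on a machine with a Hadoop client installed.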


Review Comments

I am a PL/SQL developer, interested in moving into big data.

Neetika Singh ITA
