distcp command in hadoop

This is for Hadoop eco system like HDFS, Map reduce, Hive, Hbase, Pig, sqoop,sqoop2, Avro, solr, hcatalog, impala, Oozie, Zoo Keeper and Hadoop distribution like Cloudera, Hortonwork etc.
Posts: 162
Joined: Sat Sep 20, 2014 11:29 pm

distcp command in hadoop

Postby mohit123 » Wed Sep 24, 2014 12:44 am

What is the use of distcp command in hadoop?


Re: distcp command in hadoop

Postby Guest » Thu Jul 16, 2015 7:42 am

The hadoop distcp command is a tool used for large inter- and intra-cluster copying. It uses MapReduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.

hadoop [ Generic Options ] distcp
[-p [rbugp] ]
[-i ]
[-log ]
[-m ]
[-overwrite ]
[-update ]
[-f <URI list> ]
[-filelimit <n> ]
[-sizelimit <n> ]
[-delete ]

Posts: 61
Joined: Sun Aug 30, 2015 8:02 am

Re: distcp command in hadoop

Postby snehalshah » Wed Sep 23, 2015 5:03 pm

hadoop fs distcp a.txt b.txt

Return to “Hadoop and Big Data”

Who is online

Users browsing this forum: No registered users and 2 guests