distcp command in hadoop

This is for Hadoop eco system like HDFS, Map reduce, Hive, Hbase, Pig, sqoop,sqoop2, Avro, solr, hcatalog, impala, Oozie, Zoo Keeper and Hadoop distribution like Cloudera, Hortonwork etc.
mohit123
Posts: 162
Joined: Sat Sep 20, 2014 11:29 pm
Contact:

distcp command in hadoop

Postby mohit123 » Wed Sep 24, 2014 12:44 am

What is the use of distcp command in hadoop?


Guest

Re: distcp command in hadoop

Postby Guest » Thu Jul 16, 2015 7:42 am

The hadoop distcp command is a tool used for large inter- and intra-cluster copying. It uses MapReduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.

Syntax
hadoop [ Generic Options ] distcp
[-p [rbugp] ]
[-i ]
[-log ]
[-m ]
[-overwrite ]
[-update ]
[-f <URI list> ]
[-filelimit <n> ]
[-sizelimit <n> ]
[-delete ]
<source>
<destination>

snehalshah
Posts: 61
Joined: Sun Aug 30, 2015 8:02 am
Contact:

Re: distcp command in hadoop

Postby snehalshah » Wed Sep 23, 2015 5:03 pm

hadoop fs distcp a.txt b.txt



Return to “Hadoop and Big Data”

Who is online

Users browsing this forum: No registered users and 1 guest