My requirement is to schedule a flow of jobs, such as:
HDFS --> Hive table --> applying transformations --> Pig script --> HBase table --> loading into an RDBMS table
I want to schedule the flow hourly, daily, or weekly, depending on my requirements.
What is the best way to schedule the flow?
Is it good practice to call the Hadoop commands from a shell script and schedule that script?
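For concreteness, a wrapper of that kind might look roughly like the sketch below. It is written in Python rather than shell only so the control flow is easy to follow, and it is an illustration under stated assumptions, not a recommended design: the stage names and every file name (load_table.hql, transform.hql, to_hbase.pig, export_to_rdbms.sh) are hypothetical placeholders. It runs the stages in order and stops at the first one that fails, so a later stage never runs on incomplete data.

```python
import subprocess

# Hypothetical stages mirroring the flow above. All script and file
# names are placeholders, not real files from this setup.
STAGES = [
    ("load HDFS data into Hive table", ["hive", "-f", "load_table.hql"]),
    ("apply transformations", ["hive", "-f", "transform.hql"]),
    ("run Pig script writing to HBase", ["pig", "-f", "to_hbase.pig"]),
    ("load into RDBMS table", ["./export_to_rdbms.sh"]),
]


def run_pipeline(stages, run=subprocess.run):
    """Run each stage in order and stop at the first failure.

    `run` is injectable so the flow can be exercised without a cluster;
    it must return an object with a `returncode` attribute.
    Returns the names of the stages that completed.
    """
    completed = []
    for name, command in stages:
        result = run(command)
        if result.returncode != 0:
            # Abort so downstream stages do not consume partial output.
            raise RuntimeError(
                f"stage {name!r} failed with exit code {result.returncode}"
            )
        completed.append(name)
    return completed
```

Saved as a script that ends with a call to run_pipeline(STAGES), this could be triggered hourly by a cron entry such as `0 * * * * python3 /path/to/run_flow.py` (the path is again a placeholder); daily or weekly runs only change the cron fields.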
Kindly suggest the best way of scheduling.
Thanks in advance.