1.

When running on Hadoop, each map task quickly reaches 100 percent completion, but then stalls for a long time. Why does this happen?

Answer»

Gobblin currently uses Hadoop map tasks as containers for running Gobblin tasks. Each map task runs one or more Gobblin workunits, but the progress of those workunits is not hooked into the progress that the map task reports. So even when the Hadoop job reports 100% completion, Gobblin is still doing work; the task has not stalled.
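
The mismatch can be illustrated with a toy model. This is only a sketch, not Gobblin or Hadoop code: the class `ToyMapTask` and all of its methods are hypothetical, and it assumes the container reports progress from how much of its input it has consumed while the workunits run afterwards.

```python
from dataclasses import dataclass, field


@dataclass
class ToyMapTask:
    """Hypothetical stand-in for a map task that hosts Gobblin workunits."""
    workunits: list
    records_read: int = 0
    completed: list = field(default_factory=list)

    def read_input(self):
        # Assumption: reading the input only collects workunit descriptions,
        # so it finishes almost immediately.
        self.records_read = len(self.workunits)

    def reported_progress(self):
        # Progress as the container sees it: fraction of input consumed.
        if not self.workunits:
            return 1.0
        return self.records_read / len(self.workunits)

    def run_next_workunit(self):
        # The real work happens here and is invisible to reported_progress().
        pending = self.workunits[len(self.completed)]
        self.completed.append(pending)

    def actual_progress(self):
        # Fraction of workunits that have actually finished.
        if not self.workunits:
            return 1.0
        return len(self.completed) / len(self.workunits)


task = ToyMapTask(workunits=["wu-1", "wu-2", "wu-3", "wu-4"])
task.read_input()
print(task.reported_progress(), task.actual_progress())  # 1.0 0.0
task.run_next_workunit()
print(task.reported_progress(), task.actual_progress())  # 1.0 0.25
```

In this model the reported figure jumps to 1.0 as soon as the input is read, while the actual figure only rises as workunits finish, which matches the behavior described above.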