Is there a way to specify the title of a job from mrjob in the Hadoop Administration web interface?

StackOverflow https://stackoverflow.com/questions/15536122

문제

I have several different jobs started from the Python library mrjob, including jobs with multiple steps. How can I replace streamjob with a custom name? For example, wordcount_step_1, wordcount_step_2, etc.

screenshot of Hadoop Administration

도움이 되었습니까?

해결책

Sure, just specify it using the --jobconf option when you execute your job.

For example:

if __name__ == '__main__':
    # Be careful, this appends all job args, if you have lots it could be a problem
    sys.argv.extend(["--jobconf", "mapred.job.name=%s" % " ".join(sys.argv)])
    MRYourJobClass.run()

다른 팁

When running the job add the option:

--jobconf mapred.job.name=NameOfTheJob
라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top