Thursday, 15 September 2011

python - Description of Hadoop job -



python - Description of Hadoop job -

i have hadoop cluster , different processes able submit mapreduce jobs cluster (they utilize same user account).

is there way distinguish these jobs? kind of description, can added job during submit 'this job of process "1234", not touch'?

i using python , hadoopstreaming, , distinguish jobs using simple hadoop job -list (or @ to the lowest degree using web management interface).

yes, can specify name each job using job.setjobname(string).

if set job name distinguishing should able tell them apart.

for example, using likemanagementfactory.getruntimemxbean().getname() can process id , machine name (on linux anyway, unsure of behaviour on other operating systems) in format of 1234@localhost, 1234 process id, set job name tell them apart.

python hadoop mapreduce hadoop-streaming

No comments:

Post a Comment