map - What is job.get() and job.getBoolean() in mapreduce -
i working on pdf document clustering on hadoop learning mapreduce reading examples on internet.in wordcount examples have lines
job.get("map.input.file") job.getboolean()
what function of these functions?what map.input.file set? or name given input folder? please post answer if know.
for code see following link wordcount 2.0 example=http://hadoop.apache.org/docs/r1.0.4/mapred_tutorial.html
these job configurations. i.e. set of configurations passed on each mapper , reducer. now, these configurations consist of well defined mapreduce/hadoop related configurations user-defined configurations.
in case, map.input.file
pre-defined configuration , yes set comma separated list of paths have set input path.
while wordcount.skip.patterns
custom configuration set per user's input, , may see configuration set in run()
follows:
conf.setboolean("wordcount.skip.patterns", true);
as when use get
, when use getboolean
, should self-explanatory, whenever want set value of type boolean use getboolean
, setboolean
, set specific config value respectively. have specific methods other data types well. if string may use get()
.
Comments
Post a Comment