Apache Hadoop and Eclipse integration -
i trying integrate hadoop eclipse. followed instructions here. however, when attempt run eclipse project following output:
13/04/01 14:55:11 warn util.nativecodeloader: unable load native-hadoop library platform... using builtin-java classes applicable 13/04/01 14:55:11 warn mapred.jobclient: no job jar file set. user classes may not found. see jobconf(class) or jobconf#setjar(string). 13/04/01 14:55:11 info input.fileinputformat: total input paths process : 1 13/04/01 14:55:11 warn snappy.loadsnappy: snappy native library not loaded 13/04/01 14:55:11 info mapred.jobclient: running job: job_local_0001 13/04/01 14:55:11 info util.processtree: setsid exited exit code 0 13/04/01 14:55:11 info mapred.task: using resourcecalculatorplugin : org.apache.hadoop.util.linuxresourcecalculatorplugin@6ea920ad 13/04/01 14:55:11 info mapred.maptask: io.sort.mb = 100 13/04/01 14:55:11 info mapred.maptask: data buffer = 79691776/99614720 13/04/01 14:55:11 info mapred.maptask: record buffer = 262144/327680 13/04/01 14:55:11 warn mapred.localjobrunner: job_local_0001 java.lang.classcastexception: interface javax.xml.soap.text @ java.lang.class.assubclass(class.java:3046) @ org.apache.hadoop.mapred.jobconf.getoutputkeycomparator(jobconf.java:774) @ org.apache.hadoop.mapred.maptask$mapoutputbuffer.<init>(maptask.java:959) @ org.apache.hadoop.mapred.maptask$newoutputcollector.<init>(maptask.java:674) @ org.apache.hadoop.mapred.maptask.runnewmapper(maptask.java:756) @ org.apache.hadoop.mapred.maptask.run(maptask.java:370) @ org.apache.hadoop.mapred.localjobrunner$job.run(localjobrunner.java:212) 13/04/01 14:55:12 info mapred.jobclient: map 0% reduce 0% 13/04/01 14:55:12 info mapred.jobclient: job complete: job_local_0001 13/04/01 14:55:12 info mapred.jobclient: counters: 0 false
my machine linux ubuntu 12.04 apache hadoop version 1.04, oracle java v1.7 , eclipse 3.7.2. why getting output? if doing wrong, can direct me tested method in order make work?
thank you
p.s.: writing wiki @ moment undergraduate students want start "playing" big-data. hence, large group of people going benefit answer :)
please switch new api, i.e "mapreduce" , not "mapred". also, makes more sense since planning write wiki students. should date. right?and if need on how setup eclipse write mapreduce programs, might find link useful.
Comments
Post a Comment