Hadoop Streaming: unable to run jar from bash file
I need to run my mapper and reducer, which are two runnable jars, using
Hadoop Streaming. I have written two bash scripts that run these jar files.
I am using the following command for my Hadoop Streaming job:
bin/hadoop jar contrib/streaming/hadoop-streaming-1.1.1.jar \
    -D stream.non.zero.exit.is.failure=false \
    -file /home/abhinav/mapper.jar \
    -file /home/abhinav/reducer.jar \
    -mapper "./mapper.sh | sort -k1,1" \
    -reducer "reducer.sh" \
    -file /home/abhinav/mapper.sh \
    -file /home/abhinav/reducer.sh \
    -input /home/abhinav/stemp/dfs/name/input.txt \
    -output /home/abhinav/stemp/dfs/name/op30
Here is what my mapper.sh looks like:
java -jar mapper.jar
I am getting the following error in my task logs:
Unable to access jarfile mapper.jar
I don't understand why my mapper.jar is not accessible on the cluster.
Can someone please help?
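
To narrow this down, mapper.sh could check for the jar before calling java and report what the task can actually see. The following is only a diagnostic sketch, not a confirmed fix. It assumes the jar is expected in the task's working directory, which is what the relative path in mapper.sh implies. The helper name require_file is hypothetical and is not part of Hadoop.

```shell
#!/bin/sh
# Diagnostic sketch for mapper.sh (an assumption-laden example, not a fix).
# require_file is a hypothetical helper name, not a Hadoop command.
require_file() {
  # Succeed quietly when the file is readable from the current directory.
  if [ -r "$1" ]; then
    return 0
  fi
  # Otherwise write diagnostics to stderr: the directory the script is
  # running in and a listing of what is actually present there.
  echo "cannot read '$1' from working directory: $(pwd)" >&2
  ls -la >&2
  return 1
}

# In mapper.sh the helper would guard the original command, for example:
#   require_file mapper.jar && exec java -jar mapper.jar
```

If the check fails, the directory listing written to stderr should show where, if anywhere, mapper.jar was placed for the task, assuming the script's stderr appears alongside the error in the task logs.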