I am working with a scala program using spark packages.
Currently I run the program using the bash command from the gateway:
/homes/spark/bin/spark-submit --master yarn-cluster --class "com.xxx.yyy.zzz" --driver-java-options "-Dyyy.num=5" a.jar arg1 arg2
I would like to start using oozie for running this job. I have a few setbacks:
Where should I put the spark-submit executable? on the hfs?
How do I define the spark action? where should the --driver-java-options appear?
How should the oozie action look like? is it similar to the one appearing here?
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…