Wednesday, 21 August 2013

Oozie issue with ssh action performing on the Edge node of the cluster

Oozie issue with ssh action performing on the Edge node of the cluster

'm trying to schedule a oozie job through ssh action on the edge node. As
my bash scripts for each of the phases in the process flow. Here, i'm
facing lot of issues like ilts not at all allowing to start the job. I
have read in different blogs stating that if we have to setup
password-less ssh so that we can eliminate this type of issue, but its not
safe to do because the cluster will become vulnerable!! Moreover if this
type of passwordless ssh setup can be done on devclusters but when it
comes to production it may give the same errors (which is kerberized)
there we may not be able to generate passwordless -ssh. Is there any other
method to resolve this type of issue or please suggest what can be done to
trigger a job at the edge node if the files are in HDFS
below i have placed one of the error lod msg
error log...... 2013-08-08 06:03:51,627 INFO
org.apache.oozie.command.wf.ActionStartXCommand: USER[root] GROUP[-]
TOKEN[] APP[*-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@:start:] Start action
[0000044-130719141217337-oozie-oozi-W@:start:] with user-retry state :
userRetryCount [0], userRetryMax [0], userRetryInterval [10] 2013-08-08
06:03:51,627 WARN org.apache.oozie.command.wf.ActionStartXCommand:
USER[root] GROUP[-] TOKEN[] APP[*wf]
JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@:start:]
[0000044-130719141217337-oozie-oozi-W@:start:]Action status=DONE
2013-08-08 06:03:51,627 WARN
org.apache.oozie.command.wf.ActionStartXCommand: USER[root] GROUP[-]
TOKEN[] APP[**-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@:start:]
[0000044-130719141217337-oozie-oozi-W@:start:]Action updated in DB!
2013-08-08 06:03:51,718 INFO
org.apache.oozie.command.wf.ActionStartXCommand: USER[root] GROUP[-]
TOKEN[] APP[-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@sshtest] Start action
[0000044-130719141217337-oozie-oozi-W@sshtest] with user-retry state :
userRetryCount [0], userRetryMax [0], userRetryInterval [10] 2013-08-08
06:03:51,718 INFO org.apache.oozie.action.ssh.SshActionExecutor:
USER[root] GROUP[-] TOKEN[] APP[-wf]
JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@sshtest] start() begins
2013-08-08 06:03:51,721 INFO
org.apache.oozie.action.ssh.SshActionExecutor: USER[root] GROUP[-] TOKEN[]
AP{P*-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@sshtest] Attempting to copy
ssh base scripts to remote host [root@***] 2013-08-08 06:03:51,801 WARN
org.apache.oozie.action.ssh.SshActionExecutor: USER[root] GROUP[-] TOKEN[]
APP[**-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@sshtest] Error while executing
ssh EXECUTION 2013-08-08 06:03:51,801 WARN
org.apache.oozie.command.wf.ActionStartXCommand: USER[root] GROUP[-]
TOKEN[] APP[**-wf] JOB[0000044-130719141217337-oozie-oozi-W]
ACTION[0000044-130719141217337-oozie-oozi-W@sshtest] Error starting action
[sshtest]. ErrorType [NON_TRANSIENT], ErrorCode [AUTH_FAILED], Message
[AUTH_FAILED: Not able to perform operation [ssh -o
PasswordAuthentication=no -o KbdInteractiveDevices=no -o
StrictHostKeyChecking=no -o ConnectTimeout=20 root@** mkdir -p
oozie-oozi/0000044-130719141217337-oozie-oozi-W/sshtest--ssh/ ]

No comments:

Post a Comment