1. Open PDI and create a new job, as shown in Figure 1.
2. Edit the 'Oozie job executor' job entry, as shown in Figure 2.
Note:
The contents of the /root/big_data/job.properties file are as follows:
nameNode=hdfs://manager:8020
jobTracker=manager:8032
queueName=default
oozie.use.system.libpath=true
oozie.wf.application.path=${nameNode}/user/${user.name}
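For the meaning of each property, what the workflow does, and how the workflow file is created, see "https://cloud.tencent.com/developer/article/1433150". The workflow DAG is shown in Figure 3.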
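The job.properties file above uses `${...}` references, so the value of oozie.wf.application.path depends on nameNode and on user.name. The following is a minimal Python sketch of how those references expand. It is only an illustration, not how Oozie itself resolves them, and the user name `root` is an assumption (user.name is not set in the file; it normally comes from the submitting process).

```python
import re

# Contents of job.properties, copied from the listing above.
JOB_PROPERTIES = """\
nameNode=hdfs://manager:8020
jobTracker=manager:8032
queueName=default
oozie.use.system.libpath=true
oozie.wf.application.path=${nameNode}/user/${user.name}
"""

def parse_properties(text):
    """Parse simple key=value lines, skipping blanks and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props

def resolve(props, extra):
    """Expand ${name} references using the file's own keys plus `extra`."""
    lookup = {**extra, **props}
    pattern = re.compile(r"\$\{([^}]+)\}")

    def expand(value, depth=0):
        if depth > 10:
            raise ValueError("reference nesting too deep (possible cycle)")

        def substitute(match):
            name = match.group(1)
            if name not in lookup:
                return match.group(0)  # leave unknown references untouched
            return expand(lookup[name], depth + 1)

        return pattern.sub(substitute, value)

    return {key: expand(value) for key, value in props.items()}

# Assumption: the job is submitted as user "root".
resolved = resolve(parse_properties(JOB_PROPERTIES), {"user.name": "root"})
print(resolved["oozie.wf.application.path"])  # hdfs://manager:8020/user/root
```

Under that assumption, the workflow definition would be looked up under /user/root on HDFS.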
3. Save and run the job. The log output is shown below.
2020/06/09 09:48:43 - Spoon - Starting job...
2020/06/09 09:48:43 - Oozie - Start of job execution
2020/06/09 09:48:43 - Oozie - Starting entry [Oozie job executor]
2020/06/09 09:51:47 - Oozie - Finished job entry [Oozie job executor] (result=[true])
2020/06/09 09:51:47 - Oozie - Job execution finished
2020/06/09 09:51:47 - Spoon - Job has ended.
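The timestamps in the log above show how long the 'Oozie job executor' entry waited for the workflow to finish. As a small sketch, the Python snippet below extracts that duration and the reported result from the two relevant log lines. The helper names are hypothetical, and the parsing assumes the log format shown here.

```python
from datetime import datetime

# The two log lines for the job entry, copied from the output above.
LOG = """\
2020/06/09 09:48:43 - Oozie - Starting entry [Oozie job executor]
2020/06/09 09:51:47 - Oozie - Finished job entry [Oozie job executor] (result=[true])
"""

def parse_timestamp(line):
    """The first 19 characters of each line hold 'YYYY/MM/DD HH:MM:SS'."""
    return datetime.strptime(line[:19], "%Y/%m/%d %H:%M:%S")

def entry_duration(log_text, entry_name):
    """Return (elapsed_seconds, succeeded) for one job entry."""
    start = end = None
    succeeded = False
    for line in log_text.splitlines():
        if f"Starting entry [{entry_name}]" in line:
            start = parse_timestamp(line)
        elif f"Finished job entry [{entry_name}]" in line:
            end = parse_timestamp(line)
            succeeded = "(result=[true])" in line
    if start is None or end is None:
        raise ValueError(f"entry {entry_name!r} not found in log")
    return int((end - start).total_seconds()), succeeded

seconds, ok = entry_duration(LOG, "Oozie job executor")
print(seconds, ok)  # 184 True
```

In this run the entry took 184 seconds (3 minutes 4 seconds) and reported success.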
The progress and result of the workflow can be viewed in the Oozie Web Console, as shown in Figure 4.
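Besides the web console, the workflow status can also be read from the Oozie Web Services API. The sketch below builds the job-info URL and reads the `status` field from the JSON response. Several things here are assumptions not taken from the steps above: the Oozie server is taken to run on host `manager` at the default port 11000, the cluster is assumed to be unsecured (no Kerberos), and the job ID is a made-up placeholder.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

def job_info_url(oozie_url, job_id):
    """Build the job-info URL of the Oozie Web Services API (v1)."""
    return f"{oozie_url.rstrip('/')}/v1/job/{quote(job_id)}?show=info"

def fetch_job_status(oozie_url, job_id, timeout=10):
    """Return the workflow status string, e.g. RUNNING or SUCCEEDED."""
    with urlopen(job_info_url(oozie_url, job_id), timeout=timeout) as response:
        return json.load(response)["status"]

# Assumed server address and a hypothetical job ID, for illustration only.
OOZIE_URL = "http://manager:11000/oozie"
JOB_ID = "0000000-200609000000000-oozie-oozi-W"

print(job_info_url(OOZIE_URL, JOB_ID))
# To query a live server, call: fetch_job_status(OOZIE_URL, JOB_ID)
```

The real job ID is listed in the Oozie Web Console next to each workflow.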