官方Hadoop配置
http://wiki.pentaho.com/display/BAD/Configuring+Pentaho+for+your+Hadoop+Distro+and+Version
1.官网下载kettle
http://community.pentaho.com/projects/data-integration/
2.解压kettle
3.进入目录运行kettle
Windows下双击spoon.bat
Linux下运行
sh spoon.sh
4.配置kettle连接hadoop
1)修改
E:\pdi-ce-6.1.0.1-196 power\data-integration\plugins\pentaho-big-data-plugin\plugin.properties
修改此文件中active.hadoop.configuration=hdp24
Copy文件core-site.xml, hdfs-site.xml, httpfs-site.xml, mapred-site.xml, yarn-site.xml到E:\pdi-ce-6.1.0.1-196 power\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\hdp24下
修改文件中主机名为ip地址
重启spoon.bat进入图形化界面
I 新建转换
II 新建job
时间: 2024-10-10 03:18:29