- 环境要求
- Linux、Windows
- JDK(1.8)
- Python(推荐Python2.6.X)
- Apache Maven 3.x (Compile DataX)
- 下载文件
- 安装JDK
- yum install java-1.7.0-openjdk
- 安装maven
- gunzip apache-maven-3.3.9-bin.tar.gz
- tar -xvf apache-maven-3.3.9-bin.tar
- export PATH=/root/apache-maven-3.3.9/bin:$PATH(设置目录)
- 下载后解压至本地某个目录,修改权限为755,进入bin目录,即可运行样例同步作业:
- $ tar zxvf datax.tar.gz
- $ sudo chmod -R 755 datax
- $ cd datax/bin
- $ python datax.py ../job/job.json(测试样例)
- 测试成功
- 配置DataX
- 如下设置ora.json
*
go-html-template { "job": { "content": [ { "reader": { "name": "oraclereader", "parameter": { "column": ["EMPNO","ENAME","JOB","MGR","HIREDATE","SAL","COMM","DEPTNO"], "connection": [ { "jdbcUrl": ["jdbc:oracle:thin:@192.168.1.225:1521:orcl"], "table": ["EMP"] } ], "password": "tiger", "username": "scott" } }, "writer": { "name": "mysqlwriter", "parameter": { "column": ["EMPNO","ENAME","JOB","MGR","HIREDATE","SAL","COMM","DEPTNO"], "connection": [ { "jdbcUrl": "jdbc:mysql://localhost:3306/EMP?useUnicode=true&characterEncoding=utf-8", "table": ["EMP"] } ], "password": "root", "preSql": ["TRUNCATE TABLE EMP"], "session": [], "username": "root", "writeMode": "replace" } } } ], "setting": { "speed": { "channel": 1 } } } }
- 远程终端中进入datax目录
- 进入bin目录
- 执行 python datax.py ../job/ora.json
- 成功同步
- Tips
- 可以使用reader中设置限定条件、实现增量更新
- 现有配置为全量更新
- 如下设置ora.json
*
- 参考文献