oracle goldengate报错解决之OGG-01033

环境概述:

生产环境使用ogg进行数据同步,要求新增两张表

两张表增加后发现目标端数据总是多于源端数据,为此专门做了个测试,遇OGG-01033故障。

报错描述:pump1进程启动失败,状态为abended

看源端日志:

2016-11-07 16:25:40  ERROR   OGG-01033  There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using ENCRYPT) or an unk

nown error. (Remote file used is ./dirdat/ro002250, reply received is Unable to open file "./dirdat/ro002250" (error 2, No such file or directory)).

2016-11-07 16:25:40  ERROR   OGG-01668  PROCESS ABENDING.

查看网上各种资料,有说可能是远程trail文件被锁(但是人家的日志里有明显日志说文件被锁,我的日志里并没有)

还是去看看rmtrail文件:

[email protected]:/data/ogg/dirdat$ ls -lrt

total 170388

-rw-r----- 1 oracle dba 89575424 Nov  3 06:24 po000000083

-rw-r----- 1 oracle dba 84889622 Nov  3 11:42 ro000002250

发现文件更新时间发现停留在11.3日(我的实验是在11.7日)

除本测试环境,还有两个生产库作为源数据库,查看进程状态也为abended(晕。。)

将生产库的源端进程启动后,测试环境的pump1好了一会又是abended了,但是此次报错使用的文件变了:

2016-11-07 17:52:21  ERROR   OGG-01033  There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using ENCRYPT) or an unk

nown error. (Remote file used is ./dirdat/ro002452, reply received is Unable to open file "./dirdat/ro002452" (error 2, No such file or directory)).

2016-11-07 17:52:21  ERROR   OGG-01668  PROCESS ABENDING.

查阅资料说单实例环境可以这样解决:

[email protected]:/data/ogg/dirdat$ mv /data/ogg/dirdat/ro000002250 /data/ogg/dirdat/ro000002250bak

[email protected]:/data/ogg/dirdat$ cp /data/ogg/dirdat/ro000002250bak /data/ogg/dirdat/ro000002250

[email protected]:/data/ogg/dirdat$ pwd

/data/ogg/dirdat

[email protected]:/data/ogg/dirdat$ ll

total 22520276

-rw-r----- 1 oracle dba 89575424 Nov  3 06:24 po000000083

-rw-r----- 1 oracle dba 84889622 Nov  7 17:46 ro000002250

-rw-r----- 1 oracle dba 84889622 Nov  3 11:42 ro000002250bak

………………

-rw-r----- 1 oracle dba 99999848 Nov  7 17:52 ro000002452

-rw-r----- 1 oracle dba 99999930 Nov  7 17:52 ro000002453

………………

重新启动仍旧不好使。

现在问题变成:只要是投递过去的trail文件都无法打开。

还有可能是远程trail文件的目录不正确,查看生产环境源端pump1参数

GGSCI (a-db2 as [email protected]) 5> view params pump1

Extract pump1

PassThru

RmtHost 192.168.10.61, MgrPort 7809

RmtTrail ./dirdat/ro

GGSCI (a-db31 as [email protected]) 12> view params pump1

Extract pump1

PassThru

RmtHost 192.168.10.61, MgrPort 7809

RmtTrail ./dirdat/po

参数rmttrail都是./dirdat下的

查看我的测试环境pump参数:

GGSCI (a-test30 as [email protected]) 11> view params pump1

Extract pump1

PassThru

RmtHost 192.168.10.61, MgrPort 7809

RmtTrail ./dirdat/ro

终于发现问题了:

rmttrail文件生成格式与生产环境billdb的重复了!!都是./dirdat/ro

解决:

修改参数pump1   RmtTrail为./dirdat/go

此时直接重新start 是不可以的,因为之前添加ext,pump,rep进程时都使用的是./dirdat/ro,否则会报错:

2016-11-08 10:47:56  ERROR   OGG-01044  The trail ‘./dirdat/go‘ is not assigned to extract ‘PUMP1‘. Assign the trail to the extract with the command "ADD EXTTRAIL/RMTTRAIL ./dirdat/go, EX

TRACT PUMP1".

2016-11-08 10:47:56  ERROR   OGG-01668  PROCESS ABENDING.

需要删除所有ext,pump,rep进程,重新添加

删除:

目标端

GGSCI (a-db61 as [email protected]) 3> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

REPLICAT    RUNNING     REP1        09:09:58      115:49:19

REPLICAT    RUNNING     REP2        115:11:08     00:00:00

REPLICAT    ABENDED     REP3        111:49:46     12:39:24

GGSCI (a-db61 as [email protected]) 4> delete replicat rep3

Deleted REPLICAT REP3.

GGSCI (a-db61 as [email protected]) 5> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

REPLICAT    RUNNING     REP1        09:09:58      116:04:29

REPLICAT    RUNNING     REP2        00:00:00      00:00:06

(测试环境)源端:

GGSCI (a-test30 as [email protected]) 17> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

EXTRACT     RUNNING     EXT1        00:00:00      00:00:00

EXTRACT     ABENDED     PUMP1       00:00:00      17:20:57

GGSCI (a-test30 as [email protected]) 18> delete RmtTrail ./dirdat/ro, Extract pump1

Deleting extract trail ./dirdat/ro for extract PUMP1

GGSCI (a-test30 as [email protected]) 19> delete Extract pump1

Deleted EXTRACT PUMP1.

GGSCI (a-test30 as [email protected]) 20> delete ExtTrail ./dirdat/eo, Extract ext1

Cannot delete extract trail ./dirdat/eo, extract EXT1 is running.

GGSCI (a-test30 as [email protected]) 21> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

EXTRACT     RUNNING     EXT1        00:00:00      00:00:06

GGSCI (a-test30 as [email protected]) 22> stop ext1

Sending STOP request to EXTRACT EXT1 ...

Request processed.

GGSCI (a-test30 as [email protected]) 23> stop mgr

Manager process is required by other GGS processes.

Are you sure you want to stop it (y/n)?y

Sending STOP request to MANAGER ...

Request processed.

Manager stopped.

GGSCI (a-test30 as [email protected]) 24> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     STOPPED

EXTRACT     STOPPED     EXT1        00:00:00      00:00:10

GGSCI (a-test30 as [email protected]) 25> delete ExtTrail ./dirdat/eo, Extract ext1

Deleting extract trail ./dirdat/eo for extract EXT1

GGSCI (a-test30 as [email protected]) 26> delete Extract ext1

Deleted EXTRACT EXT1.

GGSCI (a-test30 as [email protected]) 27> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     STOPPED

添加

---------源端

GGSCI (a-test30 as [email protected]) 28> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     STOPPED

GGSCI (a-test30 as [email protected]) 29> Add Extract ext1, TranLog, Begin Now

EXTRACT added.

GGSCI (a-test30 as [email protected]) 30> Add ExtTrail ./dirdat/eo, Extract ext1, MegaBytes 100

EXTTRAIL added.

GGSCI (a-test30 as [email protected]) 31> Add Extract pump1, ExtTrailSource ./dirdat/eo, Begin Now

EXTRACT added.

GGSCI (a-test30 as [email protected]) 32> Add RmtTrail ./dirdat/go, Extract pump1, MegaBytes 100

RMTTRAIL added.

GGSCI (a-test30 as [email protected]) 33> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     STOPPED

EXTRACT     STOPPED     EXT1        00:00:00      00:01:21

EXTRACT     STOPPED     PUMP1       00:00:00      00:00:47

-------目标端

GGSCI (a-db61 as [email protected]) 6> Add Replicat rep3, ExtTrail ./dirdat/go, Begin Now

REPLICAT added.

GGSCI (a-db61 as [email protected]) 7> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

REPLICAT    RUNNING     REP1        09:09:58      116:09:21

REPLICAT    RUNNING     REP2        00:00:00      00:00:00

REPLICAT    STOPPED     REP3        00:00:00      00:00:34

启动全部进程

GGSCI (a-test30 as [email protected]) 34> start mgr

Manager started.

GGSCI (a-test30 as [email protected]) 35> start ext1

Sending START request to MANAGER ...

EXTRACT EXT1 starting

GGSCI (a-test30 as [email protected]) 36> start pump1

Sending START request to MANAGER ...

EXTRACT PUMP1 starting

GGSCI (a-test30 as [email protected]) 38> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

EXTRACT     RUNNING     EXT1        00:02:33      00:00:00

EXTRACT     RUNNING     PUMP1       00:00:00      00:00:01

GGSCI (a-db61 as [email protected]) 8> start rep3

Sending START request to MANAGER ...

REPLICAT REP3 starting

GGSCI (a-db61 as [email protected]) 9> info all

Program     Status      Group       Lag at Chkpt  Time Since Chkpt

MANAGER     RUNNING

REPLICAT    RUNNING     REP1        09:09:58      116:10:40

REPLICAT    RUNNING     REP2        00:00:00      00:00:02

REPLICAT    RUNNING     REP3        00:00:00      00:00:07

一切正常了

GGSCI (a-db61 as [email protected]) 2> info rep3

REPLICAT   REP3      Last Started 2016-11-08 10:56   Status RUNNING

Checkpoint Lag       00:00:00 (updated 00:00:00 ago)

Process ID           44117

Log Read Checkpoint  File ./dirdat/go000000

2016-11-08 10:56:30.678779  RBA 1497

时间: 2025-01-08 18:51:10

oracle goldengate报错解决之OGG-01033的相关文章

oracle goldengate报错解决之OGG-00446

OGG测试:无法启动ext进程,报错OGG-00446 报错日志[截取重要部分] 2016-10-28 16:51:40  ERROR   OGG-00446  Missing filename opening checkpoint file. 2016-10-28 16:51:40  ERROR   OGG-01668  PROCESS ABENDING. 日志意思很明显:在打开checkpoint file 时缺少文件名 查看参数: [email protected] dirprm]$ mo

ORACLE安装报错解决

今天在虚拟机中安装了一个WINDOWS系统,用于安装oracle服务器:从安装到使用中出现了很多的问题,把这些问题解决掉,花了不少时间,查了不少的资料. 第一个,我在安装过程中,出现了ORA-00922和ORA-28000报错,这是前面的口令设置上太随意了,没有按照oracle的要求设置口令导致: ORA-00922选项缺失或无效 错误原因:一般是语句的语法有问题.比如命名不对,关键字写错等等.对于非标准的命名,一般采用双引号来创建. 解决办法: 标识符命名规则:(密码设定规则) 1.必须以字母

ORACLE MERGE INTO语句,unable to get a stable set of rows in the source tables报错解决

ORACLE数据库,MERGE INTO语句,经常会出现  ORA-30926: unable to get a stable set of rows in the source tables   这个错误,如下图所示: 经检查,这个错误是由于数据来源表(即语句中,from关键字后面的表)存在数据重复造成的.在实际项目研发中,我们一般不能随便改动数据表的记录,那么如何避免这种错误的产生以及如何改正错误呢? 请看下面的SQL: MERGE INTO TEMP_ZL_ACCOUNTLIST t1 U

Oracle 安装报错 [INS-06101] IP address of localhost could not be determined 解决方法

出现这种错误是因为主机名和/etc/hosts 文件不一致,只需要把主机名和其IP 写入/etc/hosts 文件,就ok了. 注意名字不能有下划线 修改network 文件之外,再使用hostname 命令指定一下主机名,就不用重启OS了. 1.     修改/etc/sysconfig/network 文件 [[email protected]~]# cat /etc/sysconfig/network NETWORKING=yes NETWORKING_IPV6=no HOSTNAME=s

oracle启动报错ORA-00845: MEMORY_TARGET not supported on this system解决办法

oracle启动报错ORA-00845: MEMORY_TARGET not supported on this system 原因可能:ORACLE MEMORY_TARGET参数设置超过了oracle服务器本身内存.或超过了现在服务器空闲内存(就是oracle启动时无法获取设置的内存) eg: oracle version:11gR2 OS version:RedHat 6.4企业版 服务器:虚拟机 分析步骤: 1.查看当前 MEMORY_TARGET参数的值为*.memory_target

linux centos环境下,perl使用DBD::Oracle遇到报错Can't locate DBDOracle.pm in @INC 的解决办法

前言 接手前辈的项目,没有接触.安装.使用过perl和DBD::Oracle,也没有相关的文档记录,茫茫然不知所措~~.一开始发现这个问题,就想着迅速解决,就直接在google上搜报错信息,搜索的过程中发现 如果不搞清楚前因后果我连解决方案都'看不见''看不懂'. 所以还是要补充这方面的知识,再思考解决方案.以下内容就是我一边学习一边解决问题的过程,如果不足,请各位大佬指出. 报错 install_driver(Oracle) failed: Can't locate DBD/Oracle.pm

Oracle11g 启动报错ORA-00845: MEMORY_TARGET not supported on this system报错解决

ORA-00845: MEMORY_TARGET not supported on this system报错解决 Oracle启动报错 ORA-00845: MEMORY_TARGET not supported on this system报错解决 SQL> startup ORA-00845: MEMORY_TARGET not supported on this system 查看原本大小 [[email protected] oracle]# df -h|grep shm tmpfs

oracle之报错:ORA-00054: 资源正忙,要求指定 NOWAIT

SQL> conn scott/[email protected]_databaseConnected to Oracle Database 11g Enterprise Edition Release 11.1.0.6.0Connected as scott ...... SQL> drop table student2;drop table student2ORA-00054: 资源正忙, 但指定以 NOWAIT 方式获取资源, 或者超时失效========================

oracle 12c 报错 ora-03137 来自客户机的格式错误的TTC包被拒绝

昨天下午,实施报了一个oracle的报错ora-03137 说是数据库在11g时没有问题,升级到12c 时,就报错了. 本地调试,看到执行完sql后,报异常,如下: 把SqL在12c的数据库执行一下,没有问题. 就怀疑是asp.net 连接oralce12c时出了问题,就一顿查,但是发现现在用12c是不是太少了,网上相关的信息太少.只能自己上了, 由于我们连接 oralce12c 用的是odp.net. 首先想到的就是升级,数据库升级,我们也升级. 升级完成后,发现问题就这么愉快地解决了. 特意