AIX 5.3 Install Oracle 10g RAC 错误集锦--磁盘I/O 故障

AIX 5.3  Install Oracle 10g RAC 错误集锦--磁盘I/O 故障


系统环境:

操作系统: AIX 5300-09

集群软件: CRS 10.2.0.1

数据库:   Oracle 10.2.0.1


                     系统架构图


                                       本机采用vg 并发(VG concurrent)作为共享的存储

故障现象:

当在第一个node(aix211)执行root.sh时,被hang住......

查看日志:

[[email protected] soft]#cd /u01/crs_1/log

[[email protected] aix211]#ls

admin            client           cssd             racg

alertaix211.log  crsd             evmd

[[email protected] aix211]#cd client/

[[email protected] client]#ls

ocrconfig_458796.log  ocrconfig_471234.log

[[email protected] client]#ls -lt

total 24

-rw-r-----    1 root     system          681 May  8 14:57 ocrconfig_458796.log

-rw-r-----    1 root     system         5684 May  8 14:54 ocrconfig_471234.log

[[email protected] client]#tail -f ocrconfig_458796.log

Oracle Database 10g CRS Release 10.2.0.1.0 Production Copyright 1996, 2005 Oracle.  All rights reserved.

2014-05-08 14:56:08.858: [ OCRCONF][1]ocrconfig starts...

2014-05-08 14:56:08.860: [ OCRCONF][1]Upgrading OCR data

2014-05-08 14:56:27.280: [  OCROSD][1]utread:3: problem reading buffer 100e09f0 buflen 512 retval -1 phy_offset 102400 retry 0

2014-05-08 14:56:27.280: [  OCROSD][1]utread:4: problem reading the buffer errno 5 errstring I/O error

2014-05-08 14:57:03.279: [  OCROSD][1]utread:3: problem reading buffer 100f1b70 buflen 1024 retval -1 phy_offset 102400 retry 0

2014-05-08 14:57:03.279: [  OCROSD][1]utread:4: problem reading the buffer errno 5 errstring I/O error

2014-05-08 14:57:39.279: [  OCROSD][1]utread:3: problem reading buffer 100f1b70 buflen 1536 retval -1 phy_offset 102400 retry 0

2014-05-08 14:57:39.279: [  OCROSD][1]utread:4: problem reading the buffer errno 5 errstring I/O error

出现磁盘的I/O故障!


解决方法:

1、查看PV和VG

[[email protected] client]#lspv

hdisk0          00040f8aaa29d28e                    rootvg          active

hdisk1          000e1a6ce663b4af                    racvg           concurrent

[[email protected] client]#lsvg racvg

--查看VG时也 hang 住......看来是共享卷组的磁盘出现了故障,因为是个实验环境,阵列不很稳定,所有重启系统......


2、重启系统后,重新执行root.sh

3、手工执行安装后校验

[[email protected] ~]$cd /u01/soft/Disk1(CRS 安装盘)

[[email protected] Disk1]$ls

cluvfy        install       response      rootpre       runInstaller  stage         upgrade

[[email protected] Disk1]$cd cluvfy/

[[email protected] cluvfy]$ls

bin           cv            cvupack.zip   jlib          jrepack.zip   lib           lib32         runcluvfy.sh

[[email protected] cluvfy]$./runcluvfy.sh -help

USAGE:

cluvfy [ -help ]

cluvfy stage { -list | -help }

cluvfy stage {-pre|-post} <stage-name> <stage-specific options>  [-verbose]

cluvfy comp  { -list | -help }

cluvfy comp  <component-name> <component-specific options>  [-verbose]

[[email protected] cluvfy]$./runcluvfy.sh stage -post crsinst -n aix211,aix212 -verbose

Performing post-checks for cluster services setup

Checking node reachability...

Check: Node reachability from node "aix211"

Destination Node                      Reachable?

------------------------------------  ------------------------

aix211                                yes

aix212                                yes

Result: Node reachability check passed from node "aix211".

Checking user equivalence...

Check: User equivalence for user "oracle"

Node Name                             Comment

------------------------------------  ------------------------

aix212                                passed

aix211                                passed

Result: User equivalence check passed for user "oracle".

Checking Cluster manager integrity...

Checking CSS daemon...

Node Name                             Status

------------------------------------  ------------------------

aix212                                running

aix211                                running

Result: Daemon status check passed for "CSS daemon".

Cluster manager integrity check passed.

Checking cluster integrity...

Node Name

------------------------------------

aix211

aix212

Cluster integrity check passed

Checking OCR integrity...

Checking the absence of a non-clustered configuration...

All nodes free of non-clustered, local-only configurations.

Uniqueness check for OCR device passed.

Checking the version of OCR...

OCR of correct Version "2" exists.

Checking data integrity of OCR...

Data integrity check for OCR passed.

OCR integrity check passed.

Checking CRS integrity...

Checking daemon liveness...

Check: Liveness for "CRS daemon"

Node Name                             Running

------------------------------------  ------------------------

aix212                                yes

aix211                                yes

Result: Liveness check passed for "CRS daemon".

Checking daemon liveness...

Check: Liveness for "CSS daemon"

Node Name                             Running

------------------------------------  ------------------------

aix212                                yes

aix211                                yes

Result: Liveness check passed for "CSS daemon".

Checking daemon liveness...

Check: Liveness for "EVM daemon"

Node Name                             Running

------------------------------------  ------------------------

aix212                                yes

aix211                                yes

Result: Liveness check passed for "EVM daemon".

Liveness of all the daemons

Node Name     CRS daemon                CSS daemon                EVM daemon

------------  ------------------------  ------------------------  ----------

aix212        yes                       yes                       yes

aix211        yes                       yes                       yes

Checking CRS health...

Check: Health of CRS

Node Name                             CRS OK?

------------------------------------  ------------------------

aix212                                yes

aix211                                yes

Result: CRS health check passed.

CRS integrity check passed.

Checking node application existence...

Checking existence of VIP node application

Node Name     Required                  Status                    Comment

------------  ------------------------  ------------------------  ----------

aix212        yes                       exists                    passed

aix211        yes                       exists                    passed

Result: Check passed.

Checking existence of ONS node application

Node Name     Required                  Status                    Comment

------------  ------------------------  ------------------------  ----------

aix212        no                        exists                    passed

aix211        no                        exists                    passed

Result: Check passed.

Checking existence of GSD node application

Node Name     Required                  Status                    Comment

------------  ------------------------  ------------------------  ----------

aix212        no                        exists                    passed

aix211        no                        exists                    passed

Result: Check passed.

Post-check for cluster services setup was successful.

[[email protected] cluvfy]$

@至此,CRS安装成功,问题解决!


AIX 5.3 Install Oracle 10g RAC 错误集锦--磁盘I/O 故障

时间: 2024-10-29 06:26:58

AIX 5.3 Install Oracle 10g RAC 错误集锦--磁盘I/O 故障的相关文章

AIX 5.3 Install Oracle 10g RAC 错误集锦--9i RAC present

AIX 5.3 Install Oracle 10g RAC 错误集锦--9i RAC Present 系统环境: 操作系统: AIX 5300-08 集群软件: CRS 10.2.0.1 数据库:   Oracle 10.2.0.1                     系统架构图 故障现象: 解决方法: [[email protected] tmp]# cat /etc/oratab # This file is used by ORACLE utilities.  It is creat

AIX 5.3下Oracle 10g RAC 启动故障--CRS启动失败

AIX 5.3下Oracle 10g RAC 启动故障--CRS启动失败 系统环境: 操作系统: AIX 5300-09 集群软件: CRS 10.2.0.1 数据库:   Oracle 10.2.0.1                     系统架构图 故障现象: 系统重启后,在节点上CRS 启动失败或CRS服务启动成功,CRS Resource无法ONLINE. [[email protected] racg] cat /etc/hosts 127.0.0.1              

AIX 5.3安装Oracle 10g错误案例--ruInstaller

AIX 5.3安装Oracle 10g错误案例--runInstaller 系统环境: 操作系统:  AIX5300-08 Oracle软件:Oracle 10g(64位) 错误现象:   客户一个AIX主机,以前安装过Oracle,卸载Oracle软件后,并更改了Oracle用户的UID,重新安装Oracle出现以下错误: 查看文件系统权限: [[email protected] /]#ls -l /u01/app/ total 16 drwxrwx---    6 oracle   oins

Oracle 10g RAC Installer 故障之--Oracle软件安装无法发现集群

Oracle 10g RAC Installer 故障之--Oracle软件安装无法发现集群 系统环境: 操作系统: AIX 5300-08 集群软件: CRS 10.2.0.1 数据库:   Oracle 10.2.0.1 一.故障现象        在安装CRS后,安装Oracle软件时失败,重新安装,删除了/u01/app/oracle下所有的目录,导致Oracle再次安装时不能识别集群环境. 查看Oracle软件安装索引目录:/u01/app/oracle/oraInventory,已经

redhat 下 oracle 10G RAC 集群 详细安装

在大家做RAC安装测试搭建环境时,没有存储环境下,我来教大家怎么采用虚拟机来安装 ORACLE 10 rac,这样可以让大家更快学习好 ORACLE 10 RAC ,我会把很详细的安装写给大家. 1.安装前的准备 准备需要软件 10201_clusterware_linux_x86_64.cpio.gz 10201_database_linux_x86_64.cpio.gz binutils-2.17.50.0.6-6.0.1.el5.x86_64.rpm oracleasm-2.6.18-16

RedHat EL5 安装Oracle 10g RAC之--CRS 安装

RedHat EL5 安装Oracle 10g RAC之--CRS 安装 系统环境: 操作系统:RedHat EL5 Cluster: Oracle CRS 10.2.0.1.0 Oracle:  Oracle 10.2.0.1.0 如图所示:RAC 系统架构 二.CRS 安装   Cluster Ready Service是Oracle 构建RAC,负责集群资源管理的软件,在搭建RAC中必须首先安装. 安装需采用图形化方式,以Oracle用户的身份安装(在node1上): 注意:修改安装配置文

RedHat EL5 安装Oracle 10g RAC之--建库

RedHat EL5 安装Oracle 10g RAC之--建库 系统环境: 操作系统:RedHat EL5 Cluster: Oracle CRS 10.2.0.1.0 Oracle:  Oracle 10.2.0.1.0 如图所示:RAC 系统架构 四.通过DBCA建立数据库 1.首先通过DBCA配置ASM磁盘组 [[email protected] install]$dbca 选择RAC环境建库 选择ASM磁盘组管理 创建ASM Instance ,注意选择用pfile ,易于管理 创建A

RedHat EL5 安装Oracle 10g RAC之--系统环境配置(1)

RedHat EL5 安装Oracle 10g RAC之--系统环境配置(1) 系统环境: 操作系统:RedHat EL5 Cluster: Oracle CRS 10.2.0.1.0 Oracle:  Oracle 10.2.0.1.0 如图所示:RAC 系统架构 一.操作系统环境 1.网络配置(在所有节点) [[email protected] ~]# cat /etc/hosts # Do not remove the following line, or various programs

ORACLE 10G RAC 节点自动重启故障处理

将数据库集群升级到10.2.0.5之后,双节点服务器不断重启,查询oracle oprocd进程日志,信息如下: Jul 03 08:16:34.702 | INF | monitoring started with timeout(1000), margin(500), skewTimeout(125) Jul 03 08:16:34.704 | INF | fatal mode startup, setting process to fatal mode 可以看到看到oprocd进程的时间间