A Case of an Unexpected Oracle Database Restart Caused by VCS

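1. Environment

Operating system: SUSE Linux Enterprise Server 10 SP2 (x86_64)

Database version: Oracle 11.1.0.7.16

VCS version: 5.1

Blog URL: http://blog.csdn.net/hw_libo/article/details/41171561

2. Symptoms and analysis

At around 3:46 a.m., a database failure alert arrived by SMS.

(1) Check the database alert log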

Sun Nov 16 03:46:51 2014
Stopping background process SMCO
Stopping background process FBDA
Shutting down instance: further logons disabled
Sun Nov 16 03:46:54 2014
kkjcre1p: unable to spawn jobq slave process, slot 0, error 1089
Stopping background process CJQ0
Stopping background process QMNC
Stopping background process MMNL
Stopping background process MMON
Sun Nov 16 03:47:03 2014
Shutting down instance (immediate)
License high water mark = 366
Sun Nov 16 03:51:52 2014
License high water mark = 366
USER (ospid: 8654): terminating the instance
Sun Nov 16 03:51:52 2014
ORA-1092 : opiodr aborting process unknown ospid (28656_47367034883680)
Sun Nov 16 03:51:52 2014
LGWR waiting for instance termination

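The alert log shows the instance being shut down (immediate) and then terminated, but it gives no obvious reason for the shutdown.

The report for that time window was also reviewed: database load was low and there was no abnormal SQL.

(2) Check OS resource usage

System resource usage was normal, and I/O was not high.

(3) Check the messages log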

Nov 16 03:42:46 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13027 Thread(4149214112) Resource(ora_vg) - monitor procedure did not complete within the expected time.
Nov 16 03:42:46 boscodb Had[10258]: VCS ERROR V-16-2-13027 (boscodb) Resource(ora_vg) - monitor procedure did not complete within the expected time.
Nov 16 03:43:01 boscodb su: (to root) root on none
……
Nov 16 03:45:39 boscodb su: (to oracle) root on none
Nov 16 03:45:47 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13210 Thread(4154813344) Agent is calling clean for resource(ora_vg) because 4 successive invocations of the monitor procedure did not complete within the expected time.
Nov 16 03:45:47 boscodb Had[10258]: VCS ERROR V-16-2-13210 (boscodb) Agent is calling clean for resource(ora_vg) because 4 successive invocations of the monitor procedure did not complete within the expected time.
……
Nov 16 03:46:48 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13006 Thread(4153760672) Resource(ora_vg): clean procedure did not complete within the expected time.
Nov 16 03:46:48 boscodb Had[10258]: VCS ERROR V-16-2-13006 (boscodb) Resource(ora_vg): clean procedure did not complete within the expected time.
Nov 16 03:46:49 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13067 Thread(4155194272) Agent is calling clean for resource(ora_archive) because the resource became OFFLINE unexpectedly, on its own.
Nov 16 03:46:50 boscodb Had[10258]: VCS ERROR V-16-2-13067 (boscodb) Agent is calling clean for resource(ora_archive) because the resource became OFFLINE unexpectedly, on its own.
Nov 16 03:46:51 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13068 Thread(4155194272) Resource(ora_archive) - clean completed successfully.

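So the trigger was VCS: monitoring of the vg resource timed out. After 4 successive monitor timeouts the agent called clean for ora_vg (03:45:47), and the database shutdown seen in the alert log began shortly afterwards (03:46:51). The same errors are also recorded in the VCS logs.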
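
To see at a glance how often each resource hit these timeouts, the messages log can be summarized by VCS message ID and resource. The following is only a sketch: the function name summarize_vcs_timeouts is made up for illustration, the default path /var/log/messages is an assumption about where syslog writes on this host, and the three message IDs are simply the ones that appear in the excerpt above.

```shell
#!/bin/sh
# Hypothetical helper: count VCS agent timeout messages per message ID and resource.
# $1 is the log file; /var/log/messages is an assumed default.
summarize_vcs_timeouts() {
    log="${1:-/var/log/messages}"
    # Only AgentFramework lines are counted, because Had logs the same
    # event a second time and would double every count.
    awk '/AgentFramework/ && /V-16-2-(13027|13210|13006)/ {
        match($0, /V-16-2-[0-9]+/)
        id = substr($0, RSTART, RLENGTH)
        match($0, /[Rr]esource\([^)]*\)/)
        # Skip the 9 characters of "Resource(" and drop the closing ")".
        res = substr($0, RSTART + 9, RLENGTH - 10)
        count[id " " res]++
    }
    END { for (k in count) print count[k], k }' "$log" | sort -k2
}

# Example:
# summarize_vcs_timeouts /var/log/messages
```

Each output line has the form "count message-id resource", so a resource whose monitor times out repeatedly stands out immediately.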

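3. Solution

The monitor script for the vg was checked and showed nothing abnormal. The same script has also never caused problems on other databases, including ones under much heavier load.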
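
One way to check whether the monitor script ever comes close to its timeout is to run it by hand a few times and record how long each run takes. This is a minimal sketch, not part of the original investigation: time_runs is a hypothetical helper, it measures whole seconds only, and a manual run may not reproduce the conditions the VCS agent saw at 03:42.

```shell
#!/bin/sh
# Hypothetical helper: run a command COUNT times and print the exit code
# and elapsed whole seconds of each run.
# Usage: time_runs COUNT COMMAND [ARGS...]
time_runs() {
    count="$1"
    shift
    i=1
    while [ "$i" -le "$count" ]; do
        start=$(date +%s)
        "$@" >/dev/null 2>&1
        rc=$?
        end=$(date +%s)
        echo "run=$i exit=$rc elapsed=$((end - start))s"
        i=$((i + 1))
    done
}

# Example, using the MonitorProgram path from the ora_vg resource:
# time_runs 5 /opt/BOSCO_ora/bin/vg_monitor.sh
```

If the elapsed values stay far below MonitorTimeout, the script itself is unlikely to be the problem, and the timeout is more likely due to a transient stall on the host.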

The current configuration is:

        Application ora_vg (
                StartProgram = "/opt/BOSCO_ora/bin/vg_start.sh"
                StopProgram = "/opt/BOSCO_ora/bin/vg_stop.sh"
                CleanProgram = "/opt/BOSCO_ora/bin/vg_clean.sh"
                MonitorProgram = "/opt/BOSCO_ora/bin/vg_monitor.sh"
                MonitorInterval = 30
                MonitorTimeout = 30
                )

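The decision was to temporarily raise the monitor timeout from 30s to 90s. There are two ways to do this:

(1) Edit main.cf. This requires restarting VCS, and there was no maintenance window for that.

(2) Change it dynamically from the command line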

haconf -makerw

hares -modify ora_vg MonitorTimeout 90   ## raise the monitor timeout to 90s (the default is 60s)

haconf -dump -makero   ## write the change to main.cf and set the configuration back to read-only
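
After the change it is worth confirming the value that VCS actually holds for the resource. The sketch below wraps the check in a hypothetical helper, show_monitor_timeout, which uses hares -value and falls back to a notice when the VCS command-line tools are not in PATH (on many installations they live under /opt/VRTSvcs/bin, which is an assumption about this host).

```shell
#!/bin/sh
# Hypothetical helper: print the current MonitorTimeout of a resource,
# or a notice if the VCS CLI is not available on this host.
show_monitor_timeout() {
    res="${1:-ora_vg}"
    if command -v hares >/dev/null 2>&1; then
        hares -value "$res" MonitorTimeout
    else
        echo "hares not found in PATH; run this on a VCS node"
    fi
}

# Example:
# show_monitor_timeout ora_vg
```

On a cluster node this should print 90 once the hares -modify command has taken effect.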


-- Bosco  QQ:375612082

---- END ----

-------------------------------------------------------------------------------------------------------

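All rights reserved. This article may be reposted, but the source address must be cited as a link; otherwise legal liability will be pursued.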

Time: 2024-08-25 11:22:58
