【硬件】DELL服务器硬件监控及DELL系统管理工具OMSA介绍

1.1.1. DELL服务器硬件监控及DELL系统管理工具OMSA介绍

本文介绍采用使用Nagios和OMSA监控DELL服务器的硬件健康状态,Nagios监控的方式是NRPE模式,需要配置check_openmanage脚本和安装DELL的OMSA工具。

使用OpenManage和Nagios监控DELL服务器硬件部署手册:

http://folk.uio.no/trondham/software/check_openmanage.html

1)        OMSA是什么

OMSA是Dell Openmanage Server Administrator的缩写。

Dell OpenManage Server Administrator (OMSA) provides acomprehensive, one-to-one systems management solution in two ways: from anintegrated, web browser-based graphical user interface (GUI) and from a commandline interface (CLI) through the operating system.
Server Administrator isdesigned for system administrators to manage systems locally and remotely on anetwork. It allows system administrators to focus on managing their entirenetwork by providing comprehensive one-to-one systems management.

2)        安装OMSA

DELL OpenManage的yum源地址:

http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/

配置OMSA的yum源:

创建/etc/yum.repos.d/dell-omsa-repository.repo源文件:

或者

执行命令自动配置:

wget -q -O -http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/bootstrap.cgi |bash

安装OMSA:

yum install srvadmin-all

配置OMSA服务开机自动:

/opt/dell/srvadmin/sbin/srvadmin-services.sh enable

启动OMSA的服务:

/opt/dell/srvadmin/sbin/srvadmin-services.sh start

查看omsa的服务状态:

/opt/dell/srvadmin/sbin/srvadmin-services.sh status

dell_rbu(module) is running

ipmidriver is running

dsm_sa_datamgrd(pid 1331 1197) is running

dsm_sa_eventmgrd(pid 1381) is running

dsm_sa_snmpd(pid 1440) is running

dsm_om_shrsvcd(pid 1508) is running...

dsm_om_connsvcd(pid  1562) is running...

查看omsa的服务的监听状态:

# netstat -npae | egrep -iv‘mysql|ssh|xinetd|udevd|crond|syslogd|upstart|auditd‘

我们注意到dsm_om_connsvc服务监听了TCP协议的1311端口,并且提供了http访问的功能。

3)        卸载OMSA工具包中的web组件

由于本案例中只需要使用OMSA工具包监控硬件健康状态的功能,不需要使用OMSA提供的Web管理功能,所以为了避免由于系统防火墙或者web管理不当导致的问题,我们在本例中卸载掉OMSA的web组件。

查看OMSA组件的监听情况:

# netstat -npae | egrep -iv ‘mysql|ssh|xinetd|udevd|crond|syslogd|upstart|auditd‘

ActiveInternet connections (servers and established)

ProtoRecv-Q Send-Q Local Address              Foreign Address            State       User       Inode      PID/Program name

tcp        0     0 :::1311                     :::*                        LISTEN      0         656427     1563/dsm_om_connsvc

ActiveUNIX domain sockets (servers and established)

ProtoRefCnt Flags       Type       State         I-Node PID/Program name    Path

unix  2     [ ACC ]     STREAM    LISTENING     6555231197/dsm_sa_datamgr /opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipea

unix  2     [ ACC ]     STREAM     LISTENING     655525 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipep

unix  2     [ ACC ]     STREAM     LISTENING     655527 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipeu

unix  2     [ ACC ]     STREAM     LISTENING     655770 1508/dsm_om_shrsvcd/opt/dell/srvadmin/var/lib/openmanage/shrsvc/dsm_om_shrsvc

unix  2     [ ACC ]     STREAM     LISTENING     655772 1508/dsm_om_shrsvcd/opt/dell/srvadmin/var/lib/openmanage/shrsvc/omintf5e4

unix  2     [ ]         STREAM     CONNECTED     656423 1563/dsm_om_connsvc

unix  2     [ ]         STREAM     CONNECTED     656015 1563/dsm_om_connsvc

unix  3     [ ]         STREAM     CONNECTED     655972 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipea

unix  3     [ ]         STREAM     CONNECTED     655971 1563/dsm_om_connsvc

unix  3     [ ]         STREAM     CONNECTED     655650 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipea

unix  3     [ ]         STREAM     CONNECTED     655649 1440/dsm_sa_snmpd

unix  3     [ ]         STREAM     CONNECTED     655589 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/.ipc/dcsmilpipea

unix  3      [ ]         STREAM     CONNECTED     655588 1381/dsm_sa_eventmg

OMSA组件的dsm_om_connsvc服务监听了TCP协议的1311端口。

查看dsm_om_connsvc服务调用的程序:

lsof -p 1563  # 1563是dsm_om_connsvc服务进程的PID

查看dsm_om_connsvc调用的程序文件属于哪个rpm包:

# rpm -qf /opt/dell/srvadmin/lib64/openmanage/apache-tomcat/lib/tomcat-api.jar

srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64

# rpm -qf /opt/dell/srvadmin/lib64/openmanage/jre/lib/jce.jar

srvadmin-jre-7.4.0-4.98.1.el6.x86_64

关闭OMSA的服务:

/opt/dell/srvadmin/sbin/srvadmin-services.sh stop

卸载srvadmin-tomcat和srvadmin-jre软件包:

# rpm -e srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64

error: Failed dependencies:

srvadmin-tomcat = 7.4.0 is needed by (installed)srvadmin-webserver-7.4.0-4.1.1.el6.x86_64

# rpm -e srvadmin-webserver-7.4.0-4.1.1.el6.x86_64

error: Failed dependencies:

srvadmin-webserver = 7.4.0 is needed by (installed)srvadmin-all-7.4.0-4.1.1.el6.x86_64

卸载srvadmin-webserver软件包:

# rpm -e --nodeps srvadmin-webserver-7.4.0-4.1.1.el6.x86_64

# rpm -e srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64

# rpm -e srvadmin-jre-7.4.0-4.98.1.el6.x86_64

删除appache-tomcat的目录:

# rm -rf /opt/dell/srvadmin/lib64/openmanage/apache-tomcat

启动OMSA的服务:

# /opt/dell/srvadmin/sbin/srvadmin-services.sh start

StartingSystems Management Device Drivers:

Startingdell_rbu:                                         [  OK  ]

Startingipmi driver: Already started                      [  OK  ]

StartingSystems Management Data Engine:

Startingdsm_sa_datamgrd:                                  [  OK  ]

Startingdsm_sa_eventmgrd:                                 [  OK  ]

Startingdsm_sa_snmpd:                                     [  OK  ]

Starting DSMSA Shared Services:                          [  OK  ]

测试check_openmanage脚本:

./check_openmanage -d

输出的检查项跟卸载OMSA的web组件之前是一样的。

查看卸载OMSA的web组件之后,OMSA的服务监听情况:

# netstat -npae | egrep -iv‘mysql|ssh|xinetd|udevd|crond|syslogd|upstart|auditd‘

此时,OMSA的服务只进行了Unix domain sockets的监听,而Unix domain socket只用于本操作系统中,进程之间的通信,比如check_openmanage脚本调用OMSA的服务进行DELL服务器硬件健康状态检查。

至此,OMSA的Web组件卸载完毕。

4)        安装check_openmanage安装包

下载地址:

http://folk.uio.no/trondham/software/check_openmanage.html#download

下载check_openmanage工具包:

wget http://folk.uio.no/trondham/software/files/check_openmanage-3.7.11.tar.gz

测试check_openmanage工具:

tar zxf check_openmanage-3.7.11.tar.gz

cd check_openmanage-3.7.11

./check_openmanage -d

./check_openmanage

如果提示"Storage Error",则加上--no-storage参数:

./check_openmanage --no-storage

# check_openmanage会检查50~60项左右DELL服务的信息

5)        配置NRPE

编辑/usr/local/nagios/etc/nrpc.cfg文件,添加:

command[check_dell_openmanage]=/path/to/check_openmanage

或者

command[check_dell_openmanage]=/path/to/check_openmanage--no-storage

把check_openmanage脚本拷贝到/usr/local/nagios/libexec/目录:

cp check_openmanage-3.7.11/check_openmanage/usr/local/nagios/libexec/

测试命令:

check_nrpe -H IP -c check_dell_openmanage

6)        注意事项

check_openmanage是perl脚本,所以操作系统必须已经安装了perl。

7)        服务器没有外网怎么办

如果服务器没有外网,可以考虑在有外网的机器上做一个iptables的nat映射,把只有内网的服务器配置的yum映射到公网,或者在机房内部署一个yum源;

8)        还可以用什么方法

如果不使用OMSA和check_openmanage来监控硬件健康状态,还可以使用ipmitool来监控,不过需要自己开发脚本。

9)        适用什么环境使用

如果在使用DELL厂商的server,都建议使用。

10)    关于OMSA组件安全性的补充说明

Dell OpenManage Server Administrator (OMSA) 7.1及更早版本在实现上存在XSS漏洞,可允许远程攻击者注入Web脚本或HTML。DELL厂商已经发布了升级补丁来修复这个安全问题,请到厂商的主页下载,详见本文最后一部分的“本文相关资料的参考链接”部分的“OMSA组件安全补丁的下载地址”。

11)    DELL厂商提供的其它系统管理工具

DELL还提供用于Microsoft System Center管理配置工具、OracleEntreprise Manager 12c插件,以及支持HP和IBM的插件等工具。详情请登录DELL官方网站,依次进入Support--> Drivers & Downloads --> 选择服务器类型--> System Managements模块。

12)    DELL提供的用于OracleEnterprise Manager 12c的插件说明

Dell OpenManage Plug-in v1.0 for Oracle Enterprise Manager12c

Dell OpenManage Plug-in for Oracle Enterprise Managerprovides a proactive approach to data center management that delivers featuresfor monitoring Dell server, storage, and networking infrastructures inenvironment managed by Oracle Enterprise Manager (OEM).
It also supportsmapping of database workload to Dell hardware for quicker fault detection andconsole launch of Dell devices to perform troubleshooting, configuration, andmanagement activities. It protects customer’s existing investment in OEM consoleand helps
in ease of integration and management of Dell devices.

13)    什么是Unix domain socket

A Unixdomain socket or IPC socket (inter-process communication socket) is a datacommunications endpoint for exchanging data between processes executing withinthe same host operating system. While similar in functionality to named pipes,Unix domain sockets
may be created as connection?mode(SOCK_STREAM or SOCK_SEQPACKET) or as connectionless (SOCK_DGRAM), while pipesare streams only. Processes using Unix domain sockets do not need to share acommon ancestry. The API for Unix domain sockets is similar to that of
anInternet socket, but it does not use an underlying network protocol forcommunication. The Unix domain socket facility is a standard component of POSIXoperating systems.Unixdomain sockets use the file system as their address name space. They arereferenced
by processes as inodes in the file system. This allows two processesto open the same socket in order to communicate. However, communication occursentirely within the operating system kernel.In addition to sending data, processes may send filedescriptors across
a Unix domain socket connection using the sendmsg() andrecvmsg() system calls.

14)    本文相关资料的参考链接

自定义check_openmanage的阀值:

http://dreamway.blog.51cto.com/1281816/1048274

omreport命令使用:

http://www.sxszjzx.com/~t096/manual/sc/Dosa/CLI/report.htm

DELL厂商网址:

http://www.dell.com/support/drivers/us/en/04/ProductSelector/Select/FamilySelection?CategoryPath=all-products%2Fesuprt_ser_stor_net%2Fesuprt_poweredge&Family=PowerEdge&DisplayCrumbs=Product%2520Type%40%2CServers%252C%2520Storage%252C%2520%2526%2520Networking%40%2CPowerEdge&rquery=na

DELL厂商OMSA的yum源地址:

http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/

使用OpenManage和Nagios监控DELL服务器硬件部署手册:

http://folk.uio.no/trondham/software/check_openmanage.html

OMSA组件安全补丁的下载地址:

http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710?driverId=5JDN0&osCode=WNET&fileId=3082293694

http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710?driverId=PCXMR&osCode=WNET&fileId=3082295344

http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710?driverId=JJMWP&osCode=WNET&fileId=3082295338

DELL提供的用于Oracle Enterprise Manager 12c的插件地址:

http://www.dell.com/support/drivers/us/en/04/DriverDetails/Product/poweredge-r710?driverId=XKRM6&osCode=WS8R2&fileId=3356540401&languageCode=en&categoryId=SM

DELL厂商提供的其它管理工具的网址:

http://www.dell.com/support/drivers/us/en/04/ProductSelector/Select/FamilySelection?CategoryPath=all-products%2Fesuprt_ser_stor_net%2Fesuprt_poweredge&Family=PowerEdge&DisplayCrumbs=Product%2520Type%40%2CServers%252C%2520Storage%252C%2520%2526%2520Networking%40%2CPowerEdge&rquery=na

Unix domain socket的解释说明:

http://en.wikipedia.org/wiki/Unix_domain_socket

【硬件】DELL服务器硬件监控及DELL系统管理工具OMSA介绍,布布扣,bubuko.com

时间: 2024-07-30 13:38:53

【硬件】DELL服务器硬件监控及DELL系统管理工具OMSA介绍的相关文章

Hp && Dell服务器硬件监控

HP 安装HP工具: yum install hpssacli 1 查看控制器状态 raid卡型号等hpssacli ctrl all show status 2 查看硬盘类型.大小 raid级别.状态等等hpssacli ctrl all show config 3 查看所有控制器的详细信息hpssacli ctrl all show detail 4 查看slot 2 上面的控制器信息hpssacli ctrl slot=2 show detail 5 查看某个控制器(slot 2)下面的硬

Dell服务器硬件监控,使用omreport出现object not found 错误解决

一.监控告警 登录目标主机,查看dataeng服务 [[email protected]5-217 src]# systemctl status dataeng ● dataeng.service - LSB: Systems Management Data Engine Loaded: loaded (/etc/rc.d/init.d/dataeng; bad; vendor preset: disabled) Active: active (running) since 三 2019-08-

nagios监控dell服务器硬件

之前讲过利用omsa来监控dell服务器,但是必须通过web来访问,这次我们结合nagios+check_openmanage来实现dell服务器的硬件监控. 首先我们来看下面这张图: 由上图看出有两种方式可以实现监控: 1.nagios服务器端check_nrpe调用被监控端的check_openmanage来实现,这种方式要在被监控端安装omsa和check_openmanage 2.nagios服务器端直接通过check_openmanage来远程监控,这种方式要在nagios服务器端安装

Dell 服务器硬件信息监控,自动生成服务器资产信息

在对服务器的硬件监控上,目前业界主要基于如下两种: 1.服务器自带的工具,比如HP的hpacucli,DELL的OMSA等 2.智能平台管理接口 (IPMI,全称Intelligent Platform Management Interface) 基于以上两种,通过nagios.zabbix或自研运维平台等包裹,进行预警操作. 由于公司使用的服务器全部为DELL PowerEdge系列的,这里将以如何对DELL PowerEdge系列服务器硬件进行监控这一实际案例为主. 一.OMSA 介绍 DE

zabbix企业应用:通过SNMP和iDRAC监控DELL服务器硬件

监控DELL服务器硬件一般有两种途径:1.操作系统上安装OMSA,编写脚本调用omreport命令进行监控:2.使用iDRAC,可以不用在操作系统上安装OMSA,只需要在iDRAC上开启SNMP,zabbix通过SNMP进行监控.对于不支持OMSA的操作系统和要求不能安装额外软件的情况下,推荐使用SNMP监控,配置简单方便. 一.iDRAC开启SNMP服务 进入iDRAC的WEB界面,打开"网络"->"服务"->"SNMP代理". 启

服务器硬件监控转载:

转载:http://nolinux.blog.51cto.com/4824967/1664664 前一周上架服务器,本来准备把针对服务器的硬件监控这一部分总结下成文,结果过个端午给过忘了...今天就又整理了下,在这里分享给51博友,也算是对自己前段工作的一个总结. 在对服务器的硬件监控上,目前业界主要基于如下两种:        1.服务器自带的工具,比如HP的hpacucli,DELL的OMSA等        2.智能平台管理接口 (IPMI,全称Intelligent Platform M

服务器硬件监控之OMSA

前一周上架服务器,本来准备把针对服务器的硬件监控这一部分总结下成文,结果过个端午给过忘了...今天就又整理了下,在这里分享给51博友,也算是对自己前段工作的一个总结. 在对服务器的硬件监控上,目前业界主要基于如下两种:        1.服务器自带的工具,比如HP的hpacucli,DELL的OMSA等        2.智能平台管理接口 (IPMI,全称Intelligent Platform Management Interface) 基于以上两种,通过nagios.zabbix或自研运维平

服务器硬件监控之Check_openmanage

昨天分享了下服务器硬件监控的知识,在文章的结尾提到了check_openmanage工具.本文就主要介绍这个工具在服务器硬件监控方面的作用. 一.check_openmanage介绍 check_openmanage 是一个 Nagios 的插件,它基于 OMSA 获取相关的报道信息,用来检测安装有 OpenManage Server Administrator (OMSA) 的戴尔服务器的运行状态,包括存储系统.电源.温度等信息. 官网:http://folk.uio.no/trondham/

Linux基础之文件系统简介及其系统管理工具

常见的文件系统 Linux文件系统: ext2, ext3, ext4, xfs, btrfs, reiserfs, jfs, swap swap: 交换分区 光盘:iso9660 Windows:fat32, ntfs Unix: FFS, UFS, JFS2 网络文件系统:NFS, CIFS 集群文件系统:GFS2, OCFS2 分布式文件系统:ceph, moosefs, mogilefs, GlusterFS, Lustre 实验环境CentOS7.2 Linux根据其是否支持"jour