No commit activity in last 3 years
No release in over 3 years
This plugin provides hardware monitoring by checking the system message buffer for "Hardware Error" entries using `dmesg`.
 Dependencies

Development

~> 2.1
~> 0.10
~> 13.0
~> 3.2
~> 3.4
~> 0.81.0
~> 0.8

Runtime

>= 0.9.6, < 1.2.0

Sensu-Plugins-hardware

Functionality

check-hardware-fail searches the output of dmesg for lines matching a provided query. It accepts the --facility, --level and --kernel options, which control how the dmesg command is run. It returns CRITICAL if any match is found and UNKNOWN if the provided options are invalid or the command execution fails.
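
The matching step described above can be sketched in a few lines of Ruby. This is an illustration only, not the plugin's source: `classify_dmesg` is a hypothetical helper, it works on text that has already been captured from dmesg, and returning `:ok` when nothing matches is an assumption, since this README only documents the CRITICAL and UNKNOWN outcomes.

```ruby
# Hypothetical helper (not the plugin's code): classify captured dmesg
# text against a query and return a [status, message] pair.
def classify_dmesg(output, query)
  pattern = Regexp.new(query) # raises RegexpError on a malformed pattern
  matches = output.each_line.map(&:chomp).grep(pattern)
  if matches.empty?
    # Assumption: no match means OK; the README does not state this.
    [:ok, 'no matching lines']
  else
    [:critical, "#{matches.size} matching line(s), first: #{matches.first}"]
  end
rescue RegexpError => e
  # Invalid input maps to UNKNOWN, as the README describes for bad options.
  [:unknown, "invalid query: #{e.message}"]
end

sample = "[    1.000000] usb 1-1: new device\n" \
         "[    2.000000] mce: [Hardware Error]: Machine check events logged\n"
status, message = classify_dmesg(sample, 'Hardware Error')
puts "#{status}: #{message}"
```

Treating the query as a regular expression also covers plain literal strings, as long as they contain no regex metacharacters.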

Files

  • bin/check-hardware-fail.rb

Usage

Usage: ./check-hardware-fail.rb (options)
    -f FACILITY[,FACILITY],          Restrict output to defined facilities. Supported log facilities: kern,user,mail,daemon,auth,syslog,lpr,news
        --facility
        --invert                     Invert order
    -k, --kernel                     Include kernel messages
    -L, --level LEVEL[,LEVEL]        Restrict output to defined levels, otherwise all levels are included. Supported log levels: emerg,alert,crit,err,warn,notice,info,debug
    -l, --lines NUMBER               Maximum number of lines to read from dmesg, 0 (default) means all
    -q, --query QUERY                What pattern to look for in the output of dmesg (regex or literal)
    -s, --seconds SECONDS            Amount of seconds to lookbehind from dmesg output. This option is incompatible with --lines
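
One way to picture how the facility, level and kernel options could be turned into a dmesg invocation is the sketch below. It is an assumption-laden illustration rather than the plugin's implementation: `dmesg_command` and `validate_list` are hypothetical names, the allowed values are copied from the help text above, and the `--facility`, `--level` and `--kernel` flags are those of the util-linux dmesg.

```ruby
# Hypothetical helper: reject values outside the documented list.
def validate_list(value, allowed, name)
  items = value.split(',')
  bad = items - allowed
  raise ArgumentError, "unsupported #{name}: #{bad.join(',')}" unless bad.empty?
  items.join(',')
end

# Hypothetical helper: build the argument list for dmesg.
# A caller would report UNKNOWN when ArgumentError is raised.
def dmesg_command(facility: nil, level: nil, kernel: false)
  facilities = %w[kern user mail daemon auth syslog lpr news]
  levels = %w[emerg alert crit err warn notice info debug]
  cmd = ['dmesg']
  cmd << '--kernel' if kernel
  cmd += ['--facility', validate_list(facility, facilities, 'facility')] if facility
  cmd += ['--level', validate_list(level, levels, 'level')] if level
  cmd
end

puts dmesg_command(facility: 'auth,syslog', level: 'err,crit').join(' ')
```

Building the command as an array rather than a single string avoids shell quoting issues if it is later passed to something like `Open3.capture2e`.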

Example of usage:

Check the first 100 lines for 'killed as a result of limit'

check-hardware-fail.rb -l 100 --invert -q 'killed as a result of limit'

Check the last 100 lines for 'killed as a result of limit'

check-hardware-fail.rb -l 100 -q 'killed as a result of limit'
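
The difference between the two examples above comes down to which end of the dmesg output is kept. A minimal sketch of that selection, assuming the output is already split into an array of lines (`select_lines` is a hypothetical name, not a function from the plugin):

```ruby
# Hypothetical helper: keep the last `limit` lines by default, or the
# first `limit` lines when invert is set. A limit of 0 keeps everything,
# matching the documented default for --lines.
def select_lines(lines, limit: 0, invert: false)
  return lines if limit.zero?

  invert ? lines.first(limit) : lines.last(limit)
end

lines = %w[boot probe mount oom-kill shutdown]
p select_lines(lines, limit: 2)
p select_lines(lines, limit: 2, invert: true)
```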

The following options are only available on Linux:

  • --seconds Number of seconds to look back in the dmesg output. This option is incompatible with --lines
  • --facility Restrict output to defined facilities. Supported log facilities: kern,user,mail,daemon,auth,syslog,lpr,news
  • --level Restrict output to defined levels, otherwise all levels are included. Supported log levels: emerg,alert,crit,err,warn,notice,info,debug
  • --kernel Include kernel messages

Check the last 300 seconds for 'killed as a result of limit'

check-hardware-fail.rb -s 300 -q 'killed as a result of limit'

Check the last 300 seconds for 'killed' on auth and syslog facilities

check-hardware-fail.rb -s 300 -f auth,syslog -q 'killed'
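
A time window like the one used with --seconds can be illustrated as follows. This sketch makes several assumptions that the README does not confirm: lines carry the default dmesg `[seconds.microseconds]` stamp measured from boot, the current uptime is supplied by the caller (on Linux it could be read from /proc/uptime), and `recent_lines` is a hypothetical name. The plugin itself may compute the window differently.

```ruby
# Hypothetical helper: keep lines stamped within the last `seconds`
# seconds, given the current uptime in seconds. Lines without a
# leading "[123.456]" stamp are dropped.
def recent_lines(lines, uptime:, seconds:)
  cutoff = uptime - seconds
  lines.select do |line|
    stamp = line[/\A\[\s*(\d+\.\d+)\]/, 1]
    stamp && stamp.to_f >= cutoff
  end
end

lines = [
  '[  100.000000] Memory cgroup out of memory: old event',
  '[  950.500000] Task killed as a result of limit of /docker',
  'line without a timestamp'
]
puts recent_lines(lines, uptime: 1000.0, seconds: 300)
```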

Installation

Installation and Setup

Notes