DBAmon |
Home |
Index/DBAmon Doc. |
|
Usage: dbamon_checker
Purpose: Detect hung remsh processes and kill them if necessary AND restart DBAmon in daemon mode.
Run From: Usually cron
The primary purpose of dbamon_checker is to detect if DBAmon is running. If it is not, then dbamon_checker starts DBAmon by issuing dbamon. If running in Multi-Task mode, if any tasks have died, then dbamon_checker will issue dbamon_stop followed by dbamon.
It also checks to see if any remsh or ssh connects to DB servers have been running for >= 15 minutes. In this case, the offending command is killed. If you do not run this utility frequently, then dbamon will hang when it checks a DB metric on a server that is hung or a server with a hung DB preventing all monitoring from occuring on any server.
Example dbamon_checker cron entry:
#Run dbamon_checker every 5 minutes 00,05,10,15,20,25,30,35,40,45,50,55 * * * * /opt/dbamon/bin/dbamon_checker >> /tmp/dbamon_checker.log 2>&1