Checking root mail daily will help. Checking the report from your backup software daily will allow you to sleep better at night. Ed Fair
efair@atlnet.com
Any advice I give is my best judgement based on my interpretation of the facts you supply.
Help increase my knowledge by providing some feedback, good or bad, on any advice I have given.
Since you are new to Unix I would suggest that you start off with performing your checks manually, not by script. This will help you improve your skills and familiarily with you environment. Do a shell script later, and if you are not real strong with Korn Shell scripting, that would be a good introduction for you.
It really depends upon what type of support you provide, what applications are running, etc. to tell you what to check. DNS? NIS? Oracle? Progress? Telnet sessions? Webserver? Size/age/version? Basic admin duties only, Performance monitoring/tuning, application support, security?
Here's some very basic things to put on your list to start with. I am AIX totally so please pardon me if you are using another version of Unix. I believe most of these work for Sun and probably HP.
1. Check your disk space. Do a "df -k" or just a "df". The -k option gives you the disk space count in 1024 and without it gives you in 512. Check this frequently. Even print it out and keep it posted so you can do comparisons to tell what filesystems are growing and how fast.
2. See who is logged on to you system. If you are running applications like Oracle, you won't see the users logged in, except like the Oracle administrators. If you are running applications like Progress, you will most likely see lots of users. You can check these with "last" and "who". For the "last", do it with "last|pg" or it will scroll lots of info. You can also do "w -u" and see a count of the users, how long you system has been up and a load average.
3. Check your error logs. Do "errpt|pg" to see the summary of errors. If there are some, then do "errpt -a" to get more detail.
4. See what processes are running. Do "ps -ef|pg" and scroll through the process table and get familiar with what is running.
5. Check that all you interfaces are up and running. Do "netstat -nr" and "ifconfig -a".
6. If you support printing, check your queues. Do "lpstat" and see if anything looks to be stopped.
7. Check you system load/balance. Do "sar 1 100". This will give you the percentages used by USR, SYS, WIO, IDLE. USR is your user activity and applications. SYS is your CPU. WIO is disk and paging space. IDLE is just that. The 1 100 will give you 100 readings every 1 second.
8. You may want to check out using the vmstat and iostat for more performance checks.
9. Check you Source Master processes. Do "lssrc -a|pg" to view what subprocesses are running.
11. If you are responsible for application support, find out what logs need to be reviewed. Some applications require a lot of monitoring and others do not.
Again, these are just a few real basics to start with. Obviously there is much more.
Bookmark these websites and visit them frequently. (these are AIX). If not AIX, find the support sites for your Unix/OS. If AIX, start frequenting the AIX forum on here in Tek-tips.
Your Disk Utilisation
Your Backup Status
Your /var/adm/messages
The Last command which tells u who has logged in your server
Your root file system
Find for files with setuid permissions (man find)
more /var/adm/messages
more /var/adm/syslog/syslog.log
ps -ef|grep defunct
top
last root |head -10
lastb root |head -10
bdf |more
more /.rhosts
more /etc/shutdownlog
who -u |more
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.