Channel: All Server Management - Systems Insight Manager posts
Viewing all articles
Browse latest Browse all 4342

Re: Disable specific alert on specific server in HP SIM


Yes, it will. Be very careful. The directions given are fantastic and will do the job. Most of us want this BECAUSE we are on call, and since this is considered a critical alert, it will trigger a page if that is how your HPSIM server is set up. I talked to our HPSIM server manager, and he stated that the only real way to disable the alert is to kill the services on the server, and sadly you'd have to kill ALL of the HP agent services.
The other way is to go into the HPSIM console, find the server you want to manage, and select it. You should be at the Main/Home screen of the server. From there, click on "Tools & Links"; under "HP Systems Insight Manager Pages" there will be some links. Click the link that reads "Suspend/Resume Monitoring". In here you will see three options:
1. Enable monitoring of this system
2. Suspend monitoring of this system for (choose the time with a drop-down menu)
3. Suspend monitoring of this system indefinitely
I personally do not want to suspend monitoring all day, just until the battery gets here and I have a chance to install it. Since I don't want to get woken up at 2am by this "PAGE", which is usually a duplicate, before I go to bed I remote into HPSIM from home and choose option 2, "Suspend monitoring of this system for...", and pick 8 hours, 9 hours, or however long I need so that I am not woken up by this server.
It is not optimal. We used to have HPSIM NOT page in the middle of the night, because with nearly 1700 servers we would never get any sleep. We ended up enabling overnight paging, but we placed a delay on conditions such as "server unreachable" in case someone had a scheduled reboot. We didn't want to know about those. But consider a large data center where the HVAC fails and it starts to get extremely hot: if a server goes down and stays down beyond the delay period, we get paged. At that point we know it is a down server and not a reboot, and we are on the lookout. If another one goes down beyond the delay, then another, and another, then it's obvious, and we are able to resolve an HVAC or overheating issue in the middle of the night. The alternative is not knowing that half our servers are down until the morning employees start to come in, or until we walk into an extremely hot room and have to call for all IT hands on deck!
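For anyone who wants to see the delay idea spelled out, here is a rough Python sketch of the logic only. It is not HPSIM's API or configuration; the `Outage` class, both function names, the 300-second delay, and the threshold of 2 are all made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Outage:
    """Hypothetical record: a server and when it became unreachable (seconds)."""
    server: str
    down_since: float


def servers_to_page(outages, now, delay_seconds):
    """Return servers that have been unreachable for at least the delay.

    Anything down for less than the delay is treated as a possible
    scheduled reboot and does not page yet.
    """
    return [o.server for o in outages if now - o.down_since >= delay_seconds]


def looks_site_wide(paged_servers, threshold):
    """True when enough servers are past the delay to suggest a room-level
    problem (for example HVAC) rather than a single failed box."""
    return len(paged_servers) >= threshold


# Assumed example values, not taken from any real HPSIM setup.
outages = [
    Outage("web01", down_since=0.0),
    Outage("db02", down_since=600.0),
    Outage("app03", down_since=840.0),  # only 60 s down, probably a reboot
]
paged = servers_to_page(outages, now=900.0, delay_seconds=300.0)
site_wide = looks_site_wide(paged, threshold=2)
```

With these sample values, `web01` and `db02` are past the delay and would page, while `app03` is still inside the reboot window. The right delay and threshold depend entirely on your own environment.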
Your follow-up question about whether this change affects all servers was a good one, and it prompted several things to be considered that may not have been considered earlier.
Thanks, and I look forward to feedback, IF it is not rude and argumentative! Please let's have a tasteful exchange if this topic continues, and not blast each other. Each of us has a different datacenter setup, and we cannot always divulge why we do some things, because it would reveal what type of business we work for. We all know there are decisions we make based on reasons we cannot share, so consider that before someone gets rude and unprofessional. I can't stand that :-) just saying...

