<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Opsview Labs &#187; System Management</title>
	<atom:link href="http://labs.opsview.com/system-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://labs.opsview.com</link>
	<description>Opsview&#039;s Engineering Blog</description>
	<lastBuildDate>Fri, 20 Jan 2012 09:32:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Website Monitoring in 10 Easy Steps with Opsview</title>
		<link>http://labs.opsview.com/2012/01/monitor-your-website-in-10-easy-steps-with-opsview/</link>
		<comments>http://labs.opsview.com/2012/01/monitor-your-website-in-10-easy-steps-with-opsview/#comments</comments>
		<pubDate>Wed, 18 Jan 2012 15:06:40 +0000</pubDate>
		<dc:creator>Christopher Buckley</dc:creator>
				<category><![CDATA[Configuration]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Apache]]></category>
		<category><![CDATA[service check]]></category>
		<category><![CDATA[ssl]]></category>
		<category><![CDATA[website check]]></category>
		<category><![CDATA[website monitoring]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=2119</guid>
		<description><![CDATA[
			
				
			
		For most companies websites are their corporate face to the world. Any downtime can be costly especially if the sites are used for e-commerce. Web monitoring checks can be set up quickly and easily in Opsview giving you powerful alerting capabilities to check on crashed servers, website attacks and more. Here are 10 easy steps [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2012%2F01%2Fmonitor-your-website-in-10-easy-steps-with-opsview%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2012%2F01%2Fmonitor-your-website-in-10-easy-steps-with-opsview%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/website.jpg"><img class="alignleft size-full wp-image-2133" title="website monitoring" src="http://labs.opsview.com/wp-content/uploads/2012/01/website.jpg" alt="website monitoring" width="175" height="141" /></a>For most companies websites are their corporate face to the world. Any downtime can be costly especially if the sites are used for e-commerce. <strong>Web monitoring</strong> checks can be set up <strong>quickly and easily</strong> in Opsview giving you <strong>powerful alerting</strong> capabilities to check on crashed servers, website attacks and more. Here are 10 easy steps to set up website monitoring in Opsview:<span id="more-2119"></span>1.	Login to Opsview and Go to  CONFIGURATION &gt; SERVICE CHECKS</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/service-check1.png"><img class="aligncenter size-full wp-image-2136" title="website monitoring configuration" src="http://labs.opsview.com/wp-content/uploads/2012/01/service-check1.png" alt="website monitoring configuration" width="500" height="253" /></a></p>
<p>2.	Click on &#8216;ACTIONS &gt; Create New Service Check&#8217;</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/service-check.png"><img class="aligncenter size-full wp-image-2122" title="create new website monitoring check" src="http://labs.opsview.com/wp-content/uploads/2012/01/service-check.png" alt="create new website monitoring check" width="322" height="128" /></a></p>
<p>3.	You will see a blank template, ready for you to fill with data. Populate the template with information to create an appropriate service check, in the example below we’ve set it up to warn us when our SSL certificate is within 30 days of expiring.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/service-check-filled1.png"><img class="aligncenter size-full wp-image-2124" title="website monitor check" src="http://labs.opsview.com/wp-content/uploads/2012/01/service-check-filled1.png" alt="website monitor check" width="500" height="418" /></a></p>
<p>4.	Before adding the check into your template you can test the syntax of your argument on the command line:</p>
<pre>nagios@ov-trn-s1:~$ check_http –H www.opsview.com –C 30
OK – Certificate will expire on 04/26/2012 10.23.</pre>
<p>5.	Once all the data is complete you need to assign the checks to a host. When you&#8217;ve got all your website checks setup it&#8217;s useful to summarise them in a viewport so that you can easily see all activity in one place. To do this you need to set up a dashboard view for your new checks.</p>
<p>6. Go to CONFIGURATION – KEYWORDS</p>
<p>7. Click on ACTIONS &gt; CREATE NEW KEYWORD <a href="http://labs.opsview.com/wp-content/uploads/2012/01/new-keyword.png"><img class="aligncenter size-full wp-image-2157" title="new-keyword" src="http://labs.opsview.com/wp-content/uploads/2012/01/new-keyword.png" alt="new keyword" width="400" height="308" /></a></p>
<p>8. Select your newly created service checks.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/new-check.png"><img class="aligncenter size-full wp-image-2186" title="summarise website monitoring checks" src="http://labs.opsview.com/wp-content/uploads/2012/01/new-check.png" alt="summarise website monitoring checks" width="600" height="122" /></a></p>
<p>9. After clicking SUBMIT then RELOADING your Opsview configuration, you will see your website monitors appear on your dashboard.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/dashboard1.png"><img class="aligncenter size-full wp-image-2190" title="website monitoring dashboard" src="http://labs.opsview.com/wp-content/uploads/2012/01/dashboard1.png" alt="website monitoring dashboard" width="600" height="117" /></a></p>
<p>10. Click through to <strong>Website Health Checks</strong>. You can change the view to suit your requirements, here we have selected a ‘performance’ view, allowing us to instantly see the key metrics.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/performance-view1.png"><img class="aligncenter size-full wp-image-2175" title="website monitoring performance view" src="http://labs.opsview.com/wp-content/uploads/2012/01/performance-view1.png" alt="website monitoring performance view" width="600" height="192" /></a></p>
<p>Here is the same dashboard, but with a ‘Group by Service’ overview.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2012/01/service-overview.png"><img class="aligncenter size-full wp-image-2194" title="website monitoring service overview" src="http://labs.opsview.com/wp-content/uploads/2012/01/service-overview.png" alt="website monitoring service overview" width="600" height="87" /></a></p>
<h2>Website Monitoring Checks</h2>
<p>There are lots of types of checks you can carry out to ensure the health of your website and webservers – here are a few you can try:</p>
<h3>Web Server Check</h3>
<p>This is checking the web server running on &#8216;ov-dev-61&#8242;. It performs a simple TCP connection on port 80 and ensures a valid HTTP response is received.</p>
<pre>nagios@ov-trn-s1:~$ check_http –H ov-dev-61 –w 5 –c 10
HTTP OK HTTP/1.1 200 OK – 316 bytes in 0.003 second response time | time=0.003423s;5.000000;10.000000;0.000000 size=316B;;;0</pre>
<p>If the webserver stops working, then we see this response:</p>
<pre>nagios@ov-trn-s1:~$ check_http –H ov-dev-61 –w 5 –c 10
Connection refused
HTTP CRITICAL – Unable to open TCP socket</pre>
<h3>Content Serving Check</h3>
<p>This is a really useful check, as it allows you to monitor both potential defacements to your website as well as performing an &#8216;application&#8217; level check, i.e. ensuring that your webserver is serving meaningful content.</p>
<p>A check of Apache processes sometimes isn&#8217;t enough; Apache may be running, but it may not be serving the content you expect. This check addresses that.</p>
<pre>nagios@ov-trn-s1:~$ check_http –H www.opsview.com –u /company –s “Based in Reading –w –c 10
HTTP OK HTTP/1.1 200 OK – 41439 bytes in 0.886 second response time | time=0.885759s;5.000000;10.000000;0.000000 size=41439B;;;0</pre>
<p>Here we see the result of the check when a string (&#8216;foobar&#8217;) is required, but not found.</p>
<pre>nagios@ov-trn-s1:~$ check_http –H www.opsview.com –u /company –s “foobar” –w 5 –c 10
HTTP CRITICAL: HTTP/1.1 200 OK – string ‘foobar’ not found on ‘http://www.opsview.com:80/company’</pre>
<h3>Retrieve In-Time Check</h3>
<p>Here’s an excellent monitor to ensure that your website is retrieved within an acceptable time-frame:</p>
<pre>[nagios@vader ~j$ check_http –w 2 –c 4 –H www.opsview.com –f follow
HTTP OK: HTTP/1.1 200 OK – 49250 bytes in 0.777 second response time | time=0.777155s;2.000000;4.000000;0.000000 size=49250B;;;0</pre>
<hr />If there are any other types of website monitoring checks that you use regularly or want to use let us know!</p>
<p>Find out how to configure Opsview to pro-actively respond to website attacks at our <a href="http://www.opsview.com/learn/opsview-webinars">Webinar on 2nd February</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2012/01/monitor-your-website-in-10-easy-steps-with-opsview/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Monitoring Apache Solr with Opsview</title>
		<link>http://labs.opsview.com/2011/12/monitoring-apache-solr-with-opsview/</link>
		<comments>http://labs.opsview.com/2011/12/monitoring-apache-solr-with-opsview/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 14:58:29 +0000</pubDate>
		<dc:creator>rbramley</dc:creator>
				<category><![CDATA[Configuration]]></category>
		<category><![CDATA[DevOps]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Grails]]></category>
		<category><![CDATA[Hudson]]></category>
		<category><![CDATA[JMX]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[agentless checks]]></category>
		<category><![CDATA[apache solr]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[nrpe]]></category>
		<category><![CDATA[Opsview]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=2012</guid>
		<description><![CDATA[
			
				
			
		Apache Solr is an open source enterprise search service from the Lucene project. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat.
Like any service or component in your architecture, you’ll want to monitor it to ensure that it’s available and gather performance data to [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F12%2Fmonitoring-apache-solr-with-opsview%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F12%2Fmonitoring-apache-solr-with-opsview%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/solr.jpg"><img class="alignleft size-full wp-image-2027" title="solr" src="http://labs.opsview.com/wp-content/uploads/2011/12/solr.jpg" alt="monitoring Apache Solr" width="150" height="83" /></a><a title="Apache Solr" href="http://lucene.apache.org/solr/">Apache Solr</a> is an open source enterprise search service from the Lucene project. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat.</p>
<p>Like any service or component in your architecture, you’ll want to monitor it to ensure that it’s available and gather performance data to help with tuning.</p>
<p>In this post, we’ll look at how we can monitor Solr, what performance metrics we might want to gather and how we can easily achieve this with Opsview.</p>
<p><span id="more-2012"></span></p>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h2>Requirements</h2>
<ul>
<li>Installed version of Opsview <a title="Download Opsview" href="http://www.opsview.com/downloads">[download]</a></li>
<li>Apache Solr Custom Plugin <a title="Download Apache Solr Custom Plugin" href="https://github.com/rbramley/Opsview-solr-checks">[download]</a></li>
</ul>
</div>
<p><br /></p>
<h2>A check list for service checks</h2>
<p>Solr is built on Lucene so follows the same layout, an index contains documents that are comprised of fields. As part of the search service value add over Lucene, Solr provides a number of useful ways of obtaining health status / monitoring metrics:</p>
<ol>
<li>Health-check status using the <em>/admin/ping</em> handler</li>
<li>The admin statistics page <em>/admin/stats.jsp</em> (XML styled with XSL)</li>
<li><a href="http://wiki.apache.org/solr/SolrJmx">JMX MBeans</a></li>
</ol>
<p>The list of applicable checks could be defined by whether it is a health check or a data gathering check – but this would lead to a lot of overlap. Instead the list is divided into the checks that can be performed remotely (without an installed agent on the server) and those that are best performed locally to the Solr server.</p>
<h2>Remote (agent-less) checks</h2>
<p>What should we look for over the network?</p>
<p>Firstly we can have a host-level check which may perform a network level ping. Next we can check TCP connectivity to the servlet container port and then make an HTTP GET request to the Solr ‘front page’ and check for a known string (e.g. Welcome to Solr).</p>
<p>Now we’ve made it up to the application layer so can start to perform Solr specific checks.</p>
<p>Items to monitor may include (delete as applicable):</p>
<ol>
<li>Ping status</li>
<li>Number of docs</li>
<li>Number of queries / queries per second</li>
<li>Average response time</li>
<li>Number of updates</li>
<li>Cache hit ratios</li>
<li>Replication status</li>
<li>Synthetic queries</li>
</ol>
<h2>Agent-based checks</h2>
<p>Installing an <a title="Opsview Agents" href="http://www.opsview.com/downloads/opsview-agents">Opsview agent</a> on the Solr server means we can run additional checks over NRPE (Nagios Remote Plugin Executor). This could be operating system level checks such as memory/disk utilisation or CPU load, or the following:</p>
<ol>
<li>Java servlet container process is running</li>
<li>JMX checks e.g. heap memory or custom MBeans</li>
<li>File age</li>
<li>Log parsing for exceptions</li>
</ol>
<p>The Solr wiki describes how to configure JMX support: <a title="Configure JMX support" href="http://wiki.apache.org/solr/SolrJmx">http://wiki.apache.org/solr/SolrJmx.</a></p>
<h2>Opsview configuration</h2>
<p>For the rest of this article you&#8217;ll need to have <a title="Download Opsview" href="http://www.opsview.com/downloads">Opsview</a> installed (or the <a title="Opsview VMWare Appliance" href="http://www.opsview.com/downloads/opsview-3-vmware-virtual-appliance">Opsview VMWare appliance</a>) and have completed the <a title="Opsview Quick Start Guide" href="http://docs.opsview.com/doku.php?id=opsview3.14:quickstart">Quick Start.</a></p>
<h2>Solr-specific Plugin</h2>
<p>Install the Solr plugin at <a title="Opsview Solr Plugin" href="https://github.com/rbramley/Opsview-solr-checks">https://github.com/rbramley/Opsview-solr-checks</a> into /usr/local/nagios/libexec/</p>
<p>The check_solr plugin was developed using Perl, so that it could be contributed back to Opsview. It requires the CPAN XML::XPath module (sudo cpan -i XML::XPath).</p>
<p>The plugin includes usage instructions, check_solr -h which can also be viewed in Opsview by selecting the ‘Show Plugin Help‘ link beneath the Plugin drop down (see Figure 1). The -u option can be used to specify the URL path for multi-core set-ups.</p>
<h2>Service check setup</h2>
<p>Figure 1 gives an example of a service check configuration.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/figure_1_with_help.png"><img class="aligncenter size-full wp-image-2013" title="figure_1_with_help" src="http://labs.opsview.com/wp-content/uploads/2011/12/figure_1_with_help.png" alt="Opsview service check configuration." width="542" height="699" /></a></p>
<p>Figure 2 shows the <em>agentless</em> service check group with plugins and their arguments.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/solr-agentless-monitoring1.png"><img class="aligncenter size-full wp-image-2015" title="solr-agentless-monitoring" src="http://labs.opsview.com/wp-content/uploads/2011/12/solr-agentless-monitoring1.png" alt="solr agentless monitoring" width="500" height="252" /></a></p>
<h2>Host configuration</h2>
<p>Figure 3 shows a simplistic host setup with a ping check.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/set_up_host1.png"><img class="aligncenter size-full wp-image-2017" title="set_up_host" src="http://labs.opsview.com/wp-content/uploads/2011/12/set_up_host1.png" alt="set up host" width="500" height="596" /></a></p>
<p>Figure 4 is an extract from the <strong>Monitors</strong> tab, where we select the checks we want performed for the current host.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/monitors.png"><img class="aligncenter size-full wp-image-2018" title="monitors" src="http://labs.opsview.com/wp-content/uploads/2011/12/monitors.png" alt="monitors" width="288" height="226" /></a></p>
<h2>Viewing output</h2>
<p>The check results shown in Figure 5 are visible by navigating through the host group hierarchy.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/viewing-output.png"><img class="aligncenter size-full wp-image-2019" title="viewing-output" src="http://labs.opsview.com/wp-content/uploads/2011/12/viewing-output.png" alt="" width="500" height="192" /></a></p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/viewing-output.png"></a>If you click on the graph icon of <em>Solr Cache Hit Ratios</em> this will drill down onto the graph shown in Figure 6.</p>
<p>Clicking on the graph icon for <em>Solr Avg Response Time – standard</em> will take you to the graphs in Figure 7.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/cache_hit_ratios.png"><img class="aligncenter size-full wp-image-2021" title="cache_hit_ratios" src="http://labs.opsview.com/wp-content/uploads/2011/12/cache_hit_ratios.png" alt="cache hit ratios" width="500" height="209" /></a><a href="http://labs.opsview.com/wp-content/uploads/2011/12/avg_req_time.png"><img class="aligncenter size-full wp-image-2022" title="avg_req_time" src="http://labs.opsview.com/wp-content/uploads/2011/12/avg_req_time.png" alt="average request time" width="500" height="449" /></a></p>
<p>If you shutdown Solr, then the check results will start to turn critical and show in red as per Figure 8.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/post-shutdown-alert.png"><img class="aligncenter size-full wp-image-2023" title="post-shutdown-alert" src="http://labs.opsview.com/wp-content/uploads/2011/12/post-shutdown-alert.png" alt="post shoutdown alert" width="500" height="197" /></a></p>
<h2>Alternatives</h2>
<p>There are a few other plugins available for monitoring Solr from Opsview, depending on your needs:</p>
<ul>
<li><a href="http://code.google.com/p/nagios-plugins-shamil/">http://code.google.com/p/nagios-plugins-shamil</a> – provides ping, replication status and num docs</li>
<li><a href="http://code.google.com/p/solr-nagios-check">http://code.google.com/p/solr-nagios-check</a> – provides QPS, response time and num docs</li>
</ul>
<p>Also, chapter 8 of the recently published <a href="http://www.amazon.co.uk/gp/product/1849516065/ref=as_li_ss_tl?ie=UTF8&amp;tag=leanjavaengi-21&amp;linkCode=as2&amp;camp=1634&amp;creative=19450&amp;creativeASIN=1849516065">Apache Solr 3 Enterprise Search Server</a><img src="http://www.assoc-amazon.co.uk/e/ir?t=leanjavaengi-21&amp;l=as2&amp;o=2&amp;a=1849516065" border="0" alt="" width="1" height="1" /> book includes a section on Monitoring Solr Performance.</p>
<h2>Summary</h2>
<p>Using <em>check_solr</em> in conjunction with <a title="Opsview Open Source Monitoring" href="http://www.opsview.com">Opsview</a> allows you to  ensure that your Solr server is available and provides you with metrics  that can help you tune your Solr configuration.</p>
<p>This can be complemented  with additional agent-based operating system and JMX checks to give you  a full picture view.</p>
<div>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Robin Bramley is a hands-on Technical Manager / Lead Architect at an Open Source software &amp; services company who has spent the majority of the last decade working with Java, mobile &amp; Open Source across sectors including Financial Services &amp; High Growth / start-ups. You can view Robin&#8217;s personal blog at <a href="http://leanjavaengineering.wordpress.com/">www.leanjavaengineering.com</a></p>
<h4>Legal Disclaimer</h4>
<p>This blog post is contributed by a member of the Opsview community.  The Opsview project and Opsera Ltd accept no responsibility for the  accuracy of its content and are not liable for any direct or indirect  damages caused by its use.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/12/monitoring-apache-solr-with-opsview/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>5 Steps to Monitor your Network with Opsview Parenting</title>
		<link>http://labs.opsview.com/2011/12/monitor-your-network-with-opsview-parenting/</link>
		<comments>http://labs.opsview.com/2011/12/monitor-your-network-with-opsview-parenting/#comments</comments>
		<pubDate>Mon, 05 Dec 2011 14:07:27 +0000</pubDate>
		<dc:creator>brian.king</dc:creator>
				<category><![CDATA[Configuration]]></category>
		<category><![CDATA[DevOps]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[layer 3]]></category>
		<category><![CDATA[network monitoring]]></category>
		<category><![CDATA[Traceroute]]></category>
		<category><![CDATA[VLAN]]></category>
		<category><![CDATA[VPN]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1876</guid>
		<description><![CDATA[
			
				
			
		Parenting is one of the many powerful network monitoring tools that can be deployed in Opsview. However parenting is quite often misunderstood or deployed incorrectly causing the intended goal (less notifications when detecting network failures) to be missed and flooding mailboxes with undesirable e-mail alerts. This blog post outlines the steps involved in setting up [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F12%2Fmonitor-your-network-with-opsview-parenting%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F12%2Fmonitor-your-network-with-opsview-parenting%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/network.png"><img class="alignleft size-thumbnail wp-image-1892" title="Opsview network monitor" src="http://labs.opsview.com/wp-content/uploads/2011/12/network-150x150.png" alt="Monitor your network with Opsview" width="150" height="150" /></a>Parenting is one of the many powerful <a href="http://www.opsview.com/learn/network-monitoring">network monitoring</a> tools that can be deployed in Opsview. However parenting is quite often misunderstood or deployed incorrectly causing the intended goal <strong>(less notifications when detecting network failures)</strong> to be missed and flooding mailboxes with undesirable e-mail alerts. This blog post outlines the steps involved in setting up Basic Parenting in Opsview.</p>
<p><span id="more-1876"></span></p>
<p>Before successfully deploying parenting it&#8217;s important to understand how it works:</p>
<ol>
<li>Parenting is based on host states</li>
<li>It determines if a host is DOWN or UNREACHABLE</li>
<li>When assigning multiple parents all parents need to be DOWN for the host to be UNREACHABLE</li>
<li>Parenting can not be circular</li>
</ol>
<div>
<p>Please note that UNREACHABLE notifications are on by default so check the settings in your notification profile.</p>
<h2>Determining a host state</h2>
<p>One additional point that has to be made is that host states are  determined using layer 3 IP information (for instance by running a  check_icmp –H 1.2.3.4) as described by the OSI model.</p>
<p>This is important as modern network technologies like VLANs, VPNs,  HSRP and VRRP are used to make a network more robust and fault-tolerant,  unfortunately they also “hide” information needed to setup parenting.</p>
<p>Later on we will discuss these technologies and their impact on parenting.</p>
<h2>Circular parent-child relationships</h2>
<p>A circular parent-child relationship is formed when we define hostA  to have deviceX as it’s parent and define deviceX to have hostA as its  parent.</p>
<p>The parenting logic will detect this and generate an error as a  circular parent-child relationship should not exist (actually they do  exist and we will discuss them in another blog post).</p>
<h2>Types of parenting</h2>
<p>In this article we will be looking at Basic Parenting, focusing on  layer 3 hops and Single Point Of Failures (SPOFs) and how we can use  them for parenting.</p>
<p>Note that while Basic Parenting is relatively easy to implement and  will greatly help in detecting network failures, it will not be as  precise as when you implement Advanced Parenting (which will be covered  in a later blog post).</p>
<h2>Network technologies</h2>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/12/Jigsaw_piece_150.png"><img class="alignright" title="Jigsaw_piece_150" src="http://labs.opsview.com/wp-content/uploads/2011/12/Jigsaw_piece_150-150x150.png" alt="Network Technologies" width="150" height="150" /></a> To be able to setup parenting we need to understand some of the network technologies mentioned earlier.</p>
<p>Below is a simple explanation of the technologies and the parts they  play in parenting, plenty of more detailed information on these can be  found on the internet.</p>
<p>We also look at one of the tools at our disposal to help determine the hops we cross to reach any given host in our network.</p>
<h2>Traceroute</h2>
<p>Traceroute can be used to determine the (layer3) hops between any two hosts on a network.</p>
<p>When using traceroute it’s best to run it from the Opsview  slave/master (this will give a consistent result). When you use slave  clusters you should consider how you cluster them (globally or locally)  so all the slaves in the cluster have the same path to a given host.</p>
<p>Later on we will discuss how to use the results from a traceroute to create a host-parent table.</p>
<h2>VLANs</h2>
<p>A VLAN (or Virtual LAN) is used to create logical ip-subnets spanning various network devices.<strong> </strong> So if Host A and Host B are in the same VLAN, any physical hop between  them becomes transparent and a traceroute between them will show only  one hop (the destination).</p>
<p>Again this is a simplistic representation of VLAN’s and should hold  true in most situations (for instance there are extensions to VLANs like  private VLANs etc etc).</p>
<h2>VPN’s</h2>
<p>VPN’s are commonly used to connect remote servers to a local network  (over the Internet for example). However from a network perspective the  host is no longer at a distant location but directly connected to the  network (so any hops on the Internet become transparent).</p>
<p>VPN’s will be discussed in the Advanced Parenting blog post so we can go into this in more detail.</p>
<h2>HSRP/VRRP</h2>
<p>HSRP and VRRP are redundancy technologies used to make a network more  robust on layer 3. The most common deployment is used to make a network  gateway redundant.</p>
<p>In those cases the IP address you configure on your host as the  gateway is shared between two multi-layer switches (or routers) so when  one fails the other takes over and traffic keeps flowing in your  network.</p>
<p>(Note that generally a gateway is only used when traffic is destined for another network).</p>
<p>For example traffic from 192.168.1.200/24 to 192.168.1.100/24 will  stay on the same network (with a VLAN it might cross multiple devices  but these are transparent).</p>
<p>Traffic from 192.168.1.200/24 to 192.168.21.100/24 will cross the  gateway as the source and destination are on two separate networks.</p>
<p>Below is an example of a modern network.</p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/12/complex-network11.png"><img class="aligncenter" title="Example of a modern network" src="http://labs.opsview.com/wp-content/uploads/2011/12/complex-network11.png" alt="Monitoring a modern network" width="500" height="347" /></a></p>
<p>In this example we have host A and host C which are connected only  once to our network and host B which is multi-homed (for example running  bonding or nic-teaming).</p>
<p>All the switches (except switchE) and the core switches are  cross-connected to provide redundancy and fault-tolerance in our network  (full-meshed using VLAN’s and HSRP/VRRP).</p>
<p>If all our hosts reside on the same ip-subnet (or VLAN) our network  will be completely transparent and our traceroute will show only one hop  (the destination host).</p>
<pre>opsview-slave nagios $ traceroute hostA
traceroute to hostA (192.168.1.2), 30 hops max, 40 byte packets
 1  hostA (192.168.1.2)  1.594 ms  0.590 ms  0.362 ms</pre>
<p>Assuming our Opsview host is on a different ip-subnet our network will have one (or more) additional hops in our traceroute.</p>
<pre>opsview-slave nagios $ traceroute hostA
traceroute to hostA (192.168.1.2), 30 hops max, 40 byte packets
 1  coreA (192.168.1.1)  0.747 ms  0.582 ms  0.518 ms
 2  hostA (192.168.1.2)  1.594 ms  0.590 ms  0.362 ms</pre>
<h2>Setting up Basic Parenting</h2>
<p>When setting up Basic Parenting we first need to determine our SPOFs and hops.</p>
<p>Note that a SPOF is a single point of failure so any host/device  which has a single connection to our network has to be considered to  have a SPOF.</p>
<p>Also note that Host B and Opsview are redundantly connected to our  network and don’t have a SPOF (in Advanced Parenting we will be covering  multi-homed hosts).</p>
<h2>Step 1:</h2>
<p><strong>Review your network.</strong></p>
<p>First off, review your network (if needed ask your network admins to  help you out) and make traceroutes to each host so we can create a  host-parent table.</p>
<p>After the review we have found that Core (consisting of a HSRP  ip-gateway running on either Core A or Core B) is a hop and switchC,  switchD and switchE are SPOFs.</p>
<p>Note that Core is considered a HOP and not a SPOF (although it looks  like a single point of failure) this is because it uses HSRP or VRRP for  redundancy over two nodes (Core A and Core B).</p>
<p>Using this information we can create our host-parent table.</p>
<p>In this table we use various pieces of information to determine a given hosts parent:</p>
<ol>
<li>Looking at Host A we know it is connected to SPOF switchC which  makes switchC the parent of Host A (Id. 1)</li>
<li>From our traceroute we  know we only traverse Core when going from Opsview to Host A making  Opsview the parent of Core (Id. 4)</li>
<li>From our network admin we know  switchC is connected to Core (Id. 5)</li>
</ol>
<p>This gives us an entire path from Opsview through Core through switchC to Host A.</p>
<p>Host-parent table example for our network.</p>
<table border="0" width="500">
<tbody>
<tr>
<td width="63">Id</td>
<td width="120">Host</td>
<td width="131">Parent</td>
<td width="168">Note</td>
</tr>
<tr>
<td>1</td>
<td>Host A</td>
<td>switchC</td>
<td>See id 5</td>
</tr>
<tr>
<td>2</td>
<td>Host B</td>
<td>Core</td>
<td>See id 4</td>
</tr>
<tr>
<td>3</td>
<td>Host C</td>
<td>switchE</td>
<td>See id 7</td>
</tr>
<tr>
<td>4</td>
<td>Core</td>
<td>Opsview</td>
<td></td>
</tr>
<tr>
<td>5</td>
<td>switchC</td>
<td>Core</td>
<td>See id 4</td>
</tr>
<tr>
<td>6</td>
<td>switchD</td>
<td>Core</td>
<td>See id 4</td>
</tr>
<tr>
<td>7</td>
<td>switchE</td>
<td>Core</td>
<td>See id 6</td>
</tr>
</tbody>
</table>
<p>Please note that Host B is connected to switchC and switchD (which we  will be monitoring as they are SPOFs) and so it is possible to  configure Host B with two parents (switchC and switchD) instead of using  Core.</p>
<p>In Advanced parenting we will be looking at multiple parents, for now we configure Host B as having just the Core as parent.</p>
<p>Here is our network but now with only our SPOFs and hops and all transparent devices removed.</p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/12/parenting-SPOF-HOP.png"><img class="aligncenter" title="parenting SPOF HOP" src="http://labs.opsview.com/wp-content/uploads/2011/12/parenting-SPOF-HOP.png" alt="Parenting SPOF HOP" width="500" height="347" /></a></p>
<h2>Step 2:</h2>
<p><strong>Create your SPOFs and Hops in Opsview </strong></p>
<p>(ask your network admin for the host-addresses).</p>
<p>Start by adding the devices which have Opsview as it’s parent (Core  in our case) then add the devices which have Core as their parent.</p>
<p>This way you can immediately configure the devices’ parent based on our host-parent table.</p>
<p>Make sure you assign at least one service-check to the devices.</p>
<h2>Step 3:</h2>
<p><strong>Edit your hosts and add the parent based on the host-parent table.</strong></p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/12/edit-host-parent.png"><img class="aligncenter" title="edit host parent" src="http://labs.opsview.com/wp-content/uploads/2011/12/edit-host-parent.png" alt="Edit host parent in Opsview" width="500" height="250" /></a></p>
<h2>Step 4:</h2>
<p><strong>Check your notification settings for DOWN and UNREACHABLE notifications.</strong></p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/12/notification-profile-DOWN-UNREACHABLE.png"><img class="aligncenter" title="notification profile DOWN UNREACHABLE" src="http://labs.opsview.com/wp-content/uploads/2011/12/notification-profile-DOWN-UNREACHABLE.png" alt="notification profile down unreachable" width="500" height="127" /></a></p>
<h2>Step 5:</h2>
<p><strong>Reload your Opsview. </strong></p>
<p>If you accidently created a circular parent-child relationship a reload will fail with the following error:.</p>
<pre>Checking for circular paths between hosts...

Error: The host 'CoreA' is part of a circular parent/child chain!
Error: The host 'HostA' is part of a circular parent/child chain!
Checking for circular host and service dependencies...
Checking global event handlers...
Checking obsessive compulsive processor commands...
Checking misc settings...</pre>
<p>If this happens review your configuration and verify it against your host-parent table.</p>
<p>Look out for the post on Advanced Parenting where we will be building further on our basic parenting setup, including:</p>
</div>
<ol>
<li>Slave clusters and parenting</li>
<li>Multi-homed hosts and multiple parents</li>
<li>Circular parent child relationships</li>
<li>Common pitfalls for parenting</li>
<li>VPNs</li>
</ol>
<div>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Alan Wijntje is responsible for maintaining and improving all forms of monitoring at <a href="http://www.ziggo.nl/">Ziggo</a>,  one of the leading Managed Service Providers in the Netherlands. An Opsview expert and open source enthusiast, Alan enjoys finding,  designing and implementing new and innovative ways of monitoring complex  systems and applications.</p>
<h4>Legal Disclaimer</h4>
<p>This blog post is contributed by a member of the Opsview community.  The Opsview project and Opsera Ltd accept no responsibility for the  accuracy of its content and are not liable for any direct or indirect  damages caused by its use.</p>
</div>
</div>
<p>Download <a href="http://www.opsview.com/downloads/download-opsview-community">Opsview Community edition</a> or try out a demo of <a href="http://www.opsview.com/products/online-demo">Opsview Enterprise edition.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/12/monitor-your-network-with-opsview-parenting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Workaround for “ODW_STATUS WARNING – No update since” Notification in Opsview</title>
		<link>http://labs.opsview.com/2011/10/workaround-for-%e2%80%9codw_status-warning-%e2%80%93-no-update-since%e2%80%9d-notification-in-opsview/</link>
		<comments>http://labs.opsview.com/2011/10/workaround-for-%e2%80%9codw_status-warning-%e2%80%93-no-update-since%e2%80%9d-notification-in-opsview/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 12:37:26 +0000</pubDate>
		<dc:creator>brian.king</dc:creator>
				<category><![CDATA[DevOps]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cron]]></category>
		<category><![CDATA[distributed monitoring]]></category>
		<category><![CDATA[opsview community]]></category>
		<category><![CDATA[status warning]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1535</guid>
		<description><![CDATA[
			
				
			
		This week&#8217;s post is a technical workaround, from Opsview Community edition user Matthew White, for anyone experiencing an ODW_Status Warning in Opsview due to heavy server load.
Over to you Matthew&#8230;

For a while I have been seeing a daily ODW_STATUS_WARNING about no updates since 03:59:59 on my master opsview server. I was 90% sure this was [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fworkaround-for-%25e2%2580%259codw_status-warning-%25e2%2580%2593-no-update-since%25e2%2580%259d-notification-in-opsview%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fworkaround-for-%25e2%2580%259codw_status-warning-%25e2%2580%2593-no-update-since%25e2%2580%259d-notification-in-opsview%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/10/warning-sign11.png"><img class="alignleft size-medium wp-image-1548" title="warning-sign1" src="http://labs.opsview.com/wp-content/uploads/2011/10/warning-sign11-300x250.png" alt="opsview community" width="150" height="125" /></a>This week&#8217;s post is a technical workaround, from <a href="http://www.opsview.com/community">Opsview Community</a> edition user Matthew White, for anyone experiencing an ODW_Status Warning in Opsview due to heavy server load.</p>
<p>Over to you Matthew&#8230;</p>
<p><span id="more-1535"></span></p>
<p>For a while I have been seeing a daily ODW_STATUS_WARNING about no updates since 03:59:59 on my master opsview server. I was 90% sure this was due to the load that I put on the server (load average sits around 6 and goes up to 13 at certain times of the day) but still got bored of running cleanup_import and then import_runtime -i 1.</p>
<p>I started off by manually clearing out all but 1 week of data from the runtime database (this is run as part of opsview_master_housekeep for various tables) and this didn&#8217;t resolve the issue. In the end I modified my cron table so that the rc.opsview cron_daily task runs 30 minutes later (at 41 minutes past the hour instead of 11 minutes past). Since changing that I seem to have had no further re-occurrences of the No update prompt.</p>
<p>I am aware that each time I update Opsview I am going to have to make this change until I manage to move the databases to their own host and rebuild the master server onto new hardware. But its a workaround for now!</p>
<p>For reference the crontab now looks like:</p>
<pre># OPSVIEW-START
# Do not remove comment above. Everything between OPSVIEW-START and OPSVIEW-END
# will be automatically installed as part of an Opsview install/upgrade
0,5,10,15,20,25,30,35,40,45,50,55 * * * * /usr/local/nagios/bin/mrtg_genstats.sh &gt; /dev/null 2&gt;&amp;1
41 3 * * * /usr/local/nagios/bin/rc.opsview cron_daily &gt; /dev/null 2&gt;&amp;1
22 2,6,10,14,18,22 * * * . /usr/local/nagios/bin/profile &amp;&amp; /usr/local/nagios/bin/opsview_cronjobs 4hourly &gt; /dev/null 2&gt;&amp;1
0,5,10,15,20,25,30,35,40,45,50,55 * * * * /usr/local/nagios/bin/call_nmis nmis.pl type=collect mthread=true &gt; /dev/null 2&gt;&amp;1
34 0,4,8,12,16,20 * * * /usr/local/nagios/bin/call_nmis nmis.pl type=update mthread=true &gt; /dev/null 2&gt;&amp;1
4 * * * * . /usr/local/nagios/bin/profile &amp;&amp; /usr/local/nagios/bin/import_runtime -q
# NMIS reports
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day health
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day top10
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day outage
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day response
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day avail
0 0 * * * /usr/local/nagios/bin/call_nmis run-reports.sh day port
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week health
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week top10
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week outage
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week response
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week avail
0 0 * * 0 /usr/local/nagios/bin/call_nmis run-reports.sh week port
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month health
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month top10
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month outage
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month response
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month avail
0 0 1 * * /usr/local/nagios/bin/call_nmis run-reports.sh month port
# OPSVIEW-END</pre>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Matthew has been working in IT for 5 years after graduating from the University of Durham with a Degree in Software Engineering. He currently works for an IT outsourcing company in London, working with a range of including Microsoft, VMWare, Citrix, Juniper, Cisco, CheckPoint, RSA, Lumension.</p>
<p>Legal Disclaimer</p>
<p>This blog post is contributed by a member of the Opsview community. The Opsview project and Opsera Ltd. accept no responsibility for the accuracy of its content and are not liable for any direct or indirect damages caused by its use.</p>
</div>
<p>Do you have technical experience of using Opsview that you would like to share with Opsview&#8217;s community? We&#8217;d love to hear from you! Contact us at <a href="mailto:marketing@opsview.com">marketing@opsview.com</a> or <a href="http://www.twitter.com/opsview">@opsview</a> on Twitter.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/10/workaround-for-%e2%80%9codw_status-warning-%e2%80%93-no-update-since%e2%80%9d-notification-in-opsview/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Monitoring SNMP Traps from ESX hosts in Opsview (Part 2)</title>
		<link>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-2/</link>
		<comments>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-2/#comments</comments>
		<pubDate>Wed, 05 Oct 2011 13:09:11 +0000</pubDate>
		<dc:creator>brian.king</dc:creator>
				<category><![CDATA[ESX]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[SNMP]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[Unix / Linux]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[debian]]></category>
		<category><![CDATA[ESX monitoring]]></category>
		<category><![CDATA[ESX4]]></category>
		<category><![CDATA[SNMP traps]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1209</guid>
		<description><![CDATA[
			
				
			
		So you followed the steps in the previous post about enabling SNMP traps on ESX4. Now you probably want to pick those up by something useful. Opsview can be configured to handle the traps quite easily. Just follow the steps below and your server will be listening to those pesky traps. After that, you’ll need [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fmonitoring-snmp-traps-from-esx-hosts-in-opsview-part-2%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fmonitoring-snmp-traps-from-esx-hosts-in-opsview-part-2%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/little_nipper_mouse_trap.jpg"><img class="alignleft size-full wp-image-1240" title="little_nipper_mouse_trap" src="http://labs.opsview.com/wp-content/uploads/2011/09/little_nipper_mouse_trap.jpg" alt="monitoring SNMP | Opsview" width="164" height="103" /></a>So you followed the steps in the <a href="http://labs.opsview.com/2011/09/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-1">previous post</a> about enabling SNMP traps on ESX4. Now you probably want to pick those up by something useful. <a href="http://www.opsview.com">Opsview</a> can be configured to handle the traps quite easily. Just follow the steps below and your server will be listening to those pesky traps. After that, you’ll need to write a couple of service check handlers in Opsview to make sense of the traps. More on that later. This post is just about picking them up.<span id="more-1209"></span></p>
<p>This was done on an Ubuntu 10.04 LTS server. The steps are probably the same on Debian systems.</p>
<p>1. Make sure you have snmpd installed. If you don’t, install it! Easy as pie.</p>
<pre>aptitude install snmpd</pre>
<p>2. Edit /etc/snmp/snmpd.conf and uncomment “master agentx”.</p>
<p>3. Edit /etc/default/snmpd (or /etc/snmp/snmptrapd.conf on newer systems):</p>
<pre>TRAPDRUN=yes
TRAPDOPTS='-t -m ALL -M /usr/share/snmp/mibs:/usr/local/nagios/snmp/load -p /var/run/snmptrapd.pid'
SNMPDOPTS='-u nagios -Lsd -Lf /dev/null -p/var/run/snmpd.pid'</pre>
<p>4. Edit /etc/snmp/snmptrapd.conf and add the following lines:</p>
<pre>traphandle default /usr/local/nagios/bin/snmptrap2nagios
disableAuthorization yes</pre>
<p>Please note that this will make the server listen to and handle any SNMP traps it receives, regardless of source.</p>
<p>5. Restart snmpd and snmptrapd:</p>
<pre>/etc/init.d/snmpd restart</pre>
<p>6. Edit the /etc/sudoers file to allow Opsview to restart snmpd and snmptrapd:</p>
<pre>nagios ALL=NOPASSWD:/usr/local/nagios/bin/snmpd reload</pre>
<p>7. Test the permissions:</p>
<pre>su - nagios
sudo /usr/local/nagios/bin/snmpd reload</pre>
<p>8. Exit back to the root user and restart opsview-web:</p>
<pre>/etc/init.d/opsview-web restart</pre>
<p>That’s all. Opsview should now be able to handle traps sent to it.</p>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Marcus Vejneke holds a bachelor’s degree in computer science and engineering and has worked in IT for almost 14 years. Marcus lives in Sweden and works as Sysadmin. Visit Marcus&#8217;s <a href="http://onlyblueatwork.wordpress.com/author/onlyblueatwork/">blog.</a></p>
<h3>Legal Disclaimer</h3>
<p>This blog post is contributed by a member of the Opsview community. The Opsview project and Opsera Ltd. accept no responsibility for the accuracy of its content and are not liable for any direct or indirect damages caused by its use.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Monitoring SNMP Traps from ESX hosts in Opsview (Part 1)</title>
		<link>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-1/</link>
		<comments>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-1/#comments</comments>
		<pubDate>Mon, 03 Oct 2011 09:46:11 +0000</pubDate>
		<dc:creator>brian.king</dc:creator>
				<category><![CDATA[ESX]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[SNMP]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[Unix / Linux]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[Unix]]></category>
		<category><![CDATA[virtua]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1188</guid>
		<description><![CDATA[
			
				
			
		This post outlines how to get SNMP traps from ESX hosts and monitor them in Opsview. The first part deals with configuring SNMP traps to get them working correctly with ESX hosts, part 2 tells you how to monitor them with Opsview.
The following steps worked on ESX 4.1. Depending on versions you may have different [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fmonitoring-snmp-traps-from-esx-hosts-in-opsview-part-1%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F10%2Fmonitoring-snmp-traps-from-esx-hosts-in-opsview-part-1%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/425_mouse_trap.jpg"><img class="alignleft size-full wp-image-1237" title="425_mouse_trap" src="http://labs.opsview.com/wp-content/uploads/2011/09/425_mouse_trap.jpg" alt="monitoring SNMP | Opsview" width="184" height="122" /></a>This post outlines how to get SNMP traps from ESX hosts and monitor them in <a href="http://www.opsview.com">Opsview</a>. The first part deals with configuring SNMP traps to get them working correctly with ESX hosts, part 2 tells you how to monitor them with Opsview.</p>
<p>The following steps worked on ESX 4.1. Depending on versions you may have different results. For simplicity, I used 10.0.0.1 as IP for my ESX host, and 10.0.0.99 for my SNMP trap handler.<span id="more-1188"></span></p>
<p>1. Download and install the <a title="Download vSphere CLI" href="http://ops.vu/mPebjb"> vSphere CLI</a>. Keep in mind that you need an account to access it. Check BugMeNot if you’re not in the mood for registering.</p>
<p>The vSphere CLI will give you a host of useful tools to control your ESX environment with without having to resport to SSH or console access.</p>
<p>2. Check if you already have an active SNMP agent on your host with the following command:</p>
<pre>vicfg-snmp --show --server 10.0.0.1</pre>
<p>3. If no traps are configured (why would you even be reading this if they were?). Add your SNMP target like this (By default, vicfg-snmp.pl is located in the C:\Program Files\VMware\VMware vSphere CLI\bin directory):</p>
<pre>vicfg-snmp.pl --server 10.0.0.1 --username root --password qwerty1234 -t 10.0.0.99@162/communitystring</pre>
<p>4. Enable the SNMP service:</p>
<pre>vicfg-snmp.pl --server 10.0.0.1 --username root --password qwerty1234 --enable</pre>
<p>5. Check that you have a working configuration by using the –show command like this:</p>
<pre>vicfg-snmp.pl --server 10.0.0.1 --username root --password qwerty1234 --show</pre>
<p>Your output should look something like this:</p>
<pre>Current SNMP agent settings: Enabled : 1 UDP port : 162 Communities : communitystring Notification targets : 10.0.0.99@162/communitystring</pre>
<p>6. If you’d like, you can send a test trap to your target to make sure you’re on the right path. If you’re just testing, you can send them to your own client PC. I use the freeware application <a title="SNMP Trap Watcher" href="http://ops.vu/oMYlZx">SNMP Trap Watcher</a> for this.</p>
<p>Sending the following command through the vSphere CLI will generate a Warm Start trap:</p>
<pre>vicfg-snmp.pl --server 10.0.0.1 --username root --password qwerty1234 --test</pre>
<p>You should receive a report in your trap watcher:</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/trap-watcher1.png"><img class="aligncenter size-full wp-image-1195" title="trap-watcher" src="http://labs.opsview.com/wp-content/uploads/2011/09/trap-watcher1.png" alt="" width="401" height="174" /></a></p>
<p>If you’re not getting anything, chances are the ESX firewall isn’t allowing SNMP traffic. I had to allow this using the vSphere Client (connect to the ESX server, not a vCenter host). Click the tab “Configuration”, and select “Security Profile” in the menu on your left. Click “Properties” and enable SNMP:</p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/remote-access.jpg"><img class="size-full wp-image-1190 aligncenter" title="remote-access" src="http://labs.opsview.com/wp-content/uploads/2011/09/remote-access.jpg" alt="" width="518" height="374" /></a></p>
<p>The outgoing port will be the one you configured when you added a trap handler in step 4.</p>
<p>That’s it. You have an ESX host sending SNMP traps properly. Now all you need to do is get your monitoring software to understand what it’s saying. We’ll cover that in the <a href="http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-2/">next post.</a></p>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Marcus Vejneke holds a bachelor’s degree in computer science and engineering and has worked in IT for almost 14 years. Marcus lives in Sweden and works as Sysadmin. Visit Marcus&#8217;s <a href="http://onlyblueatwork.wordpress.com/author/onlyblueatwork/">blog.</a></p>
<h3>Legal Disclaimer</h3>
<p>This blog post is contributed by a member of the Opsview community. The Opsview project and Opsera Ltd. accept no responsibility for the accuracy of its content and are not liable for any direct or indirect damages caused by its use.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/10/monitoring-snmp-traps-from-esx-hosts-in-opsview-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>10 Reasons Why You Can&#8217;t Afford to Cut Corners with IT Monitoring</title>
		<link>http://labs.opsview.com/2011/09/10-reasons-why-you-cant-afford-to-cut-corners-with-it-monitoring/</link>
		<comments>http://labs.opsview.com/2011/09/10-reasons-why-you-cant-afford-to-cut-corners-with-it-monitoring/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 11:10:58 +0000</pubDate>
		<dc:creator>brian.king</dc:creator>
				<category><![CDATA[Opsview]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[database monitoring]]></category>
		<category><![CDATA[IT monitoring]]></category>
		<category><![CDATA[IT performance]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[virtual servers]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1112</guid>
		<description><![CDATA[
			
				
			
		IT monitoring has become the most strategically important part of any enterprise process reliant on technology. Any network failure or server outage will impact an organisation’s ability to do business, and therefore, its bottom line.
Using the right tools to monitor your IT is vital not only to the continued health of your IT operations but [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F10-reasons-why-you-cant-afford-to-cut-corners-with-it-monitoring%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F10-reasons-why-you-cant-afford-to-cut-corners-with-it-monitoring%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://www.opsview.com/learn/whitepapers/importance-it-monitoring-assuring-key-business-services-availability"><img class="alignleft" title="IT monitoring | Opsview" src="http://www.opsview.com/sites/all/themes/opsview/images/opsviewApplianceIcon90px.png" alt="IT Monitoring | Opsview" width="90" height="82" />IT monitoring</a> has become the most strategically important part of any enterprise process reliant on technology. Any network failure or server outage will impact an organisation’s ability to do business, and therefore, its bottom line.</p>
<p>Using the right tools to monitor your IT is vital not only to the continued health of your IT operations but also to the overall performance of your business. For this reason your monitoring solution must be stable and reliable with a track record of deployment in large scale production environments.  This post outlines 10 reasons why choosing the right toolset is paramount to assuring overall business performance. <span id="more-1112"></span></p>
<h2>It’s not just IT that is affected by downtime</h2>
<p>So many business tools rely on IT, without a means of monitoring there’s no way to know the reason why the phones don’t work or management reports can’t be produced because a database is offline. And, considering its significance in managing and mitigating enterprise security and compliance risk, IT monitoring is a key requirement for knowing who has access to what in any modern enterprise.</p>
<h2><a href="http://labs.opsview.com/wp-content/uploads/2011/09/heartbeat.jpg"><img class="size-thumbnail wp-image-1151 alignright" title="IT monitoring tools | Opsview" src="http://labs.opsview.com/wp-content/uploads/2011/09/heartbeat-150x150.jpg" alt="IT monitoring | Opsview" width="108" height="108" /></a></h2>
<h2>Monitoring is the heartbeat of your business</h2>
<p>IT monitoring acts as your IT infrastructure’s nervous system. It senses operational issues and empowers your IT department to take not only remedial, but proactive action swiftly. IT monitoring is an organisation’s view on every aspect of its performance.</p>
<h2>Mixing tools can hamper business effectiveness</h2>
<p>Problems can start to occur when multiple monitoring systems are used to manage every layer of the IT infrastructure, including networks, servers and applications. Systemic complexity characterised by a mix of proprietary management tools and IT processes can slow reaction times and hamper overall monitoring effectiveness.  Only a single system capable of deep visibility of an IT infrastructure can provide an effective solution that will improve business performance rather than detract from it.</p>
<h2>Freeware systems aren’t scalable</h2>
<p>Free software is often designed as a testing ground for enterprise applications or, in many cases, only meant for use in a simple monitoring environment. For any growing organisation, a scalable network monitoring application is a must in order to handle any growth in the IT estate over time.</p>
<h2><a href="http://labs.opsview.com/wp-content/uploads/2011/09/online_shopping1.jpg"><img class="alignleft size-full wp-image-1160" title="online_shopping1" src="http://labs.opsview.com/wp-content/uploads/2011/09/online_shopping1.jpg" alt="IT Monitoring software | Opsview" width="150" height="157" /></a>It’s not just about internal customers</h2>
<p>In our increasingly interconnected, “always on” world, detailed IT performance monitoring can help to minimise the costs of a poor user experience by alerting you when any part of your back-office or virtual storefront is slow or down.   Web surfers, for example, have very little patience for badly performing websites. For every second a web store is down customers may be lost.</p>
<h2>Prevention is better than cure</h2>
<p>The idea behind monitoring an IT environment is “prevention is better than cure”. Businesses will operate far more effectively by spotting problems before they occur rather than being purely reactive. However when downtime does occur, you need the fastest route to addressing the issue possible. Opsview’s ‘<a href="http://www.opsview.com/products/enterprise-modules/service-desk-connector">Service Desk Connector’</a> module, for example, integrates monitoring with fault logging systems helping to streamline incident management.</p>
<h2>Monitoring is your ‘Insurance Policy’</h2>
<p>The value of monitoring enterprise IT systems is linked to their levels of availability, acting as a kind of insurance policy for a business’s bottom line. As the larger and more geographically distributed an enterprise becomes, so the strategic importance of monitoring and assuring performance increases in terms of delivering daily value by supporting ‘business as usual’.</p>
<h2>Monitoring tools must integrate with your current systems</h2>
<p>When <a href="http://www.opsview.com/community/compare-opsview">considering monitoring tools</a> it’s important to ensure the provider will be able to meet the levels of openness required for integrating with your own IT systems, while allowing the flexibility required to maximise the use of IT performance information at every level of the business. For corporate networks it is essential that the monitoring system has a scalable architecture in order to cover all systems and locations.</p>
<h2>IT infrastructure is no longer restricted to ‘on-premise’<a href="http://labs.opsview.com/wp-content/uploads/2011/09/cloud-computing_digitizor-300x225.jpg"><img class="size-full wp-image-1162 alignright" title="cloud-computing_digitizor-300x225" src="http://labs.opsview.com/wp-content/uploads/2011/09/cloud-computing_digitizor-300x225.jpg" alt="IT Monitoring solution | Opsview" width="180" height="135" /></a></h2>
<p>Due to the rise of ‘cloud-computing’, having a clear view of IT performance is all the more important, whether it’s of the server room next door or IT services hosted thousands of miles away through a third-party data centre or software provider. Only a system that can monitor on-premise, virtual and cloud-based servers can offer the management capabilities for an enterprise business to operate effectively.</p>
<h2>Staff productivity can be affected</h2>
<p>Considering how much pressure IT budgets are under nowadays, most enterprises don’t want to hire their own in-house software engineering staff to fix the source code of third-party applications. The ideal monitoring solution should include software maintenance to ensure that an organisation receives critical software fixes quickly and with minimum disruption business.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/09/10-reasons-why-you-cant-afford-to-cut-corners-with-it-monitoring/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>5 Easy Steps to Monitor Multi-Homed Hosts Using Opsview</title>
		<link>http://labs.opsview.com/2011/09/5-easy-steps-to-monitor-multi-homed-hosts-using-opsview/</link>
		<comments>http://labs.opsview.com/2011/09/5-easy-steps-to-monitor-multi-homed-hosts-using-opsview/#comments</comments>
		<pubDate>Mon, 12 Sep 2011 12:32:33 +0000</pubDate>
		<dc:creator>tcallway</dc:creator>
				<category><![CDATA[MSPs]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[SNMP]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[management network]]></category>
		<category><![CDATA[multi-homed monitoring]]></category>
		<category><![CDATA[network management]]></category>
		<category><![CDATA[network monitoring]]></category>
		<category><![CDATA[open source monitoring]]></category>
		<category><![CDATA[production network]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1055</guid>
		<description><![CDATA[
			
				
			
		Over the last couple of years we have seen an increase in port-density on server-hardware and currently the quad-nic (4 port network interface card) seems to be the standard. These cards allow for some great features like bonding (on *nix) or nic-teaming (on Windows) where multiple interfaces are bundled together or setup as fail-overs. It [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F5-easy-steps-to-monitor-multi-homed-hosts-using-opsview%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F5-easy-steps-to-monitor-multi-homed-hosts-using-opsview%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/Home.png"><img class="size-full wp-image-1082 alignleft" style="margin-bottom: 8px; margin-right: 10px;" title="Home" src="http://labs.opsview.com/wp-content/uploads/2011/09/Home.png" alt="" width="77" height="77" /></a>Over the last couple of years we have seen an increase in port-density on server-hardware and currently the quad-nic (4 port network interface card) seems to be the standard. These cards allow for some great features like bonding (on *nix) or nic-teaming (on Windows) where multiple interfaces are bundled together or setup as fail-overs. It also allows you to nicely split your networks into multiple segments like management and production with each network connected over dedicated NICs as shown below&#8230;</p>
<p style="text-align: center;"><span id="more-1055"></span> <a href="http://labs.opsview.com/wp-content/uploads/2011/09/multi_homing_network_ziggo.png"><img class="aligncenter size-full wp-image-1056" title="multi_homing_network_ziggo" src="http://labs.opsview.com/wp-content/uploads/2011/09/multi_homing_network_ziggo.png" alt="" width="500" /></a></p>
<p>However, this does pose the question which network should we use to monitor our hosts?</p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/multi_homed_image2-1.png"><img class="aligncenter size-full wp-image-1057" title="multi_homed_image2 (1)" src="http://labs.opsview.com/wp-content/uploads/2011/09/multi_homed_image2-1.png" alt="" width="500" /></a></p>
<p>As always we need to consider the users point of view so we need to monitor the services used by our customers using the production network (web-interfaces, SQL etc).</p>
<p>But we also need to monitor our management services (like ssh, NRPE etc).</p>
<p>Therefore, our objective is to come up with <a href="http://www.opsview.com/learn/network-monitoring">network monitoring</a> model to determine which service-checks will run over our production network and which will run over the management network.</p>
<h2>Prerequisites</h2>
<p>Ensure you have the following:</p>
<ul>
<li>1 x Opsview Master (or Slave) connected to both (or more) networks.</li>
</ul>
<h2>Step 1</h2>
<p>Create a monitoring model in which you divide your host into smaller chunks.</p>
<p>These chunks (or building blocks) represent the various parts of the system (e.g. hardware, network, operating system, applications and application interfaces).</p>
<p>Each chunk can then be assigned to either the management network or the production network.</p>
<p>Below is an example on how you can use these chunks to assign them to a network segment.</p>
<p style="text-align: center;"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/multi-homed-monitoring-image3.png"><img class="aligncenter size-full wp-image-1058" title="multi-homed monitoring image3" src="http://labs.opsview.com/wp-content/uploads/2011/09/multi-homed-monitoring-image3.png" alt="" width="500" /></a></p>
<p>In this model we have defined that hardware, network, OS and applications (processes) are monitored through the management network (in general any check we use check_nrpe for).</p>
<p>Any check we run to confirm a service is reachable and usable (WWW, SQL etc) will be run over the production network (so we can confirm our customers can reach our services).</p>
<h2>Step 2</h2>
<p>Update the host to reflect the various addresses it has.</p>
<p>Using the Opsview WebUI go to the Configuration &gt; Hosts and select Action &gt; Create new host.</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot1_500px1.png"><img class="aligncenter size-full wp-image-1062" title="screenshot1_500px" src="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot1_500px1.png" alt="" width="500" height="219" /></a></p>
<p>In the Other Hostnames/IP’s field we can add our other IP’s of FQDN’s (as shown also adding the primary IP is good practice) in a comma-separated list for each IP you wish to assign to this host.</p>
<p>Save the host (later on we will assign host-templates and service checks).</p>
<h2>Step 3</h2>
<p>Confirm from the commandline that you can reach your host using the addresses listed in the “Other Hostnames/IPs” by running one check for a management service and one for a production service.</p>
<pre>Check_nrpe –H primary-ip
Check_tcp –H secondary-ip –p 80 (for a webserver)</pre>
<h2>Step 4</h2>
<p>Set up the service checks.</p>
<p>Now to use the defined “Other Hostnames/IPs” we need to change some of our service-checks.</p>
<p>The easiest way is to clone a check you already use and give it a new name (e.g. SSH to SSH Management).</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot2_500px.png"><img class="aligncenter size-full wp-image-1063" title="screenshot2_500px" src="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot2_500px.png" alt="" width="500" height="351" /></a></p>
<p>As you see from the arguments line, the check will run against the macro know as $HOSTADDRESS$ which is the equivalent of the “Primary Hostname/IP”.</p>
<p>To use the “Other Hostnames/IPs” we have to change this to the macro that represents the IP or Hostname we want.</p>
<p>Opsview will automatically place all entries for the “Other Hostnames/IPs” in the macro $ADDRESS1$, $ADDRESS2$ and so on.</p>
<p>In this case we want to use our Primary-IP which we also registered as the first “Other address” so it is stored in $ADDRESS1$. This allows us to change our check to have the following arguments:</p>
<p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot3_500px.png"><img class="aligncenter size-full wp-image-1064" title="screenshot3_500px" src="http://labs.opsview.com/wp-content/uploads/2011/09/screenshot3_500px.png" alt="" width="500" height="353" /></a></p>
<p>Using the described method you can now create service-checks and based on your model assign them to the correct $ADDRESS$ value.</p>
<h2>Step 5</h2>
<p>Create new host-templates for the checks we created in Step 4 and assign them to your multi-homed host.</p>
<h2>Postscript</h2>
<p>The “Other Hostnames/IPs” is a very powerful feature in Opsview and there are many things you can do with this functionality (for instance creating inter-host dependencies).</p>
<p>But in case your Opsview slave is not allowed to access the production network due to firewall policies or security restrictions, not all is lost. If you have an Opsview slave in the production network (and one in the management network) you can apply split-monitoring which will be discussed in a later blog-post.</p>
<div style="border: 1px solid #ccc; background-color: #f5f5f5; padding: 8px;">
<h3>About the Author</h3>
<p>Alan Wijntje is responsible for maintaining and improving all forms of monitoring at <a href="http://www.ziggo.nl">Ziggo</a>, one of the leading Managed Service Providers in the Netherlands. An Opsview expert and open source enthusiast, Alan enjoys finding, designing and implementing new and innovative ways of monitoring complex systems and applications.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/09/5-easy-steps-to-monitor-multi-homed-hosts-using-opsview/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>10 Ways to Make Your Monitoring System Scale</title>
		<link>http://labs.opsview.com/2011/09/10-ways-to-make-your-monitoring-system-scale/</link>
		<comments>http://labs.opsview.com/2011/09/10-ways-to-make-your-monitoring-system-scale/#comments</comments>
		<pubDate>Tue, 06 Sep 2011 09:00:27 +0000</pubDate>
		<dc:creator>tcallway</dc:creator>
				<category><![CDATA[Forked software]]></category>
		<category><![CDATA[Frameworks]]></category>
		<category><![CDATA[MSPs]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[SNMP]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[business systems]]></category>
		<category><![CDATA[icinga]]></category>
		<category><![CDATA[multi-tenancy]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[network monitoring]]></category>
		<category><![CDATA[open source monitoring]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=1009</guid>
		<description><![CDATA[
			
				
			
		
Freeware IT monitoring tools are used by thousands of organisation worldwide however using them to monitor complex network, server and application installations can be quite a challenge.  This blog post takes the basic capabilities of one such tool, Nagios® Core, and shows how you can scale it with Opsview for use in enterprise environments.

Distributed [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F10-ways-to-make-your-monitoring-system-scale%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F09%2F10-ways-to-make-your-monitoring-system-scale%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/09/hyper-scalability_2.jpg"><img class="alignleft size-full wp-image-1012" style="margin-bottom: 8px; margin-right: 10px;" title="hyper scalability_2" src="http://labs.opsview.com/wp-content/uploads/2011/09/hyper-scalability_2.jpg" alt="" width="104" height="127" /></a></p>
<p>Freeware IT monitoring tools are used by thousands of organisation worldwide however using them to monitor complex network, server and application installations can be quite a challenge.  This blog post takes the basic capabilities of one such tool, <a href="http://www.opsview.com/company/legal/trademarks#NagiosTrademarkStatement">Nagios® Core</a>, and shows how you can scale it with Opsview for use in enterprise environments.</p>
<p><span id="more-1009"></span></p>
<h2>Distributed monitoring</h2>
<p>Building and managing a complex<a href="http://www.opsview.com/learn"> distributed monitoring</a> environment with Nagios Core is no mean feat. With Opsview you get distributed monitoring that’s easy to setup and simple to maintain.  You can monitor your devices and applications from a central location and grow the system without growing the monitoring complexity.</p>
<h2>Slave server clustering</h2>
<p>Opsview can automatically load-balance across multiple slaves and reallocate monitoring duties if a slave server fails, giving you high availability and scalability without additional overhead.</p>
<div id="attachment_1050" class="wp-caption alignright" style="width: 208px"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/Clustering_diagram550px.png"><img class="size-full wp-image-1050   " style="margin-bottom: 8px; margin-left: 10px; border: 1px solid #ccc;" title="Clustering_diagram550px" src="http://labs.opsview.com/wp-content/uploads/2011/09/Clustering_diagram550px.png" alt="" width="198" height="152" /></a><p class="wp-caption-text">Example Clustering Model</p></div>
<h2>Master server clustering</h2>
<p>Management of Opsview is performed on a single master server, however master servers can be clustered giving you the high availability and redundancy needed for mission critical monitoring.</p>
<h2>Separate database server</h2>
<p>Opsview can be run on a separate database server so you can move intensive reporting activity to a dedicated machine and fine tune the server for better performance.</p>
<h2>Efficient configuration UI</h2>
<p>Nagios Core is capable of monitoring thousands of devices, but maintaining configuration on expanding systems can quickly become a problem. Opsview handles this with an easy to use interface and middleware layer which tackles the complexity of configuring individual software components.</p>
<h2>‘Single pane of glass’ monitoring</h2>
<p>Unlike Nagios Core where data may be gathered from a number of systems and presented in different ways, Opsview’s intuitive web interface displays all your monitoring information in one place, with a top down view on system status.  Devices and applications can be easily grouped by business process and their status displayed using simple &#8216;traffic lights&#8217; so you can easily see the health of critical and non-critical groups. This makes monitoring and maintaining large, complex systems less time consuming and more efficient with a scalable architecture to cover all your systems and locations.</p>
<div id="attachment_1031" class="wp-caption aligncenter" style="width: 560px"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/configurationUI550px.png"><img class="size-full wp-image-1031" style="border: 1px solid #ccc;" title="configurationUI550px" src="http://labs.opsview.com/wp-content/uploads/2011/09/configurationUI550px.png" alt="" width="550" height="248" /></a><p class="wp-caption-text">Opsview&#39;s Host Group Hierarchy View</p></div>
<p style="text-align: center;">
<h2>Distributed alerting</h2>
<p>Slave servers monitored by Opsview can handle their own notifications, allowing autonomy if communication is lost between master and slave servers. Alerts can be sent by the Master server or slave server by email / sms so you’re always in touch with the health of your system, no matter the location or your systems.</p>
<h2>Automated APIs</h2>
<p>Opsview APIs speed up system configuration by automatically populating and updating host information saving you time and effort as your system grows.</p>
<div id="attachment_1048" class="wp-caption aligncenter" style="width: 560px"><a href="http://labs.opsview.com/wp-content/uploads/2011/09/API_diagram_Opsview550px1.jpg"><img class="size-full wp-image-1048" title="API_diagram_Opsview550px" src="http://labs.opsview.com/wp-content/uploads/2011/09/API_diagram_Opsview550px1.jpg" alt="" width="550" height="285" /></a><p class="wp-caption-text">Example use cases for Opsview&#39;s RESTful API</p></div>
<h2>SNMP trap processing</h2>
<p>Nagios Core has no native support for SNMP trap processing. Opsview’s SNMP engine accepts incoming traps, analyses the data and decides how to handle them. In-built SNMP discovery allows SNMP objects to be detected and monitored easily and rules can be configured through the management UI.</p>
<h2>Notification profiles</h2>
<p>With Nagios Core you can be inundated with monitoring information, not all of it useful. In Opsview you can set-up notification profiles so the right people get the right information at the right time. Only want to know about email server status during business hours? No problem. Need SMS alerts about your webstore? It’s covered. Notification profiles can also combined with Opsview’s service desk module to automatically assign support tasks to engineers, helping streamline incident management.</p>
<div style="border: 1px solid #ccc; background-color: #e6e6e6; padding: 6px;"><strong>IMPORTANT LEGAL NOTICE: No affiliation, partnership, joint-venture or any other commercial relationship exists between Opsera Ltd, the makers of Opsview, and Nagios Enterprises LLC, the trademark holders of Nagios.</strong></div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/09/10-ways-to-make-your-monitoring-system-scale/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>10 Ways to Make Your Monitoring System Easier</title>
		<link>http://labs.opsview.com/2011/08/10-ways-to-make-your-monitoring-system-easier/</link>
		<comments>http://labs.opsview.com/2011/08/10-ways-to-make-your-monitoring-system-easier/#comments</comments>
		<pubDate>Tue, 30 Aug 2011 10:18:12 +0000</pubDate>
		<dc:creator>tcallway</dc:creator>
				<category><![CDATA[Forked software]]></category>
		<category><![CDATA[Nagios]]></category>
		<category><![CDATA[Opsview]]></category>
		<category><![CDATA[SNMP]]></category>
		<category><![CDATA[System Management]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[business systems]]></category>
		<category><![CDATA[network monitoring]]></category>
		<category><![CDATA[open source monitoring]]></category>

		<guid isPermaLink="false">http://labs.opsview.com/?p=995</guid>
		<description><![CDATA[
			
				
			
		
Many freeware  IT monitoring tools are great but using them to manage complex systems can be a real challenge. It can also be unforgiving on anyone less than expert in configuring the system with mistakes being punished by a complete stop in monitoring activity.

Distributed monitoring

Opsview takes the complexities of its core engine, Nagios® Core, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F08%2F10-ways-to-make-your-monitoring-system-easier%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Flabs.opsview.com%2F2011%2F08%2F10-ways-to-make-your-monitoring-system-easier%2F&amp;style=normal&amp;b=2" height="61" width="50" />
			</a>
		</div><p><a href="http://labs.opsview.com/wp-content/uploads/2011/08/staples-easy-button.png"><img class="alignleft size-full wp-image-1001" style="margin-bottom: 8px; margin-right: 10px;" title="staples-easy-button" src="http://labs.opsview.com/wp-content/uploads/2011/08/staples-easy-button.png" alt="" width="120" height="120" /></a></p>
<p>Many freeware <a href="http://www.opsview.com/learn/whitepapers/importance-it-monitoring-assuring-key-business-services-availability"> IT monitoring</a> tools are great but using them to manage complex systems can be a real challenge. It can also be unforgiving on anyone less than expert in configuring the system with mistakes being punished by a complete stop in monitoring activity.</p>
<p><span id="more-995"></span></p>
<h2>Distributed monitoring</h2>
<p><img class="alignright size-full wp-image-1020" style="margin-bottom: 8px; margin-left: 10px;" title="distributedMonitoring" src="http://labs.opsview.com/wp-content/uploads/2011/08/distributedMonitoring.jpg" alt="" width="169" height="163" /></p>
<p>Opsview takes the complexities of its core engine, <a href="http://www.opsview.com/company/legal/trademarks#NagiosTrademarkStatement">Nagios® Core</a>, and makes distributed monitoring simple. All management is performed on a single master server and communication with slaves is handled by Opsview&#8217;s middleware layer. Provision is included for geographically diverse monitoring and to cope with potentially unreliable WAN connections between servers.</p>
<h2>Host attributes</h2>
<p>A feature you won’t find in Nagios Core, <a href="http://labs.opsview.com/2011/10/5-steps-to-organising-your-server-monitoring-with-attributes/">host attributes</a> help simplify configurations by allowing you to create multiple services based on a set of pre-defined attributes.  You can assign one or many attributes to a host and set service checks to use the attributes for a host to then create multiple services for monitoring.</p>
<h2>Keywords</h2>
<p>Opsview’s keyword function gives you a flexible way of grouping hosts and services. You can tag devices, business processes and applications giving you a convenient way of seeing the status of the groups, e.g. critical IT systems, network circuits or business users and customers.</p>
<p><img class="alignleft" style="margin-bottom: 8px; margin-right: 10px;" title="Cloning" src="http://www.opsview.com/sites/all/themes/opsview/images/opsviewApplianceIcon90px.png" alt="" width="90" height="82" /></p>
<h2>Cloning capability</h2>
<p>Chances are when you’re configuring or adding devices and services to your network many of them will be quite similar.  To save time you can simply choose to clone an existing device or service monitored with Opsview and add it to the network.</p>
<h2>SNMP discovery</h2>
<p>Nagios Core provides support for SNMP via its plugin project, but it doesn’t provide support for processing SNMP traps. Opsview does this automatically. A powerful processing engine accepts incoming traps, analyses the data and then decides how they should be processed.  In-built SNMP discovery also means SNMP objects can be detected and monitored with ease without the need for human intervention.</p>
<h2><img class="alignright" style="margin-bottom: 8px; margin-left: 10px;" title="Service Desk Connector" src="http://www.opsview.com/sites/default/files/service-desk-connector600px.png" alt="" width="252" height="83" />Notification profiles</h2>
<p>Opsview helps you avoid information overload by easily creating complex business rules that define who gets alerts, how they get them and why. Combined with the Opsview service desk module you get a powerful notification tool that helps speed up mean time to repair and streamline workflows.</p>
<h2>Configuration UI</h2>
<p>Configuring and maintaining a system with Nagios Core can become difficult the bigger and more complex the monitoring environment gets. Opsview’s configuration UI means you don’t need to be a Nagios Core expert to get your monitoring up and running.   All the software processes are kept ‘under the hood’ and presented via an intuitive interface so you can see the information that makes a difference to your business without getting caught up with software.</p>
<h2>APIs for Automation</h2>
<p>Opsview includes automated APIs for configuration, monitoring and notification which makes system set-up pain-free and scaling simple.  The APIs also make integrating with other IT Management tools easy.</p>
<h2>SLA reporting</h2>
<p><img class="alignright" style="margin-bottom: 8px; margin-left: 10px; border: 1px solid #ccc;" title="Reports Module" src="http://www.opsview.com/sites/default/files/reports800px.jpg" alt="Reports Module" width="182" height="128" />Opsview’s <a href="http://www.opsview.com/products/enterprise-modules/reports">Reports Module</a> can automatically generate custom reports in line with business requirements. If you have to produce regular reports for your management or customers, this module will save you hours by generating the reports you need when you need them. The reports can be sent out automatically on email in PDF, HTML, Excel, ODT or XML to your chosen distribution list.</p>
<h2>Service desk integration</h2>
<p>When you <a href="http://www.opsview.com/products/enterprise-modules/service-desk-connector">integrate your service desk with Opsview</a> you get a powerful tool for automating incident reporting. Tickets can be created in your system based on alerts generated by Opsview, saving time and freeing up resources. Out of-the-box support is included for Service-Now.com, Bestpractical’s Request Tracker and Atlassian JIRA.</p>
<div style="border: 1px solid #ccc; background-color: #e6e6e6; padding: 6px;"><strong>IMPORTANT LEGAL NOTICE: No affiliation, partnership, joint-venture or any other commercial relationship exists between Opsera Ltd, the makers of Opsview, and Nagios Enterprises LLC, the trademark holders of Nagios.</strong></div>
]]></content:encoded>
			<wfw:commentRss>http://labs.opsview.com/2011/08/10-ways-to-make-your-monitoring-system-easier/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

