<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Cluster Monitoring &#8211; Nagios Library</title>
	<atom:link href="https://library.nagios.com/tag/cluster-monitoring/feed/" rel="self" type="application/rss+xml" />
	<link>https://library.nagios.com</link>
	<description>Complete Nagios monitoring resources and documentation</description>
	<lastBuildDate>Fri, 06 Mar 2026 20:21:46 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://library.nagios.com/wp-content/uploads/2024/11/Nagios-Blue-N.svg</url>
	<title>Cluster Monitoring &#8211; Nagios Library</title>
	<link>https://library.nagios.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity</title>
		<link>https://library.nagios.com/techtips/nagios-xi-bpi-business-process-intelligence-clarity/</link>
		
		<dc:creator><![CDATA[Shamas Demoret]]></dc:creator>
		<pubDate>Thu, 26 Dec 2024 15:25:35 +0000</pubDate>
				<category><![CDATA[Techtips]]></category>
		<category><![CDATA[Application & Server Monitoring]]></category>
		<category><![CDATA[BPI]]></category>
		<category><![CDATA[Cluster Monitoring]]></category>
		<category><![CDATA[Root Cause Analysis]]></category>
		<guid isPermaLink="false">https://library.nagios.com/?p=36952</guid>

					<description><![CDATA[Learn how Nagios Business Process Intelligence (BPI) empowers you to intelligently monitor your clusters, stacks, and other complex setups with ease.]]></description>
										<content:encoded><![CDATA[
<p>Nagios Business Process Intelligence (BPI) provides a way for you to combine Hosts and Services you monitor with <a href="https://www.nagios.com/products/nagios-xi/" target="_blank" rel="noopener">Nagios XI</a> into logical groups, then alert intelligently based on both the overall percent health of the group, and of &#8216;essential members&#8217; (which must be in an OK state for the group to function). This powerful tool can be found in the <em><strong>Home</strong> > <strong>Details</strong></em> section of the Nagios XI web interface.</p>



<p>Business Process Intelligence can be leveraged for a wide variety of group monitoring use cases and is especially useful when monitoring clusters and complex applications, where some members can fail without compromising the application functionality, while others may be essential.</p>



<p>Not only will unlocking the capabilities of this feature enable you to alert more intelligently, but it will also enhance your ability to troubleshoot issues and provide rapid root cause analysis when set up properly.</p>



<h2 class="wp-block-heading">Nagios BPI Group Example</h2>



<p>Let&#8217;s look at the group for a <a href="https://www.nagios.com/products/nagios-log-server/" data-type="link" data-id="https://www.nagios.com/products/nagios-log-server/" target="_blank" rel="noopener">Nagios Log Server</a> testing and development Cluster, running on a VMWare vWorker, as an excellent example of the power of BPI (note each Nagios license includes a <a href="https://support.nagios.com/kb/article/nagios-log-server-license-entitlements-853.html" target="_blank" rel="noopener">free</a> test/lab install, so you can test new configurations and upgrades before rolling them into production). In this example, we have assigned the group a low priority since it isn&#8217;t a production setup. We&#8217;ve also made the Windows Host on which VMWare is running an essential member (indicated by the dotted circle to its left), because if the VM Host stopped working, so would all of the Log Server Instances running as VM guests.</p>



<figure class="wp-block-image size-full is-resized"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-1-All-is-well-1.png"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-1-All-is-well-1.png" alt="Business Process Intelligence screen in Nagios XI" class="wp-image-38144" style="width:840px;height:auto" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 1"></a><figcaption class="wp-element-caption">A BPI group in an OK state. </figcaption></figure>



<h2 class="wp-block-heading">Nesting Groups for Holistic Monitoring</h2>



<p>BPI includes the ability to nest groups within groups, so the status of multiple BPI groups impacts the status of the larger overarching group. This technique allows you to utilize multiple layers of group logic to determine the status of your ultimate top-level BPI group&#8217;s status. The Nagios Log Server Lab Environment group, as depicted above, consists of three additional groups, each reflecting the state of a Log Server Instance, and a fourth group, which represents the vWorker they operate on.</p>



<p>These smaller groups include a ping check for the Host, key performance checks like CPU and memory usage, and Service status checks for opensearch and logstash. We&#8217;ve marked the Host ping check and the opensearch/logstash Service checks as essential members since their failure would render this Instance non-functional:</p>



<figure class="wp-block-image size-large"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-2-Instance-Group-1.png"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-2-Instance-Group-1-1024x484.png" alt="The BPI group for an individual Nagios Log Server instance." class="wp-image-38149" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 2"></a><figcaption class="wp-element-caption">The BPI group for an individual Nagios Log Server instance. </figcaption></figure>



<h2 class="wp-block-heading">Troubleshooting with Nagios BPI</h2>



<p>If any of these essential members enter a problem state, the NLSR1-Test-A/B/C group they are in will show as critical.</p>



<p>Once this happens, the larger &#8216;Nagios Log Server Lab Environment&#8217; group will go into a Warning state based on the group health thresholds we&#8217;ve defined:</p>



<figure class="wp-block-image size-large"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-3-Warning.png"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-3-Warning-1024x211.png" alt="A BPI group in a Warning state." class="wp-image-38152" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 3"></a><figcaption class="wp-element-caption">A BPI group in a Warning state. </figcaption></figure>



<p>With a quick look at the group members, we can immediately see the problem is with the opensearch Service on the third Instance, and can expand the group to see if there are any other problems:</p>



<figure class="wp-block-image size-large"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-4-Warning.png"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-4-Warning-1024x385.png" alt="An expanded view of the problem member." class="wp-image-38155" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 4"></a><figcaption class="wp-element-caption">An expanded view of the problem member. </figcaption></figure>



<h2 class="wp-block-heading">All is Well Again</h2>



<p>Since the stopped opensearch Service is the sole problem on Instance C, we SSH into the machine and restart it. After doing so, we schedule an immediate check for the Service on its Status Detail page and revisit the BPI group. We see both the instance-specific group and the larger group are once again in an OK state:</p>



<figure class="wp-block-image size-large"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-5-OK-Again.webp"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-5-OK-Again-1024x329.png" alt="A Nagios BPI goup and sub-group in an OK state. All is well!" class="wp-image-38156" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 5"></a><figcaption class="wp-element-caption">The BPI group and subgroup in an OK state. </figcaption></figure>



<h2 class="wp-block-heading">Intelligent Alerting </h2>



<p>As we&#8217;ve shown above, Business Process Intelligence not only provides handy visual insight into group status but also directly alerts you when a group enters a Warning or Critical state. You can quickly and easily configure monitoring and alerting for the entire group as an individual service using Nagios XI&#8217;s built-in BPI Wizard. Take a look at our Log Server Cluster through the lens of a Service that was created with the Wizard:</p>



<figure class="wp-block-image size-large"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service.webp"><img fetchpriority="high" decoding="async" width="1024" height="171" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service-1024x171.webp" alt="A service representing the status of a BPI group, created using the BPI Wizard." class="wp-image-37473" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 6" srcset="https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service-1024x171.webp 1024w, https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service-300x50.webp 300w, https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service-768x128.webp 768w, https://library.nagios.com/wp-content/uploads/2024/12/BPI-Group-Service.webp 1032w" sizes="(max-width: 1024px) 100vw, 1024px" /></a><figcaption class="wp-element-caption">A service representing the state of the entire BPI group. </figcaption></figure>



<p>The &#8220;<a href="https://library.nagios.com/documentation/using-nagios-bpi-v2/">Using BPI in Nagios XI</a>&#8221; guide is an excellent resource to help you learn how to manage your groups, including details on initial setup, group alert logic, and using the Wizard.</p>



<h2 class="wp-block-heading">Centralized Monitoring with Nagios Fusion Dashlet</h2>



<p>Another great feature is the BPI Dashlet in <a href="https://www.nagios.com/products/nagios-fusion/" data-type="link" data-id="https://www.nagios.com/products/nagios-fusion/" target="_blank" rel="noopener">Nagios Fusion</a>, which provides you with centralized visibility of the status of all your BPI groups across multiple Nagios XI servers, and empowers you with single-click access to the BPI page on each:</p>



<figure class="wp-block-image size-large is-resized"><a href="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-6-Fusion.png"><img decoding="async" src="https://library.nagios.com/wp-content/uploads/2024/12/BPI-R1-6-Fusion-1024x321.png" alt="A Nagios Fusion dashboards showing the status of BPI groups on 2 Nagios XI servers. " class="wp-image-38171" style="width:840px;height:auto" title="Nagios XI BPI (Business Process Intelligence): Transform Complexity into Clarity 7"></a><figcaption class="wp-element-caption">The BPI dashlet in Nagios Fusion</figcaption></figure>



<p>You can learn more about the role Nagios Fusion and each of the other Nagios solutions play in providing a holistic monitoring perspective <a href="https://library.nagios.com/solutions/get-holistic-with-4-nagios-solutions/">here</a>.</p>



<h2 class="wp-block-heading">An Enterprise Edition Extra</h2>



<p>The Enterprise Edition of Nagios XI also automatically enables you to leverage the Host groups and Service groups you define as BPI groups. To learn more about setting up your groups, and about this great Enterprise Feature, the following video will be an excellent resource:</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe title="How to Use Business Process Intelligence - Nagios XI: Enterprise Features" width="800" height="450" src="https://www.youtube.com/embed/YrTlI5neRC4?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>As you can see from the example, Business Process Intelligence is a powerful feature. Careful cultivation of meaningful BPI groups will empower you to more quickly identify individual issues preventing complex processes from carrying out their critical tasks and get your IT infrastructure back online faster.</p>



<p></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Nagios Log Server &#8211; Removing An Instance From A Cluster</title>
		<link>https://library.nagios.com/documentation/nagios-log-server-removing-instance-cluster/</link>
		
		<dc:creator><![CDATA[Ethan Galstad]]></dc:creator>
		<pubDate>Wed, 21 Mar 2018 21:36:25 +0000</pubDate>
				<category><![CDATA[Documentation]]></category>
		<category><![CDATA[Cluster Monitoring]]></category>
		<guid isPermaLink="false">https://library.nagios.com/?page_id=1812</guid>

					<description><![CDATA[This document describes how to remove an instance from a Nagios Log Server cluster. This document is written for administrators who are looking to remove an instance from a Nagios Log Server cluster, planned or not. Removing An Instance From A Cluster]]></description>
										<content:encoded><![CDATA[<p>This document describes how to remove an instance from a Nagios Log Server cluster.</p>
<p>This document is written for administrators who are looking to remove an instance from a Nagios Log Server cluster, planned or not.</p>
<p><a href="https://assets.nagios.com/downloads/nagios-log-server/docs/Removing-An-Instance-From-A-Nagios-Log-Server-Cluster.pdf" target="_blank" rel="noopener">Removing An Instance From A Cluster</a></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Nagios Log Server &#8211; Managing Clusters</title>
		<link>https://library.nagios.com/documentation/nagios-log-server-managing-clusters/</link>
		
		<dc:creator><![CDATA[Ethan Galstad]]></dc:creator>
		<pubDate>Mon, 06 Oct 2014 21:05:43 +0000</pubDate>
				<category><![CDATA[Documentation]]></category>
		<category><![CDATA[Cluster Monitoring]]></category>
		<category><![CDATA[Simplified Management]]></category>
		<guid isPermaLink="false">https://libtest.nagios.com/library/products/nagios-log-server/documentation/nagios-log-server-managing-clusters/</guid>

					<description><![CDATA[This document describes how to manage your Nagios Log Server Clusters through the administration section. This document is intended for use by Nagios Log Server Administrators. It describes how to manage and troubleshoot Nagios Log Server Clusters. Managing-Nagios-Log-Server-Clusters.pdf]]></description>
										<content:encoded><![CDATA[<p>This document describes how to manage your Nagios Log Server Clusters through the administration section.</p>
<p>This document is intended for use by Nagios Log Server Administrators. It describes how to manage and troubleshoot Nagios Log Server Clusters.</p>
<p><a target="_blank" href="https://assets.nagios.com/downloads/nagios-log-server/docs/Managing-Nagios-Log-Server-Clusters.pdf" rel="noopener">Managing-Nagios-Log-Server-Clusters.pdf</a></p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
