My Service Disappeared From Nagios XI! What Should I Do?

Picture of Ayub Huruse
Ayub Huruse
Service Disappeared From Nagios XI!

If a service suddenly disappears from Nagios XI, it can be alarming—but don’t worry, the issue is usually fixable. Whether it’s due to a database inconsistency, a configuration error, or a system resource constraint, pinpointing the cause is the first step toward resolution. This guide will walk you through practical troubleshooting steps, covering database integrity, disk space, configuration checks, and other key areas to help you quickly restore missing services and keep your monitoring system running smoothly.

Step 1: Investigate Database Issues

Nagios XI relies on MySQL or MariaDB to store configuration data. A corrupted database or missing entries can cause services to disappear.

1. Log in to the Database Server:

mysql -u root -p

2. Check Database Integrity:

USE nagios; 
CHECK TABLE nagios_services; 

If errors are detected, repair the table:

REPAIR TABLE nagios_services;

3. Confirm Service Entries Exist:

SELECT service_description, host_name FROM nagios_services;

Verify that your missing service is listed. If it’s not, it may have been deleted or never saved.

4. Restart the Database Service:

systemctl restart mariadb
# or, for MySQL:
systemctl restart mysql

5. Optimize for Prevention:

OPTIMIZE TABLE nagios_services;

This improves performance and helps prevent future database issues.

Step 2: Ensure Adequate Disk Space

Low disk space can disrupt database writes, causing services to vanish from Nagios XI.

1. Check Disk Usage:

df -h

If any partition is at 100%, free up space immediately.

2. Inspect MySQL Storage:

du -sh /var/lib/mysql

Excessive size here might indicate log or data buildup.

3. Clean Up Excess Files:

  • Remove old Nagios logs:
rm -rf /var/log/nagios/*.log
  • Clear temporary files:
rm -rf /tmp/*
  • Purge MySQL binary logs (use with caution):
mysql -u root -p -e "PURGE BINARY LOGS BEFORE NOW();"

4. Expand Storage if Needed:
If space remains critically low, resize the partition or add more storage.

Step 3: Validate Configuration Files

A misconfiguration or syntax error can silently drop services from Nagios XI.

1. Test the Configuration:

/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg

Review and fix any errors in the output.

2. Search for the Missing Service:

grep -r "service_name" /usr/local/nagios/etc/services/

Replace service_name with your missing service’s name. If it’s not found, it may have been removed.

3. Apply Manual Changes Properly:
If you’ve edited configs manually, reapply them:

/usr/local/nagiosxi/scripts/reconfigure_nagios.sh

4. Restart Nagios:

systemctl restart nagios

Step 4: Check Audit Logs

Nagios XI includes audit logs that track changes to configurations and settings.

1. View Audit Logs: Navigate to Admin > System Information > Audit Log

  • Look for any recent deletions or changes to service configurations.
  • If you find an unintended deletion, re-add the service.
Screenshot 2025 03 11 124740
Audit Log

2. Check Event Logs for Errors:

tail -f /usr/local/nagiosxi/var/eventman.log

This helps identify potential background processing failures.

Step 5: Restore from a Backup

If the service remains missing, a backup restore might be your best bet.

1. Locate Backups:

ls -l /store/backups/nagiosxi

2. Perform the Restore:

/usr/local/nagiosxi/scripts/restore_xi.sh /store/backups/nagiosxi/backup_file.tar.gz

Replace backup_file.tar.gz with the desired backup file.

3. Confirm Success: Log into Nagios XI and check if the missing service reappears.

Step 6: Check Permissions and System Health

Permission issues or resource strain could be to blame.

1. Fix Ownership:

chown -R nagios:nagios /usr/local/nagios
chown -R nagios:nagios /var/lib/nagios

2. Restart Related Services:

systemctl restart nagios
systemctl restart httpd
systemctl restart ndo2db

3. Monitor Resources:

top

High CPU or memory usage by Nagios XI might signal a deeper issue.

Conclusion

By following these troubleshooting steps, you should be able to restore missing services in Nagios XI. Regular database maintenance, disk space monitoring, and proper backups can help prevent such issues in the future. Additionally, keeping Nagios XI updated and ensuring proper system resource allocation can minimize potential disruptions. If problems persist, consider checking Nagios XI logs or reaching out to Nagios support for further assistance.

Share: