Operational and Predictive Intelligence - Ideas Portal

Reliability and Observability

Showing 18

Recovery Action System improvement: Generate and Link custom incident

As part of ongoing event management and incident management activities, it would be great to be able to create an incident during a recovery action execution. Splunk would then identify the incident creation and link it with the trigge...
Jonathan Malpartida Patterson 6 months ago in Reliability and Observability 1 Will not implement

Monitoring Panduit PDUs in Branch Offices for Power Efficiency/Status

Hi! We have 70 Panduit PDUs in the Roche Costa Rica Campus (52x Panduit P16D14M, 18x Panduit P30D02M). The Connectivity equipment is connected to these PDUs. Site Services needs to be able to monitor them remotely for fault and usage. These PDUs s...
Marcos Herrera 7 months ago in Reliability and Observability 3 Will not implement

Enablement of self-service maintenances in Logicmonitor instead of Splunk MoM

Since only Logicmonitor consumers are using the maintenance in Splunk MoM M3T, from our point of view and with the new RBAC model, the service should be offered via self-service in Logicmonitor. As far as we know, only Logicmonitor consumers are se...
Santiago Sanchez Merino 7 months ago in Reliability and Observability 0 Will not implement

Oracle database integration in Grafana

Connect to an Oracle database to retrieve logs based on queries
Avirup Nandy 4 months ago in Reliability and Observability 1 Will not implement

DataDog Servicenow incident integration webhook - add customization for Re-Trigger message

What: Add the possibility to send a custom message via the webhook payload that will update the SNow incident. Currently, if a metric is retriggered, the incident in SNow is updated with a predefined comment. Example webhook payload. event_update_msg - parameter...
Radoslaw Linke 4 months ago in Reliability and Observability 0 Will not implement

New Splunk Index

We want to develop an index within Kubernetes CaaS alerts from Qdrant - Graph
Roberto Rodriguez Pena 4 months ago in Reliability and Observability 1 Will not implement

Reevaluate Recovery Action System architecture

Currently the recovery action system works asynchronously, leaving room for failures or missed episodes that have a recovery action associated with them. This architecture also depends on a lot of complex pipelines in Gitlab CI/CD and searches/...
Christian Mochales 5 months ago in Reliability and Observability 0 Will not implement

Explore if using https://www.grepr.ai/ could help to reduce observability costs

No description provided
Jose Antonio Almena 5 months ago in Reliability and Observability 0 Will not implement

Automating Backup Episode Reporting in Splunk

The idea is to automate the reporting capability for backup episodes in Splunk. Through the use of an API, we will be able to query episodes generated by backup failures, enabling traceability of errors. This will also serve as evidence for future...
Avelino Gomez 6 months ago in Reliability and Observability 2 Will not implement

Improve Cohesity monitoring in Splunk

We aim to create or update the Cohesity data model to handle alerts that are classified as 'warnings' by Cohesity but are actually critical. To achieve this, we will enable the sending of 'warning' type alerts from Cohesity to Splunk. Subsequently...
Avelino Gomez 6 months ago in Reliability and Observability 1 Will not implement