Grafana IRM

Incident response & management integrated with observability, all in Grafana Cloud

Grafana IRM screenshot
Consolidate, customize, and automate core components of your incident response and management with Grafana IRM, a set of observability-native tools that work seamlessly with your Grafana Cloud stack to give you faster response times.

The actually useful Grafana Cloud Free plan

  • 50GB traces
  • 10k metrics
  • 14-day retention
  • 3 active users
  • 50GB logs

Respond to issues quickly and confidently

Tailor workflows so you have the information and the stakeholders you need to address issues, and guarantee 24/7 coverage with Grafana OnCall.

Eliminate confusion before, during, and after incidents

Automate tedious tasks, centralize communication, and get a complete post-incident review record with Grafana Incident.

Only pay for active users

Your bill only grows when engineers actually use Grafana OnCall or Grafana Incident.

Notify the right people, at the right time

Improve communication and tailor alert notifications so critical information reaches the right team members.

  • Deliver customized notifications via Slack, Microsoft Teams, Telegram, SMS, phone calls, email, and more.
  • Receive push notifications personalized for your role and responsibilities.
  • Use automation to remove blockers and improve response times with templates and multi-step escalation chains.
  • Acknowledge, resolve, or escalate incidents from your preferred communication channel.
Grafana OnCall personalized notifications UI
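
To make the idea of a multi-step escalation chain concrete, here is a minimal sketch in Python. It is not the Grafana OnCall API: the `EscalationStep` type, the `steps_due` helper, the wait times, and the recipient labels are all hypothetical, chosen only to show how an unacknowledged alert can move through successive steps.

```python
from dataclasses import dataclass
from datetime import timedelta


@dataclass(frozen=True)
class EscalationStep:
    """One step of a hypothetical escalation chain (illustrative only)."""
    wait: timedelta  # how long the alert may stay unacknowledged before this step fires
    notify: str      # who gets notified at this step
    channel: str     # how they are notified


def steps_due(chain, unacknowledged_for):
    """Return every step whose wait time has already elapsed."""
    return [step for step in chain if step.wait <= unacknowledged_for]


# Assumed example chain: notify the primary at once, then widen the audience.
chain = [
    EscalationStep(timedelta(minutes=0), "primary on-call", "push"),
    EscalationStep(timedelta(minutes=5), "secondary on-call", "sms"),
    EscalationStep(timedelta(minutes=15), "team lead", "phone call"),
]

# Seven minutes without acknowledgement: the first two steps have fired.
for step in steps_due(chain, timedelta(minutes=7)):
    print(f"notify {step.notify} via {step.channel}")
```

In this sketch, acknowledging the alert would simply stop the caller from evaluating further steps; a real escalation chain would be configured in the product rather than in code like this.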

On call, on your terms

Ensure round-the-clock incident response with tools built for distributed teams, by engineers who understand the pressures of on-call shifts.

  • Manage schedules via the Grafana UI, Terraform, or iCal; automate shift swaps and out-of-office events with the Google Calendar integration; and easily factor in time zones, schedule rotations, and more.
  • Override “do not disturb” settings for critical emergencies, maintaining operational readiness.
  • Easily review rotation details, upcoming shifts, and swap requests from your browser or on the go with the mobile app.
Grafana OnCall schedule
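
As a rough illustration of how a rotation and time zones interact, the sketch below works out who is on call at a given moment for a simple repeating rotation. Everything in it is an assumption made for the example: the `on_call_at` function, the names in the rotation, the weekly shift length, and the fixed UTC-5 offset. It does not reflect how Grafana OnCall stores or evaluates schedules.

```python
from datetime import datetime, timedelta, timezone


def on_call_at(rotation, start, shift_length, when):
    """Return who is on call at `when` for a simple repeating rotation.

    `start` and `when` must be timezone-aware so they compare correctly
    even when they are expressed in different time zones.
    """
    if when < start:
        raise ValueError("`when` is before the rotation starts")
    shifts_elapsed = (when - start) // shift_length  # whole shifts completed so far
    return rotation[shifts_elapsed % len(rotation)]


# Hypothetical rotation: weekly shifts starting Monday 09:00 UTC.
rotation = ["ana", "bo", "chen"]
start = datetime(2024, 1, 1, 9, 0, tzinfo=timezone.utc)
shift_length = timedelta(days=7)

# A teammate asks in local time (a fixed UTC-5 offset is assumed here).
local = timezone(timedelta(hours=-5))
when = datetime(2024, 1, 10, 18, 0, tzinfo=local)

print(on_call_at(rotation, start, shift_length, when))
```

Using timezone-aware datetimes throughout means the person asking never has to convert to the schedule's time zone by hand.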

Make data-driven decisions now and in the future

For each event, work with a single source of truth that provides complete incident summaries and helps you make informed choices.

  • Get comprehensive timelines to track key actions, decisions, and updates throughout the incident lifecycle.
  • Automatically convert timelines into a structured post-incident review document and maintain a centralized, authoritative record of each incident.
  • Learn from past incidents by identifying and analyzing bottlenecks and areas for improvement.
Grafana Incident Insights screenshot
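
The step from timeline to review document can be pictured with a small sketch. The `TimelineEvent` type, the `render_review` function, and the sample incident are hypothetical and exist only for illustration; Grafana Incident's own review format and data model are not shown here.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class TimelineEvent:
    """A single timeline entry (hypothetical shape, for illustration)."""
    at: datetime
    actor: str
    note: str


def render_review(title, events):
    """Turn unordered timeline events into a plain-text review outline."""
    ordered = sorted(events, key=lambda e: e.at)
    lines = [f"Post-incident review: {title}", "", "Timeline"]
    for e in ordered:
        lines.append(f"  {e.at:%H:%M} {e.actor}: {e.note}")
    if ordered:
        # Duration from the first recorded event to the last one.
        minutes = int((ordered[-1].at - ordered[0].at).total_seconds() // 60)
        lines += ["", f"Duration: {minutes} min"]
    return "\n".join(lines)


# Made-up sample data, deliberately out of order.
events = [
    TimelineEvent(datetime(2024, 3, 5, 10, 2), "ana", "declared incident"),
    TimelineEvent(datetime(2024, 3, 5, 10, 40), "bo", "marked resolved"),
    TimelineEvent(datetime(2024, 3, 5, 10, 15), "bo", "rolled back release"),
]

print(render_review("Checkout errors", events))
```

Sorting by timestamp before rendering is what lets entries recorded out of order still read as one coherent record.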

Detect anomalies using machine learning

With Sift, our diagnostic assistant, you can run automated system checks that surface problems quickly and efficiently so you can resolve issues faster.

  • Get a holistic view of system health so you can automatically identify anomalies and complex issues before they become major incidents.
  • Get incident response up and running faster with automated Sift checks.
  • Develop personalized responses over time based on feedback and outcomes.
Grafana Incident UI showing error pattern logs

Observability meets IRM

Move from reactive to proactive incident response by acting within your Grafana Cloud observability stack the moment a concerning issue appears.

  • Initiate incidents directly from any Grafana visualization when you spot anomalies or concerning trends.
  • Gather data on incident frequency and types to optimize your observability and response strategies.
  • Integrate with your favorite ITSM tools, including Jira, ServiceNow, GitHub, and more, to customize your incident response and management workflows.
Declare incident menu

Incident response and management on the go

With the IRM mobile app, you can handle critical situations from anywhere.

Personalized notifications:

  • Receive push notifications tailored to your personal preferences.
  • Override “do not disturb” settings for critical emergencies.

On-call schedules at your fingertips:

  • Review on-call rotation details anytime, anywhere.
  • Quickly check upcoming shifts and team availability.
  • Easily request shift swaps with your team.

Incident details on demand:

  • Acknowledge, respond to, or escalate incidents directly from your mobile device.
  • Access comprehensive incident information to make informed decisions.
Grafana OnCall app alert groups

Get started with incident response and management in Grafana Cloud

2

Set up integrations to your favorite apps, such as Slack, where you can add the Grafana Incident chatbot to the relevant channel.

3

Configure notifications

Decide how each user will receive notifications and create escalations.

4

Set up on-call schedules and start declaring incidents

Establish on-call schedules within the UI and declare your first drill incident.

For full implementation details and best practices,
“We hadn’t been planning to make a change, but after we switched to Grafana Cloud Logs for log management, we realized that Grafana Cloud’s Incident Response & Management suite automatically became available to us. At the time, we were using PagerDuty as our escalation tool, and since we were looking for possible cost optimizations everywhere in the tech stack, the SRE team decided to check if it could be replaced with Grafana IRM. Spoiler alert… It’s been a great alternative in terms of both ease of use and cost.”
Alexander Koehler
Senior SRE

Get Grafana IRM in Grafana Cloud

Detect, respond, and learn. Grafana IRM simplifies the incident workflow to help you focus on managing incidents, not your tools.

Cloud Free

No payment. Ever.
Best suited for early-stage and small teams with up to 3 active IRM users per month.
Easiest way to get started

Cloud Pro

Pay as you go
Best suited for growing teams that need to scale above 3 active IRM users and unlock 8x5 support.

Cloud Advanced

Premium bundle
Best suited for teams looking to scale above 3 active IRM users and unlock 24x7 support.

Easily connect to more Grafana Cloud tools

Grafana Alerting

Unify alert management across your entire stack with powerful, flexible rules and notifications.

Grafana SLO

Define service level objectives and create error-budget alerts to catch issues before your customers do.

Ready to get started?