PagerDuty Integration - Critical Alerts
Real-time infrastructure monitoring for DevOps
Integration Setup
Configure Your PagerDuty Service
Connect StatusPulse to your PagerDuty account by generating a webhook endpoint and mapping incident severities. Follow these steps to route critical uptime failures directly to your on-call engineers.
1. Generate Webhook Endpoint
Navigate to StatusPulse > Settings > Integrations. Click "Generate PagerDuty Webhook" to receive your unique HTTPS endpoint. Copy this URL and paste it into your PagerDuty service configuration under "Integration URL".
2. Map Incident Severities
StatusPulse automatically translates downtime events to PagerDuty severity levels. HTTP 5xx errors and DNS resolution failures map to "critical". Response times exceeding 2,000 ms map to "error". You can configure custom thresholds in the integration dashboard.
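The default mapping described above can be sketched in a few lines of Python. This is an illustration only, not StatusPulse's actual implementation: the function name map_severity, its parameters, and the check order (DNS failure first, then HTTP status, then response time) are assumptions made for the example.

```python
from typing import Optional

# Default threshold taken from the text above; the dashboard lets you override it.
SLOW_RESPONSE_MS = 2000


def map_severity(
    http_status: Optional[int],
    dns_failed: bool,
    response_ms: Optional[float],
    slow_ms: float = SLOW_RESPONSE_MS,
) -> Optional[str]:
    """Return a PagerDuty severity for one check result, or None if healthy.

    Hypothetical helper: the name and signature are not part of StatusPulse.
    """
    # DNS resolution failures map to "critical".
    if dns_failed:
        return "critical"
    # Any HTTP 5xx status maps to "critical".
    if http_status is not None and 500 <= http_status <= 599:
        return "critical"
    # Response times strictly above the threshold map to "error".
    if response_ms is not None and response_ms > slow_ms:
        return "error"
    return None
```

For example, map_severity(503, False, 120) returns "critical", and map_severity(200, False, 2500) returns "error". A response of exactly 2,000 ms does not trigger an alert in this sketch, because the text says "exceeding".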
3. Validate Connection
Trigger a test alert from StatusPulse. Verify that PagerDuty acknowledges receipt of the payload within 300 ms. Check your service dashboard for the "StatusPulse Monitor" integration tile and confirm the status reads "Active".
Escalation Policies
Route Alerts to On-Call Teams
Define how StatusPulse incidents escalate through your PagerDuty schedules. Align monitoring thresholds with your team's SLA commitments and prevent alert fatigue.
First Line: Platform Engineering
Initial alerts route to the #platform-oncall Slack channel and trigger a PagerDuty assignment to the primary engineer. StatusPulse includes the affected hostname, the time of the last successful ping, and the error codes in the incident description for immediate triage.
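To give a sense of how these triage fields might travel to PagerDuty, here is a minimal sketch of a trigger event. The top-level fields (routing_key, event_action, payload with summary, source, severity, and custom_details) follow the public PagerDuty Events API v2 format. Whether StatusPulse sends exactly this shape is an assumption, and the function name and the keys inside custom_details are hypothetical.

```python
import json


def build_trigger_event(
    routing_key: str,
    hostname: str,
    last_successful_ping: str,
    error_code: str,
    severity: str = "critical",
) -> dict:
    """Build a PagerDuty Events API v2 style trigger event.

    Hypothetical helper: the custom_details keys are illustrative only.
    """
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        "payload": {
            "summary": f"{hostname} is failing uptime checks ({error_code})",
            "source": hostname,
            "severity": severity,
            # Triage details named in the text above.
            "custom_details": {
                "last_successful_ping": last_successful_ping,
                "error_code": error_code,
            },
        },
    }


if __name__ == "__main__":
    event = build_trigger_event(
        routing_key="YOUR_INTEGRATION_KEY",  # placeholder, not a real key
        hostname="api.example.com",
        last_successful_ping="2024-01-01T00:00:00Z",
        error_code="HTTP 503",
    )
    print(json.dumps(event, indent=2))
```

The sketch only builds the event body and does not send it, so it is safe to run without credentials.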
Second Line: SRE Management
If no one acknowledges the incident within 15 minutes, PagerDuty escalates it to the SRE lead schedule. While an incident is active, StatusPulse automatically suppresses duplicate alerts for the same endpoint to maintain signal clarity.
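The two behaviors above, the 15-minute acknowledgment window and per-endpoint duplicate suppression, can be modeled roughly as follows. This is a sketch under stated assumptions rather than how PagerDuty or StatusPulse work internally: escalation_due and IncidentTracker are hypothetical names, and the sketch treats "within 15 minutes" as escalating once exactly 15 minutes have elapsed.

```python
from datetime import datetime, timedelta

# Acknowledgment window from the text above.
ACK_TIMEOUT = timedelta(minutes=15)


def escalation_due(
    triggered_at: datetime,
    acknowledged: bool,
    now: datetime,
    timeout: timedelta = ACK_TIMEOUT,
) -> bool:
    """Return True when an unacknowledged incident has reached the timeout."""
    return (not acknowledged) and (now - triggered_at) >= timeout


class IncidentTracker:
    """Suppress repeat alerts for endpoints that already have an open incident."""

    def __init__(self) -> None:
        self._active: set[str] = set()

    def should_alert(self, endpoint: str) -> bool:
        # Only the first failure for an endpoint raises an alert.
        if endpoint in self._active:
            return False
        self._active.add(endpoint)
        return True

    def resolve(self, endpoint: str) -> None:
        # After resolution, the next failure alerts again.
        self._active.discard(endpoint)
```

With this model, a second failure on the same endpoint returns False from should_alert until resolve is called for that endpoint.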
Auto-Resolution & Runbooks
When 3 consecutive checks confirm that an endpoint is back up, StatusPulse sends a "Resolved" payload to PagerDuty. Attach runbook links directly in the integration settings to guide engineers through standard recovery procedures.
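The "3 consecutive checks" rule amounts to a streak counter that resets on any failure. The sketch below illustrates the idea; RecoveryDetector and record_check are hypothetical names, and signaling only at the moment the streak reaches the threshold (so the "Resolved" payload would be sent once) is an assumption.

```python
class RecoveryDetector:
    """Signal recovery after a run of consecutive successful checks."""

    def __init__(self, required_successes: int = 3) -> None:
        self.required = required_successes
        self._streak = 0

    def record_check(self, succeeded: bool) -> bool:
        """Record one check; return True only when the streak hits the threshold."""
        if not succeeded:
            # Any failure resets the streak.
            self._streak = 0
            return False
        self._streak += 1
        # True exactly once per streak, at the moment the threshold is reached.
        return self._streak == self.required


if __name__ == "__main__":
    detector = RecoveryDetector()
    for ok in [True, True, False, True, True, True]:
        if detector.record_check(ok):
            print("send Resolved payload")
```

In the usage loop, the failed third check resets the streak, so the resolve signal fires only on the sixth check.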