Back

[Remote] Site Reliability Engineer (SRE) - Azure | DevSecOps | IaC | Governance | Observability Job Details | Avaya

Worldwide Salaried Open

Note: The job is a remote job and is open to candidates in USA. Avaya is an enterprise software leader that helps the world’s largest organizations and government agencies forge unbreakable connections. They are seeking a Site Reliability Engineer (SRE) to drive stability, reliability, and performance across their Azure and GCP-based platforms, focusing on operational excellence and proactive incident management.

Responsibilities

  • Serve as a key member of the 24×7 on-call rotation, responding to and managing incidents across production and pre-production environments
  • Lead incident bridges, coordinate root cause analysis (RCA), and ensure post-incident reviews drive systemic improvements
  • Maintain clear communication with cross-functional teams and leadership during major incidents
  • Build, tune, and maintain observability dashboards (Azure Monitor, GCP Operations Suite, Prometheus, Grafana, Datadog, Log Analytics)
  • Perform deep-dive troubleshooting of application and service-level issues using distributed tracing and log analysis (Grafana, Datadog) to pinpoint root causes beyond infrastructure
  • Define SLOs, SLIs, and error budgets to proactively identify and mitigate reliability risks before customer impact
  • Integrate AI-Ops tools for anomaly detection, predictive alerting, and automated incident correlation
  • Continuously enhance alert quality, reduce false positives, and automate runbooks for faster recovery
  • Analyze trends to prevent recurring issues and support teams in resilience engineering

Skills

  • 5+ years in Site Reliability, DevOps, Cloud Operations, or Customer support roles
  • Demonstrated experience in application-level troubleshooting by analyzing logs and traces to identify bugs, performance bottlenecks, and error conditions
  • Expertise in Azure and GCP cloud operations and distributed system reliability
  • Understanding of Terraform, Ansible, and CI/CD pipelines (Jenkins, GitHub Actions)
  • Experience with observability and AI-Ops tools (Azure Monitor, GCP Operations Suite, Grafana, Prometheus, Datadog, etc.)
  • Solid grasp of incident management frameworks (P1–P3 handling, RCA, PIRs, on-call rotations)
  • Excellent analytical, troubleshooting, and communication skills

Benefits

  • Performance-related bonus
  • Annual bonus that aligns with individual and company performance
  • Benefits

Company Overview

  • Avaya is a global leader in enterprise communications, hybrid cloud CCaaS and UC solutions for mission-critical, AI-agnostic workflows. It was founded in 2000, and is headquartered in Morristown, New Jersey, USA, with a workforce of 5001-10000 employees. Its website is http://www.avaya.com.
  • Apply To This Job

    More jobs

    [Remote] Senior Amazon Data Analyst

    Worldwide Salaried

    [Remote] Account Executive

    Worldwide Salaried

    [Remote] DevOps / DevSecOps Engineer - Azure | GCP | IaC | Security | Automation Job Details | Avaya

    Worldwide Salaried

    [Remote] Engineering Technical Lead - I&C Embedded Software

    Worldwide Salaried

    [Remote] RCM Customer Success Manager

    Worldwide Salaried

    [Remote] Research Analyst - Robotics

    Worldwide Salaried

    [Remote] Product Counsel, ERISA & Operations

    Worldwide Salaried

    [Remote] Director Business Development-Biologics, West Coast U.S. Job Details | Syngene International Limited

    Worldwide Salaried

    [Remote] Senior Sales Executive, Core Growth - MLF (KC/STL)

    Worldwide Salaried

    [Remote] Sr Software Engineer, AI Experience

    Worldwide Salaried

    Experienced Part-Time Outlet Customer Service Associate – Flexible Bridgeton, MO (arenaflex Outlet)

    Worldwide Salaried

    Experienced Full Stack Customer Service Agent – Work from Home

    Worldwide Salaried

    Pharmacy Technician (Call Center)

    Worldwide Salaried

    Sales Associate

    Worldwide Salaried

    Cold Email Marketing Specialist - Remote Job

    Worldwide Salaried

    Servicing Travel Advisor III - Chase Travel (Re...

    Worldwide Salaried

    Desenvolvedor Java Sênior - Financial Services

    Worldwide Salaried

    Job Title: Experienced Customer Service Representative - First Notice of Loss (FNOL) Support Specialist - Work from Home Opportunity

    Worldwide Salaried

    Remote Customer Experience Representative

    Worldwide Salaried

    Weekend Remote Chat Support Specialist – Flexible 4‑8 Hour Shifts, $25‑$35/hr, Entry‑Level, No Phone Calls, Work Saturdays & Sundays Only

    Worldwide Salaried