[Remote] Site Reliability Engineer

Remote, USA | Full-time | Posted 2026-07-05

Note: This is a remote role open to candidates in the USA. reputed company is a trusted technology adviser and managed services provider that helps clients navigate change and innovation. It is seeking a Site Reliability Engineer to support a variety of data solutions and strengthen its SRE team, with opportunities for career progression as the division grows.

Responsibilities

  • Act as a technical escalation point for unresolved data platform issues in the SRE pod(s)
  • Monitor, maintain, and troubleshoot databases, data warehouses, and the underlying infrastructure
  • Collaborate with the data engineering team to ensure efficient data processing and transformation
  • Create and maintain accurate technical documentation in the form of operational runbooks
  • Implement standard pre-approved changes within the scope of our clients' Change Management Process (e.g. new users)
  • Use reputed company's helpdesk and work tracking systems to maintain logs of support requests and incidents, and improve these processes, both technically and through stakeholder management
  • Participate in the vulnerability management process (vulnerabilities in code, infrastructure, and dependencies) and proactively mitigate risks, in line with both reputed company's and our clients' compliance objectives
  • Engaging with suppliers and 3rd parties for support, requests and opportunities, and managing the relationship so that our clients get the best value for their service
  • Troubleshooting issues and identifying systemic failings indicated by incidents/failures
  • Implementing fixes and features
  • Proposing solutions for reducing toil
  • Implementing and refining automation for incident and service request resolution
  • Providing leadership in the incident resolution process, including creating and maintaining documentation, and leading post-mortem analysis and mitigation planning
  • Designing and reinforcing Service Request and Change Management processes (both technically and through stakeholder management), and improving existing processes
  • Own and enhance the vulnerability management process (vulnerabilities in code, infrastructure, and dependencies), and proactively mitigate risks through it
  • Leading discussions with multiple clients in client-facing meetings around the SRE process, identifying areas for increasing the SRE footprint and opportunities for small works and consultancy
  • Engaging with suppliers and 3rd parties for support, requests and opportunities
  • Identifying cross-sell and cross-pollination opportunities within the wider organisation

Skills

  • IaC tooling (preferably Terraform, or ARM/Bicep and CloudFormation)
  • Core CI/CD tooling (Azure DevOps, GitHub Actions or similar)
  • Monitoring tooling (Splunk, New Relic, Azure Monitor, AWS CloudWatch)
  • Demonstrable experience in multiple core technologies (.NET, Java, AI/Data Engineering, Golang)
  • Cloud provider (AWS, Azure, GCP) 'DevOps Engineer'-level certification and CKAD certification are highly beneficial, or must be obtained during the probationary period

Benefits

  • Competitive salary with uncapped commission
  • The ability to work from a range of flexible locations
  • Prestigious sales and broader team recognition with the annual President's Club
  • Starting with 27 days annual leave (plus bank holidays) – accruing to 30
  • Half-day leave on your birthday
  • Sabbatical options at 5 & 10 years' service
  • 5 days study leave
  • Generous company pension
  • Private healthcare for you and your family
  • Payroll giving
  • Enhanced paternity and maternity leave
  • Equity appreciation program incentive plan
  • Life and income protection
  • Additional perks such as discounted gym memberships, cycle scheme, EAP and more!

Company Overview

  • reputed company delivers managed IT services to optimize mainframes and infrastructure for clients. It was founded in 1969 and is headquartered in Chicago, Illinois, USA, with a workforce of 1,001 to 5,000 employees. Its website is https://www.reputed company.com.

Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 4 in 2026, 27 in 2025, 20 in 2024, 16 in 2023, 19 in 2022, 19 in 2021, 7 in 2020. Please note that this does not guarantee sponsorship for this specific role.