via indeed · 3 June 2026 ·3 days ago

Senior Site Reliability Engineer

NatWest Group
Manchester Full-time Remote
417 more jobs in Manchester.
Upload your CV and see which ones actually match you.
Upload CV

Closing date for applications: 03/06/2026

Location Manchester, United Kingdom

Salary £55,480 – £83,220

Job typePermanent \| Contract typeFull Time

Mostly Remote

You’ll spend most of your time at home, working with your team digitally. You’ll come into an office or hub at least twice a month to collaborate with your colleagues.

Managerial / Technical Lead

This is a general indication and doesn’t always reflect day\-to\-day responsibilities. Check the job description for full details.

\#R\-00278949

Join our digital revolution in NatWest Digital X
----------------------------------------------------

In everything we do, we work to one aim. To make digital experiences which are effortless and secure.

So we organise ourselves around three principles: engineer, protect, and operate. We engineer simple solutions, we protect our customers, and we operate smarter.

Job description
-------------------

This role is based in the United Kingdom and as such all normal working days must be carried out in the United Kingdom.

Join us as a Senior Site Reliability Engineer

  • In this key role, you’ll improve and drive the availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning for our products and services

  • You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to delivering change in a safe and secure way

  • This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development

  • You’ll need to have the flexibility to support the team by working shifts and weekends on rotation
What you'll do
------------------

As a Senior Site Reliability Engineer, you’ll act as a hands\-on expert responsible for ensuring the reliability, availability, and performance of critical production platforms. You’ll lead the adoption of Site Reliability Engineering (SRE) practices, embedding resilience, observability, and operational excellence into distributed systems running on AWS and Kubernetes. You’ll also take ownership of 24/7 production support models, ensuring systems are highly available and that incidents are effectively managed and learned from.

We’ll expect you as well to design and operate highly resilient AWS\-based Kubernetes platforms (EKS) aligned with enterprise standards while owning and continuously improving production reliability, availability, and Service Level Agreement or Service Level Objective (SLA/SLO) frameworks. You’ll lead incident management, escalation, and 24/7 on\-call practices, including post\-incident reviews, and embed SRE principles such as error budgets, toil reduction, and reliability engineering into delivery teams. Furthermore, you’ll implement infrastructure and platform automation using Terraform and GitOps methodologies and drive self\-healing, auto\-scaling, and failure recovery mechanisms using tools such as Karpenter.

In addition to this, you’ll be:

  • Building secure and scalable networking and service communication such as Cilium

  • Defining and operating observability platforms using Grafana, Prometheus, Loki, and Tempo

  • Partnering with DevOps and engineering teams to ensure production readiness and operational excellence

  • Leading complex troubleshooting across distributed systems and cloud\-native environments

  • Developing reusable “golden paths,” operational runbooks, and reliability patterns

  • Ensuring platforms meet regulatory, security, and operational risk requirements

  • Using data, Service Level Indicators (SLIs), and metrics to drive continuous improvement and proactive reliability enhancements
The skills you'll need
--------------------------

We’re looking for a highly experienced Site Reliability Engineer with a strong background in operating large\-scale, business\-critical platforms and a passion for reliability engineering. You must also have deep expertise in managing production systems on AWS and Kubernetes (EKS), along with strong experience in 24/7 support models, incident management, and on\-call leadership.

Moreover, you’ll need to demonstrate advanced knowledge of SRE principles such as SLIs, SLOs, error budgets, and toil reduction, as well as proficiency in Terraform, GitOps, and cloud automation practices. Hands\-on experience with GitLab continuous integration and continuous delivery pipelines and Argo CD is also essential.

In addition, you’ll have to bring:

  • A strong understanding of Kubernetes networking, security, and service mesh technologies, ideally using Cilium

  • Experience scaling infrastructure using Karpenter and auto\-scaling strategies

  • Expertise in observability tooling, including Grafana, Prometheus, Loki and Tempo

  • A proven ability to troubleshoot and resolve complex, cross\-system production issues

  • Experience operating in regulated or high\-security environments

  • Strong leadership, mentoring, and stakeholder engagement capabilities

  • The ability to balance reliability, risk, and delivery in a fast\-paced environment
Your benefits breakdown
---------------------------

Here’s a quick look at what your pay package and annual leave could look like if you get accepted for the role. We have a wide range of benefits to support you in your working life and beyond.

Please note, the presented benefits packages are based on the minimum base salary.

Tap each segment for details.

Your pay package

£

Your leave allowance

30 days Annual leave You’ll also have the opportunity to purchase up to 5 additional days off.

3 days Volunteering Take time off to support the causes you’re passionate about.

3 days Training Take time to build the skills you need to grow your career.

Total rewards package

Welcome to our Spinningfields hub
-------------------------------------

Spinningfields is Manchester’s leading leisure, retail, and business district and home to some of the most incredible architecture, restaurants, flagship stores, and bars the city has to offer.

Key facts:


  • Space for 1,345 colleagues from various teams across the bank

  • Our outdoor roof terrace has panoramic views of the city

  • Opened in 2023

Our tech stack

Here’s just some of the technologies we use.

Front end


  • JavaScript

  • ReactJS

  • AngularJS

Back end


  • Python

  • Java

  • Microsoft Dynamics

DevOps


  • AWS

  • GitLab

  • Google Cloud Platform

Data


  • Kafka

  • Hadoop

  • PostgreSQL

  • Snowflake

The market for this type of role

Similar openings
417
Engineering roles in Manchester
Full-time
80%
of Engineering roles in the UK
Remote possible
8%
of Engineering roles
NatWest Group

85 open positions · Amsterdam, Birmingham, Bristol, Cambridge, Cardiff +13

📊 Engineering · the UK
6,505
active jobs
14.5%
Remote
Ø 2d
avg. online
Top skills in demand
ExcelERPISOPythonAWSCI/CDSQLAzureAgileLean

Frequently asked questions

How many Engineering jobs are available in Manchester?
Currently 417 Engineering roles in Manchester on AlmostHired, across 139 different companies. Our data is updated daily.
Do Engineering roles offer remote work?
8% of Engineering roles in the UK allow remote work, either partial or full. To filter specifically for remote positions, use AlmostHired.
How do I know if I match this role?
Upload your CV — our AI compares your profile to the job requirements and gives you a precise match score, with matching and missing skills.