DevOps Site Reliability Engineering (SRE) - HO - SEO
Government Digital & Data -
Job summary
The Reliability Enablement team helps Data Services & Analytics (DSA) teams improve their product and service reliability by providing observability and embedding Site Reliability Engineering (SRE) principles. You will be a key part of the team, working on engagements with product teams and helping grow SRE culture within the organisation.
Job description
The DevOps (SRE) is responsible for improving the reliability of our platforms and services. Your role is proactive, ensuring relevant metrics are being measured and reliability improvements are identified and implemented when necessary. This will ensure the reliability and availability of services for users.
You will also advise developers on how to use platforms and tools effectively, reviewing and advising on their use of CI/CD pipelines and observability tooling. You may also work to deliver new platform tooling.
Recruitment events
We are hosting an Engineering online recruitment event on Thursday 6th February 2025 from 12:00pm to 1:00pm. Where you can find out more about our roles, working for the organisation and how to apply. Register your interest here: Home Office Events I Eventbrite
Tools and Technologies we use:
We are keen for Engineers to continue learning new technologies, we have a large range in the Home Office including:
- Backend: Java, Node.js, C#, Python, PHP, Scala, Power Platform
- Frontend: React, JavaScript, Typescript, Angular
- Data: PostgreSQL, Microsoft SQL Server, MongoDB, Apache Kafka, Neo4J, Amazon Athena
- DevOps: AWS, Kubernetes, Azure, Jenkins, Docker, Ansible, Terraform, Dynatrace
What you will do
Your main day to day responsibilities will be:
- supporting teams to effectively build, improve and deploy reliable and secure services
- building new or improved shared tooling to help teams automate and maximise reliability
- spotting instances where teams are not using best practice and advising on how to improve
- supporting engineers to design new services; helping to define test and deployment pipelines
- helping teams improve their integration approaches; increasing reliability and the value delivered to users
Like many organisations we need to maintain our services 24/7, therefore, on occasions there may be a requirement to work out of hours, for which you will be paid an additional allowance.
Person specification
UK residency and security requirements - You need to have lived in the UK for the past 5 years.
Essential Criteria
As a DevOps (SRE), you will have experience of:
- Designing and implementing reliable cloud solutions using AWS or Azure according to best practices. (Software design - SWDN)
- Implementing automated testing, scanning and code analysis tooling, according to best practices. (Testing - TEST)
- Implementing and using application monitoring tooling to identify and respond to problems early. (Application support - ASUP)
- Designing, coding, testing, maintaining and documenting scripts and infrastructure-as-code definitions to automate build and deployment activities. (Programming/software development - PROG)
- Implementing and promoting use of CI/CD pipelines according to best practices. (Systems integration and build - SINT)
- Implementing data management best practices for cloud resources, such as naming, tagging, metadata, backups, and documentation. (Data management - DATM)
SFIA capability framework
Skills for the Information Age (SFIA) is the technical framework that sets the standard capability and development of all engineering levels in the Home Office. This is a link to the capability framework: All skills A–Z — English (sfia-online.org)
We use set SFIA technical skills to form our interview questions and we will assess you against these technical skills during the selection process.
SFIA levels of responsibility – Use the SFIA Levels of responsibility to understand what would be expected for each Technical Skill listed below.
SFIA Technical Skills
The essential technical skills listed above are reflective of the Home Office Government Digital and Data Profession Career Framework. Please see below for the relevant skills required for your role.
Behaviours
We'll assess you against these behaviours during the selection process:
- Changing and Improving
Technical skills
We'll assess you against these technical skills during the selection process:
- Software design (SWDN) - Level 3
- Programming/Software development (PROG) - Level 3
- Testing (TEST) - Level 3
- Systems integration and build (SINT) - Level 3
- Data management (DATM) - Level 3
- Application support (ASUP) - Level 3