Skip to main content

Senior Site Reliability Engineer

This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board.

About the Role:

Responsible for overseeing the Site Reliability Engineering team and ensuring the reliability, performance, and scalability of an organization's systems and services. Key responsibilities include system reliability and performance management, incident management and response, automation and tooling, performance monitoring and optimization, collaboration and communication, continuous improvement, risk management and disaster recovery planning, and stakeholder engagement. Plays a critical role in maintaining the stability and resilience of systems, driving efficiency through automation, and fostering a culture of continuous improvement within the team and the organization as a whole. This role will report into Manager -Digital Delivery

You Will:

  • Promotes and drives automation initiatives to improve operational efficiency and reduce manual toil. Identify opportunities for automation, develop and implement automation tools, scripts, and frameworks, and advocate for the use of monitoring, deployment, and orchestration tools that streamline operations and enhance reliability.
  • Promotes and drives automation initiatives to improve operational efficiency and reduce manual toil. Identify opportunities for automation, develop and implement automation tools, scripts, and frameworks, and advocate for the use of monitoring, deployment, and orchestration tools that streamline operations and enhance reliability.
  • Plays a key role in incident management and response. Lead the team in resolving critical incidents, coordinating with relevant stakeholders, and driving post-incident reviews to identify root causes and implement preventive measures.
  • Establish incident response processes and ensure adherence to incident management protocols. Assist in day-to-day production/non-production issues, act as escalation point to resolve urgent and/or complex issues and manage expectations.
  • Respond to customer requests, troubleshooting, reported problems pertaining to the applications performance and reliability.
  • Contribute to training materials and/or instructions for end-users. Should be able to apply the understanding of DevOps and software engineering best practices to influence design and implementation approaches and solutions wherever applicable.
  • Provides operational readiness through the engineering, planning, coordination, and execution of performance and tuning analysis, systems support, incident and problem resolution, software configuration and system/features upgrades.
  • Identify and mitigate risks related to web development, infrastructure, and security vulnerabilities.
  • Implement security measures and best practices to protect web applications, data, and infrastructure.
  • Ensure compliance with relevant security standards and regulations.
  • Understand how the organization's applications interact with different systems and business processes to ensure they operate smoothly. Research and remain informed of new technology and development tools.

You Have:

  • 7 years' experience in advanced (e.g. Tier 2 or Tier 3) software support and development.
  • Bachelor's Degree in Computer Science or Information Technology or related field.
  • Experience working on Zuora, CommerceTools, Okta, ShipStation, Zeplin, Zoominfo, etc.
  • Experience in project management methodologies, such as Agile or Scrum, can be beneficial. Familiarity with project management tools and practices, including resource allocation, timeline management, and risk mitigation, can enhance this role's ability to manage projects effectively.
  • Relevant industry certifications, such as Certified Site Reliability Engineer (CSRE) or certifications in cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer), can be advantageous.
  • Familiarity with specific vendor technologies and tools commonly used in the organization's tech stack can be preferred. This could include experience with specific cloud platforms, monitoring and observability tools, deployment and orchestration tools, and incident management platforms.
  • Experience working across multiple domains or industries can be beneficial. Exposure to different technology stacks, diverse operational environments, or various business domains can provide a broader perspective and enable this role to bring innovative ideas and best practices from different contexts.

How We Support You:

We provide flexibility to help you achieve a good work-life balance. You'll be part of a global, diverse team who foster an environment of inclusion and belonging where you are valued for who you are and where you come from.

We offer benefit options in and out of the workplace, including healthcare, retirement, paid time-off, parental leave, an employee assistance program. We provide resources that support your mental health, and evolve our offerings to meet your needs. We care about our employees' welfare and focus our benefits package on the benefits which support your wellbeing. We also recognize that everyone has different priorities, so in addition to our core benefits to support your health we offer flexible options for you to choose benefits that are right for you, your family and your lifestyle.

We believe in non-stop learning and are committed to investing in learning opportunities that help you reach your full potential and support your continued development.

About Us:

At The Association, a Great Place to Work-Certified company, we are transforming the accounting and finance profession. We are future-focused, empowering the world's most accomplished accountants to stay relevant, meet today's demands, and prepare for tomorrow's challenges through quality education, resources, and training.

Learn more about The Association on LinkedIn and our Career Site.

#LI-Remote #GreatPlacetoWork

We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal opportunity workplace.

Senior Site Reliability Engineer

aicpa
London, UK
Full-Time

Published on 21/06/2024

Share this job now