Skip to main content

Software Engineering Manager, SRE, Cloud Incident Response

This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board.

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C, C , Java, JavaScript).
  • 3 years of experience in a technical leadership role; overseeing projects, with 2 years of experience in a people management, supervision/team leadership role.
  • Experience with Cloud Services, Telemetry Systems and Incident Response.
  • Ability to work non-standard hours in on-call rotation.

Preferred qualifications:

  • Master's degree or PhD in Computer Science, or a related technical field.
  • Experience in a customer focused role or as a Cloud Customer.
  • Ability to work across organizational boundaries.

About the job

Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have not only the technical expertise to take on and provide technical leadership to major projects, but also manage a team of Engineers. You not only optimize your own code but make sure Engineers are able to optimize theirs. As a Software Engineering Manager you manage your project goals, contribute to product strategy and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started -- and as a manager, you guide the way.

With technical and leadership expertise, you manage engineers across multiple teams and locations, a large product budget and oversee the deployment of large-scale projects across multiple sites internationally.

The Cloud Incident Response team has a mission to create a dependable experience for GCP customers. The team responds to and helps coordinate, mitigate, or resolve major incidents across all of GCP. The core team builds the processes, systems, and tooling necessary to deliver customer focused mitigations to critical GCP incidents. This will involve working on tooling, systems, and processes.

Google Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Participate in on-call rotation supporting Critical Incident Response.
  • Focus on high-quality customer outcomes and continuous collaboration across GCP teams.
  • Create training and processes for incident management life-cycle and partner with Cloud SRE UTLs, and Cloud Support leadership team.
  • Build systems and tooling to support the team, improve visibility, detection of issues, communications to customers, stakeholders, and customer facing teams.
  • Define and escalate risks in Cloud, reduce incident probabilities with strategic and tactical/pragmatic approaches as needed.

Software Engineering Manager, SRE, Cloud Incident Response

Google
London, UK
Full-Time

Published on 10/05/2025

Share this job now