Senior Infrastructure Engineer
This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board.
Senior Infrastructure Engineer
Department: Engineering
Employment Type: Full Time
Location: Remote
Reporting To: Chief Technology Officer
Compensation: GBP 80,000 - GBP 90,000 / year
Description
Hi I'm Bill, CTO at Pinpoint.
We're a high-growth HR tech company building and selling software that helps in-house recruitment teams attract, hire, and onboard the right talent. Today, we have a strong foundation in place, with a mature product, rapid growth, strong product-market fit, and happy customers.
As we grow, expectations around stability, performance, and reliability continue to increase. Our platform supports mission-critical workflows for our customers, and maintaining trust is central to how we operate. As the business scales, we're investing further in infrastructure ownership to ensure we stay ahead of demand and continue to deliver a stable, predictable experience.
We're hiring a Senior Infrastructure Engineer to raise the reliability bar across our platform.
As the second member of our dedicated infrastructure team, you'll operate as a senior individual contributor with meaningful autonomy. You'll work across the full surface area of our environment and own problems end-to-end, partnering closely with our Lead Infrastructure Engineer, CTO, and senior developers to improve reliability across the organisation.
The fine print (but a bit more exciting):
- This is a remote-first role based in the UK or Poland, with occasional in-person team meetups. Our HQ is in Jersey (UK), and our ~90-person team is spread across the UK, US, and EU
- Our infrastructure team is intentionally small. You'll be expected to operate independently as a subject matter expert. There isn't a large platform team to lean on
- Our tech stack is pragmatic and maintainable rather than trend-driven. We run a monolithic Ruby on Rails application with a React frontend, deployed and managed via Cloud66. We use GitHub and GitHub Actions for CI, Terraform (OpenTofu) for infrastructure as code, and Datadog for monitoring and observability
- This is not a greenfield experimentation role. It's about making a mature, revenue-generating SaaS platform more stable, predictable, and scalable
- You'll participate in an on-call rotation (typically ~3 days per week out of hours). Incidents are infrequent, but availability during your rotation is required
- Our values actually matter here. We hire people who reflect them in how they work, collaborate, and make decisions
About the Role:
- Improve monitoring and alerting across infrastructure and application layers
- Diagnose and reduce production instability, including load spikes and database bottlenecks
- Strengthen our use of Datadog, particularly logging quality and alert signal-to-noise ratio
- Improve capacity planning through testing, monitoring, and forecasting
- Ensure new features ship with appropriate production metrics and reliability safeguards
- Make pragmatic, risk-aware infrastructure decisions that prioritise stability and customer impact
- Implement and maintain best practices across infrastructure security, compliance, and vulnerability management
- Participate in on-call rotations, incident response, and post-incident analysis
- Maintain clear and up-to-date infrastructure documentation
- Improve our CI/CD pipeline and overall infrastructure performance
What Success Looks Like:
- Fewer surprise production incidents
- Faster diagnosis and recovery when issues occur
- Clear, actionable dashboards and alerts
- More predictable performance under load
- Infrastructure changes that feel controlled and low-risk
- Increased engineering confidence when shipping new features
About You:
- 4+ years of hands-on experience in infrastructure, DevOps, platform, or site reliability roles
- Experience maintaining the reliability and scalability of a production SaaS application
- Experience participating in on-call rotations and incident response
- Strong working knowledge of infrastructure as code (e.g. Terraform)
- Experience with monitoring, metrics visualisation, and alerting platforms
- Demonstrated ability to balance speed and experimentation with stability and operational risk
- Strong problem-solving ability and sound technical judgment in ambiguous environments
- Clear communicator who documents decisions and contributes to thoughtful post-incident reviews
- Comfortable getting close to the codebase when needed
Huge plus if you have:
- Experience optimising MySQL, PostgreSQL, or Redis.
- Deep familiarity with AWS.
- Advanced Datadog experience.
- Experience working within SOC2 or ISO27001 environments.
- Familiarity with Ruby or Ruby on Rails applications.
What We Offer:
We want Pinpoint to be the best place you've ever worked-somewhere you feel valued, supported, and excited to grow. Here's what you'll get:
- Comprehensive healthcare - Excellent medical, dental, & vision coverage for you and your family
- Unlimited holidays - Take the time you need to rest and recharge
- Mental health support - Unlimited, immediate access to professional counseling via Spill
- Retirement contributions - 401k or pension contributions depending on your location
- Remote-first - Work where you're most productive, with flexibility and trust as the default
- Equity with real upside - Share in the long-term value you help create
- Fully paid parental leave - Up to 16 weeks of paid leave for new parents
- Learning budget - Annual funds for courses, books, or anything that supports your growth
A detailed overview of our benefits can be found here