Platform Engineer wanted for our multi-regional, scalable cloud platform

With Beamery in London - GB

More jobs from Beamery

Posted on June 11, 2019

About this job

Compensation: £70k - 125k | Equity
Job type: Full-time
Experience level: Senior, Lead, Manager
Role: System Administrator
Industry: CRM, HR Services, SaaS
Company size: 51-200 people
Company type: VC Funded

Technologies

google-cloud-platform, linux, go, kubernetes, docker

Job description

Senior Platform Engineer

Role Ownership and Key Objectives

  • Ownership of mission-critical production multi-tenanted SaaS operations: on-rota, monitoring, alerting, configuration and change management, incident management and disaster recovery
  • Ownership of service scalability, elasticity, fault-tolerance and disaster recovery
  • Continuously improving SLOs such as availability, performance and recoverability
  • Elimination of operational toil & engineering knowledge silos
  • Ownership of production incidents post-mortems advocating blameless culture
  • Key contributor to service release management

Key Success Metrics

  • Service availability, performance and recoverability SLOs
  • Service release failure rates
  • Time spent during on-rota on production issues

Must Haves Skills & Experience

  • Previous job experience as a Site Reliability Engineer or DevOps Engineer
  • Experience with building out multi-regional, scalable cloud platforms preferably on GCP
  • Excellent understanding of DNS, cloud networking and infrastructure
  • Excellent debugging and analytical skills: ability to isolate root cause across networking/infrastructure, application and database stacks
  • Excellent understanding of Linux, Docker and Kubernetes (or similar container orchestration tool)
  • Experience with configuration management tooling such as Consul, Zookeeper and etcd
  • Experience with Infrastructure-as-Code tooling (i.e. Terraform/Puppet/Chef)
  • Experience with languages such as Go, Node.js and Bash scripting

Bonus Skills & Experience

  • Operational management of message brokers at scale (i.e. Kafka, RabbitMQ etc)
  • Operational management of NoSQL/in-memory databases at scale
  • Running ETL type workloads  at scale
  • Experience with service mesh technologies (i.e. Istio, Consul, Linkerd)

Do you?

  • Have a systematic problem solving approach, coupled with a strong sense of ownership and drive?
  • Have a history in automating operations processes via services and tools?
  • Never settle for downtime and outages, don’t like to be woken up in the middle of the night
  • Feel comfortable balancing technical direction with the business value needed factoring into solutions multiple constraints such as scale, distribution, costs, users, patterns?
  • Advocate for iterative software delivery and world class engineering, demonstrating best practice?

Apply here