The Twilio platform enables companies to integrate communications directly into their applications via simple cloud API’s and with on-demand global reach. Twilio is therefore challenged with abstracting away a world of complexity so that our customers can go global without concern for managing a global communications network, carrier integrations and relationships throughout the world. About the job: Unlike traditional operations groups, Twilio’s Site Reliability Engineers are expected to be full-fledged programmers, and are considered peers of the Software Engineering team. Candidates should have finite patience for repetitive work, and have a natural inclination to automate it. Working under Twilio’s Messaging organization, you will be responsible for an SMS and MMS platform that touches millions of people every day. Together with our developers and product teams, you will find a way to keep production running smoothly while still allowing for big new features to be safely and quickly launched. Responsibilities:

  • Own, operate, and maintain your team’s services in a distributed production environment. Employ Agile methodologies to continuously deliver value to customers without jeopardizing reliability.
  • Write and adjust realistic SLAs and SLOs for availability, throughput, latency, and data integrity.
  • Coordinate monitoring, reporting, and alerting based on these numbers, and when things inevitably start to go wrong, take a lead role in our response and postmortem processes.
  • Collaborate with Software Engineers, Product Managers, Architects and Engineering Management.
  • Release, monitor, and maintain features in a responsible, reliable way as part of our integrated DevOps practices.
  • Write actionable runbooks and error messages, and insist that the rest of the team does, too.
  • Excel as an engineer and be a productive member of the team where leadership is a behavioral trait, not a title. Requirements:

  • 5+ years of software engineering or operational experience (ideally, a blend of the two) in a large-scale, highly-available, uptime-critical production environment.
  • Understanding of Unix systems internals.
  • Knowledge and understanding of network theory, such as different protocols (TCP/IP, UDP, ICMP, etc), MAC addresses, IP packets, DNS, OSI layers, and load balancing).
  • Have been an active participant in agile processes, continuous integration and test automation.
  • Systematic problem solving approach, coupled with a strong sense of ownership and drive. Telephony experience is not required; you will learn. We are seeking engineers who have solved complex problems at scale and have the aptitude to navigate uncharted waters. About us: Twilio makes communications easy and powerful. With Twilio’s platform, businesses can make communications relevant and contextual by embedding real-time communication and authentication capabilities directly into their software applications. Twilio gives businesses the ability to innovate, prototype, create, and connect with their customers at the right time and in the right way. Founded in 2008, Twilio is privately held and based in San Francisco, California with other offices around the world. Twilio is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal opportunity regardless of race, color, ancestry, religion, gender, gender identity, parental or pregnancy status, national origin, sexual orientation, age, citizenship, marital status, disability, or Veteran status and operate in compliance with the San Francisco Fair Chance Ordinance.

Desired Skills and Experience

See application page for details