In this role, you will work on the Site Reliability team as a manager and contributor, on a team that encompasses everything from datacenter operations all the way to developer tooling. We manage a fleet of hundreds of cloud and hardware servers that span multiple datacenters, multiple cloud providers and multiple geographic regions. Our small team builds the automation and tooling that our feature teams use everyday to ship great products for our customers.

Part of this will be supporting and developing other members of the team - conducting regular check-ins, providing career and professional support, coaching and advising, conducting performance reviews, etc.  While you should be excited about building the infrastructure that powers Sprout, we want you to be equally excited about the trajectory of your team and your part in forming them as stellar site reliability engineers.

We’re looking for smart, creative, driven people who want to help Sprout continue to run lean and mean, and at a bigger scale each day. Sprout has a global infrastructure footprint, serving customers from all over the world. Our platform is used by more than 17,000 brands across over 100 countries. Companies like Microsoft, Zipcar, Hyatt, Google, and Zendesk rely on Sprout to create stronger relationships with their customers through social media. We need you to help us spread Sprout operations to all corners of the globe.

Responsibilities

Desired Skills and Experience

  • Help our multi-region cloud footprint easily scale
  • Design and implement server and datacenter automation
  • Build Java and Python libraries for feature teams
  • Build and deploy services with Chef
  • Orchestrate cloud resource provisioning with Terraform
  • Keep our fleet of Cassandra clusters running with low latency and high throughput
  • Help scale up our MySQL database fleet
  • In addition to engineering responsibility, you will also have the managerial responsibility to support the careers of fellow site reliability engineers
  • Bring vision to the team in the form of new technology recommendations, ideas and approaches
  • Drive your team to build in Sprout’s best interests by being cognizant of business requirements, market need, customer pains, etc.
  • Support and enhance the Sprout Social culture
  • Minimum of 5 years of operational software development experience at scale
  • Minimum of 2 years of management experience in a software environment, preferably a high-growth, SaaS company
  • Demonstrable  leadership, project & logistics management, and people mentoring skills
  • Previous experience in fostering and growing a strong engineering culture in an agile and devops environment
  • Smart, a very quick learner and gets things done
  • Demonstrated experience working with large cloud deployments
  • Very clear written and verbal communicator
  • Experience with multiple cloud providers including Rackspace and AWS
  • Strong applied knowledge of concurrency, I/O and distributed systems
  • Heavy SQL and NoSQL database experience
  • Experience designing large-scale automation systems
  • Insurance and benefit options that are built for both individuals and families, including generous company contributions
  • Progressive benefit programs, like our parental leave program and free Divvy bike memberships
  • High-quality and well-maintained equipment - your computer will never prevent you from doing your best
  • Beautiful, convenient and state-of-the-art offices in Chicago’s Loop
  • Solid programs in some of the staples: transportation, disability and life insurance
  • Wellness initiatives to ensure both health and financial well-being of our team
  • Breakfast and lunch options onsite, and more healthy snack options and beverages than you can imagine
  • Growing corporate social responsibility program that is driven by the involvement and passion of our team members