Site Reliability/Infrastructure Engineer
Role Description
Do you want to build, maintain, and automate reliable systems? Do you enjoy working with other smart people to identify and fix tough application problems? Does the thought of testing and tweaking until you are sure you have squeezed out every last drop of performance bring a smile to your face? Do you find yourself wishing you could script every repetitive task in life, not just systems-related tasks?
TSheets provides real-time SaaS services to hundreds of thousands of users across multiple data centers, and our infrastructure team keeps us on target for five-nines availability. Whether it is improving our cross-data-center replication, troubleshooting changes in system behavior, optimizing to get more throughput per IOPS, automating processes, or providing cross-platform expertise to support teams, an infrastructure engineer is usually helping, if not leading.
We are looking for someone to hack on the back-end pieces of our core user-facing applications and infrastructure. This person will also join our infrastructure on-call rotation after a training period.
Beyond the technology challenges, you will find a tight-knit team whose members support each other in every way they can. Joining our team means making meaningful contributions and quickly seeing those contributions come to fruition. Check out our careers page and our company page for a glimpse of our company culture!
What You Will Do
- Improve site reliability by monitoring and responding to operational issues at all layers of the OSI model, and by identifying underlying patterns in these issues
- Build new automation systems and tools, with the goals of increased reliability, reduced human effort (and human error), and better visibility into key performance metrics
- Work across functions (engineering, marketing, finance, and customer experience) to support internal business needs when they arise
- Provide support for troubleshooting complex issues affecting internal and/or external services
- Work with architects, system engineers, and external vendors to design and provision new services, and maintain existing services
Requirements
- Bachelor’s degree (or equivalent experience) in Computer Science, Engineering, or a related field
- 4+ years of experience with at least two web application stacks, preferably in both WAN and LAN environments
- Deep understanding of Linux internals and scripting languages, along with higher-level tools for automation of server and systems-related tasks
- Strong familiarity with network sockets and protocols; knowledge of routing protocols a plus
- Experience with operations and troubleshooting in distributed production environments; knowledge of configuration management systems a plus
Desired Skills and Experience
See application page for details