Senior Systems Reliability Engineer (SRE)
Our client’s systems are fast and reliable and we’re the team that makes that possible. We build middleware * the software infrastructure designed for creating large-scale, fault-tolerant applications that run on thousands of machines throughout the world. What we build is used by both engineers and clients. Our complex infrastructure uses a variety of programming paradigms such as RPC, publish/subscribe and message queues. Building this infrastructure requires top engineers. We are two dozen C++ programmers with experience in designing network protocols and large-scale software architecture.
What’s in it for you:
As a Systems Reliability Engineer (SRE) working on this critical infrastructure, your mission will be to take responsibility of deployments and ensure reliability. You will focus on automating everything from build and deployment to reaction and remediation to outages. You will be part of a larger SRE organization aimed at supporting the company’s API. You will work on all aspects of this end-to-end system. This will require a wide range of skills, some of which you can learn on the job, such as time series databases, statistical analysis, web-based UIs, RESTful services and new programming languages.
We’ll trust you to:
- Take responsibility for deployment after Beta for messaging and multicast services
- Ensure level 1 support for production issues
- Automate everything from reaction to outages to quality checks for new builds
-
Provide feedback to developers to make this infrastructure increasingly resilient You need to have:
- Proven experience as a software engineer or developer working on high availability, large-scale distributed applications
- Excellent programming skills (you don’t need to know C++ or Java, although they are a plus, but you do need to be a great programmer in other programming languages such as Python, Ruby, Perl, Scala or JavaScript)
- A strong understanding of the UNIX/Linux command line
- A passion for performance excellence and an engineering mindset
- Previous experience with data, statistics and latency numbers
-
A Bachelor’s degree in Computer Science or equivalent experience We’d love to see:
- Strong leadership skills
- Prior experience as a systems performance or site/systems reliability engineer
- Extensive experience working with fault-tolerant approaches in a large-scale distributed environment with high performance systems
- Expertise analyzing and troubleshooting large-scale distributed systems
- A deep understanding of Internet and networking protocols, including IP multicast (PGM)
- Knowledge of network analysis and performance and application issues using standard tools (Tcpdump or Wireshark)
- A strong understanding of the software development lifecycle, as well as tools such as Git, Cmake, Jenkins, RPM or DPKG, Chef or Puppet
- Experience with virtualization and Infrastructure as a Service models
- The ability to handle periodic on-call duties as well as out-of-band requests
Join our Client’s Team:
You will be working for the foremost provider of financial data, news and analysis in the world. Our client is the most trusted source of information for businesses and professionals. The company combines innovative technology with unmatched analytic, data, news, display and distribution capabilities, to deliver critical information via their proprietary service and multimedia platforms. Their media services cover the world with more than 2,200 news and multimedia professionals at 146 bureaus in 72 countries.
The culture is fast-paced, high energy and entrepreneurial. They have an open office environment that encourages interaction, communication, collaboration and teamwork. Their 24-hour food courts, giant tropical fish tanks and world-class art installations provide additional flourishes to a workplace that is as dynamic and exciting as the financial markets themselves.
Their technologists are among the best and the brightest, and come from many different industries, financial experience is not a requirement. Their technology environment primarily consists of C/C++ and UNIX/Linux.
Desired Skills and Experience
See application page for details