THE ROLE

Users at customer sites from Washington, DC to Tokyo rely on Palantir’s high availability to pursue their missions. As a Site Reliability Engineer (SRE), you will work collaboratively with multiple partners to build tools that ensure this software remains online for all the people who rely on it. You come from a systems or development background (or are comfortable in both areas). You are self-directed and can track solutions from design through implementation, owning every step along the way.

You won’t be alone in this endeavor: SREs go to customer sites and work side by side with Palantir’s implementation teams and customer IT departments to understand their unique problems and develop innovative solutions. We strive for a deep understanding of the core technologies in use while building redundancy and establishing best practices.

SREs combine engineering experience, a desire to improve existing systems and processes, and creative problem solving to develop novel solutions to evolving challenges. Our team strives to automate processes whenever possible, using whatever tools are best for the job. We treat operations issues as if they are software problems. Our approach is to deploy whatever technology is best to solve the problem at hand. This often means that we are learning from the cutting edge of the open-source community or developing our own tools.

RESPONSIBILITIES

  • Design software focused on the reliability and scalability of the products we use to succeed in our mission
  • Develop a deep understanding of a core component of the Palantir software stack, thinking through all of the moving pieces and making recommendations on design and configuration
  • Work with the customer-facing engineers to mitigate risk and pay down technical debt on customers’ software stacks
  • Develop solutions to automate processes whenever possible

TECHNOLOGIES WE USE

  • Hadoop/HDFS/YARN, Spark, Elasticsearch, Cassandra, Vertica for distributed data storage
  • Puppet for system management
  • AWS, VMware ESX, KVM, Xen, Docker for virtualization

REQUIREMENTS

  • BS/MS in Computer Science or a related technical field, or equivalent experience
  • Strong development skills in any language; your code must be clean and effective
  • Experience with Linux systems from the kernel level up to the shell
  • Familiarity with TCP/IP networking
  • Ability to work independently and collaboratively with multiple partners

DESIRED SKILLS AND EXPERIENCE

See application page for details