You will set up, manage, maintain and troubleshoot Splunk’s SaaS customer facing systems, Splunk Cloud This position is an opportunity to join the team that is responsible for Cloud’s operational infrastructure and delivery.    Amazon’s EC2 and EBS is the platform and this candidate should have experience with managing deployments, right-sizing servers and storage for performance, and providing muti-site HA strategies at Amazon.  This is an incredible opportunity to use your existing cloud experience and drive the growth of our cloud offerings here at Splunk!   

Responsibilities: 

  • Manage Amazon server/storage deployments including release process, image management, backup/restore, HA/DR
  • Manage monitoring environment using Splunk, Zabix, Pingdom, and PagerDuty
  • Develop scripts and tools in Python/Shell or similar to monitor system stability and performance and ensure system availability, reliability, and usability
  • Perform end-to-end administration of virtual Amazon infrastructure, focusing on Linux based systems
  • Troubleshoot complex problems, resolve operational issues, and interact with vendors
  • Perform Amazon instance maintenance and system upgrades including service packs, patches, hot fixes, vulnerabilities, and security configuration
  • Work with Opscode Chef for automated deployment capability, along with Github source code management  Requirements:  

  • 5+ years as a Linux system administrator supporting enterprise computing platforms and systems
  • 3+ years with virtualization technologies
  • 2+ years experience running production systems at Amazon
  • Knowledge of Amazon EC2 including machine image management, storage
  • Understanding of Amazon EC2 regional center, availability zones, and HA strategies
  • Experience supporting customer facing multi-tenant infrastructure (SaaS) or similar cloud related services
  • Experience with Python or Shell, and open source based systems
  • BS or comparable work experience  

Desired Skills and Experience

See application page for details