Service Reliability Engineer
Proofpoint is always looking to hire exceptional people. We help some of the largest and most successful companies in the world defend, protect, and govern their most sensitive data, and we are currently building our next generation of cloud-based solutions that will literally change the way people work.
Duties/Responsibilities
- Collaboratively manage several 24x7 multi-site production environments powering the Proofpoint Enterprise Archive service, including deployment, maintenance, troubleshooting, performance tuning, and security
- Contribute to the evolving design and architecture of reliable and scalable infrastructure
- Ensure proper monitoring, alerting, capacity planning and reporting in the production environment
- Develop processes, tools, and documentation in support of production operations
- Participate in evaluating new software, hardware and infrastructure solutions
- Participate in an on-call rotation and be willing to jump on escalated issues as needed
- Train and mentor junior staff to improve their ability to support the environment
Required Skills And Experience
- Demonstrable skills and 7+ years’ experience managing, troubleshooting, and tuning Linux systems
- Experience automating management of systems and applications using Puppet, Perl, Python, and Ruby
- Experience with industry-standard foundation technologies such as DNS, SMTP, NTP, LDAP, NFS
- Hands-on experience with TCP/IP and Ethernet network technologies
- Experience managing a large distributed computing environment
- Experience with industry-standard operational practices such as change management, incident management, and working in colocation datacenters
- Excellent verbal and written communication skills
Desired Skills And Experience
- Experience operating production Internet-facing services for large enterprise customers
- Experience managing multi-tier web services using technologies such as Apache web server, Tomcat, Java applications, and MySQL
- DevOps experience working in a configuration management framework such as Puppet
- Experience troubleshooting and upgrading Puppet as well as developing new Puppet classes
- Experience with monitoring and alerting systems such as Nagios, Zenoss, and Sensu
- Experience with F5 or HAProxy load-balancing technologies
- Experience with CentOS or Red Hat Enterprise Linux (RHEL) and creating RPM packages. RHCE is a plus
- Experience with VMware vSphere, ESX or ESXi, and vCenter
- Experience with OpenStack or KVM virtualization technologies
- Experience with public cloud providers such as Amazon EC2 or Rackspace Cloud
- Experience managing Hadoop or Cassandra services
Proofpoint, Inc. helps the largest and most successful companies in the world protect and govern their most sensitive data. Founded in 2002 by the former CTO of Netscape and headquartered in Sunnyvale, CA, Proofpoint was funded by top Silicon Valley investors, including Benchmark Capital and Mohr Davidow Ventures, before going public.
Please note that Proofpoint does not accept unsolicited resumes from recruiters or employment agencies. In the absence of a signed Recruitment Services Agreement, Proofpoint will not consider or agree to payment of any referral compensation or recruiter fee. In the event a recruiter or agency submits a resume or candidate without a previously signed agreement, Proofpoint explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency.