Senior Site Reliability Engineer
With SafeCorp Technology, Inc. in Sacramento CA USMore jobs from SafeCorp Technology, Inc.
Posted on August 09, 2019
About this job
Compensation: $100k - 150k | Equity
Job type: Full-time
Experience level: Senior
Industry: Financial Services, Financial Technology
Company size: 11–50 people
Company type: Private
amazon-web-services, cloud, security, kubernetes, automation, sysadmin
We are looking for a full-time Senior Site Reliability Engineer to develop our new cloud-based infrastructure. You will be part of a dedicated team responsible for the design, operation, availability, change management, performance, monitoring, security, and emergency response of our global infrastructure, as well as creating and maintaining our continuous integration/ continuous delivery pipelines. You will assist our application engineers and automation experts as they design and develop infrastructure to improve resiliency, security, and data availability. This role will be based in our Sacramento, CA office.
? BS in Computer Science or related field, or equivalent employment experience.
? Strong sense of ownership, customer service, and integrity.
? Experience managing large numbers of systems in cloud environments (AWS, Azure, GCP) utilizing infrastructure as code (CloudFormation, TerraForm, etc).
? Deep understanding of cloud and application security with a desire to enforce best practices.
? Understanding of cloud database solutions and experience administering large-scale deployments
? Fundamental understanding of distributed systems including: AWS Well-Architected, Microservices, and the Twelve Factor App.
? Passion for eliminating repetitive manual processes using automation.
? Architect, author and deliver software to improve the availability, scalability and security of the platform.
? Build and manage systems, infrastructure and applications through automation.
? Deploy, support and monitor new and existing services, platforms, and application stacks.
? Work closely with developers to debug application issues by evaluating application logs, stack traces, and system metrics.
? Participate in periodic on-call duties.