Desired Skills and Experience

  • Build and manage various components of the internal and production environments with a focus on configuration management, continuous integration, and platform automation
  • Implement disaster recovery and reliability improvement initiatives
  • Build and manage software delivery, systems integration, and developer support tools
  • Take Kubernetes and Docker to production for all our new microservices
  • Manage Kubernetes and Cassandra clusters
  • Take ownership of features that range from services provisioning on SaaS or On-Prem
  • Enhance developer CI/CD pipeline using Jenkins and Github
  • Automate our infrastructure and EC2 deployments (CloudFormation) as well as our build automation systems (Jenkins)
  • Conduct performance tuning, load testing, and optimization of information/data processing, maintenance, and support of the production environment  
  • Proficiency with configuration management tools like CloudFormation or Terraform (or at least Puppet, Chef, or SaltStack)
  • Solid experience in monitoring cloud services using tools like Sysdig, Datadog, Prometheus, Grafana, Graphite, Nagios, or Zabbix
  • Experience in managing AWS resources including EC2, RDS, Auto Scaling groups, ALB/NLB, IAM
  • Experience in diagnosing and troubleshooting customer facing production service outages
  • Aptitude for troubleshooting complex problems in high-throughput web applications and network services
  • Command of at least one of the following: Java, Python, Bash, and Golang
  • Solid understanding of Linux systems and networking
  • Working knowledge of Git
  • Worked with containers such as Docker or Rocket
  • Deployed Kubernetes or OpenStack clusters
  • Managed any of these clusters - Cassandra, HBase, HDFS, Elasticsearch
  • Set up Kafka or Redis clusters
  • Used log aggregation services like Elasticsearch or Splunk
  • Familiar with CI/CD pipelines using Jenkins, Bamboo or TeamCity
  • Knowledge of ITIL terminology for incident and problem management
  • Background in PCI/HIPAA compliant infrastructure in the cloud
  • We’re a well funded startup that already has a large enterprise customer base.
  • We have a pragmatic, approachable engineering culture, from the CEO down.
  • We have an organizational focus on delivering value to customers.
  • Our open source tools (https://www.sysdig.org) are widely used and loved by technologists & developers.
  • We have fun team and company events, beer outings, and lots of espresso (if you’re in to that).
  • Desk and tech setup of your choice (for wherever you work)
  • IRA with company matching up to 3% of salary
  • Unlimited vacation policy
  • Monthly self-improvement grant – spend on yourself however you see fit
  • Free weekly team lunches and delicious snacks every day of the week
  • Free monthly house cleaning service

Apply