Senior DevOps Engineer

With SparkCognition in Austin, TX, US

Posted on November 07, 2019

About this job

Job type: Full-time
Role: DevOps, System Administrator


cloud, automation, sysadmin

Job description

SparkCognition is an AI leader that offers business-critical solutions for customers in energy, oil and gas, manufacturing, finance, aerospace, defense, and security. A highly awarded company recognized for cutting-edge technology, SparkCognition develops AI-powered, cyber-physical software for the safety, security, reliability, and optimization of IT, OT, and the Industrial IoT.

SparkCognition is looking for a Senior DevOps Engineer who can help drive our DevOps initiatives. The ideal candidate has experience running automated production infrastructure on a cloud platform such as AWS, Azure, or GCP. The position offers opportunities to design and build a modern, automated platform in the cloud, spanning multiple regions around the globe. This is a high-visibility role in which the candidate will work across multiple teams to shape a common infrastructure for running machine-learning solutions.


  • Continuously improve the infrastructure for cloud-based services and client interfaces
  • Collaborate with team leads and management across the company to define shared capabilities
  • Manage the day-to-day operations of our build, testing, and continuous integration environment
  • Support an effective developer workflow including build, test automation, and deployment
  • Knowledge of best practices and IT operations in an always-up, always-available service
  • Proactively communicate project & task status to project stakeholders
  • Well versed in systems administration with a background and understanding of software development
  • Provide occasional on-call support which may include irregular hours as needed


  • Must have strong experience designing and deploying scalable infrastructure using Kubernetes
  • Proven ability with several Google Cloud Platform services or similar cloud infrastructure
  • Hands-on experience with cloud networking and traffic management (VPCs, load balancers, network segregation)
  • Previous experience with Helm
  • Familiar with monitoring, metrics collection, and reporting using open source tools such as Prometheus
  • Previously deployed and maintained Kafka or Pulsar
  • Proven skills with automation/configuration management tools such as Terraform and Ansible
  • Experience with development operations practices, including continuous integration, automated testing, and automation of the development process
  • Experience building out continuous integration/continuous delivery pipelines and overall development operations using Jenkins
  • Previous containerization experience with Docker or similar technology
  • A solid background in Linux/Unix Administration
  • Proficiency in at least one scripting language (Shell, Bash, Python)
  • Proven experience managing multiple projects and competing priorities in a fast-paced work environment
  • Strong written and verbal communication skills
  • Proven ability to work across multiple product teams and deliver solutions on tight deadlines
  • Experience with Spinnaker is a plus
