TubeMogul has built the video advertising industry’s best, most trusted software for brand advertising. Our mission is to make video advertising as simple and accountable as paid search, and our efforts are paying off. In July 2014, we took the company public (TUBE) and established ourselves as one of the smartest, most innovative companies in the industry.

But the hard work doesn’t stop there. We are seeking a Site Reliability Engineer to help with the building and operation of our Real Time Analytics Platform. The operations team leverages some of the most cutting edge technology to simplify an otherwise complex environment.

Candidates should have a passion for building infrastructure for high-performance, “Big Data” systems. In this role, you will leverage open-source tools like Zookeeper, Hadoop, HBase, Hive, and Couchbase. Candidates will at least be familiar with the technologies we are using but may not have had the opportunity to acquire deep experience in previous job settings. However, you should have a true passion for systems engineering that is apparent in your past work.

Responsibilities:

  • Build tools to ease provisioning and scaling of TubeMogul Analytics infrastructure
  • Monitor and improve service performance and stability
  • Continuously extend and improve infrastructure components to handle growth
  • Investigate failures and offer suggestions for future improvement
  • Work closely with development teams to ensure that platforms are designed with “operability” in mind
  • Assist our software engineering team to ensure proper monitoring and metrics are being built into the applications before going to productionRequired Skills and Expertise:

  • Must have a solid understanding of information technology and information security
  • Desire to work in a fast paced environment
  • Experience troubleshooting and deploying applications on Linux
  • Experience in large scale monitoring and alerting tools such as Nagios, Ganglia, Graphite, Statsd, Skyline, Sensu
  • Fluent with Configuration Management Tools like Puppet, Chef or Ansible
  • At least one of : Perl, Python, Ruby
  • Knowledge of TCP/IP, HTTP, DNS, LDAP, SSL, SSH, OpenVPN, SQL, IDS, IPSBonus skills:

  • Java Programming Experience
  • Background in building and operating a Real Time Analytics infrastructure based on technology like Kafka, Storm, Hadoop, HBase, Amazon EMR, Couchbase, Aerospike, Vertica.
  • Experience with Amazon AWS (EC2, S3, EBS, EIP, VPC)
  • Server Virtualization using Eucalyptus, OpenStack or CloudStackCompensation and Benefits:

You’ll appreciate a competitive compensation package including an equity component and excellent benefits. You’ll love a challenging work environment, exceptional colleagues, strong business momentum, clear objectives and the ability to make a difference. Benefits include: medical, dental, vision, 401K matching, company events and an extraordinary culture.

Desired Skills and Experience

See application page for details