Desired Skills and Experience

  • Critical alert response, tuning, management
  • Operational tool creation, routine task automation
  • Operational runbook development, execution, improvements
  • Server administration, troubleshooting
  • Traffic engineering, DNS management, DDOS mitigation
  • IP transit partner maintenance coordination
  • Escalation support for internal teams
  • New POP deployment, configuration, activation
  • End-to-end ownership and accountability for all Edge Cloud activities and incidents
  • Comprehensive knowledge of how CDNs work: TCP/IP, BGP anycast, DNS, HTTP, TLS, reverse proxies, etc
  • Experience in data center routing and switching systems
  • Hands-on experience operating a global network and/or a globally distributed Linux-based system
  • Capable of troubleshooting and diagnosing failures of hardware in a data center environment. Everything from SSD’s to AOC cables are fair game for ECO
  • Development or administrative experience in a Linux-based environment, and associated open-source tools (Chef, Ansible, Git, Awk, Sed, cURL, etc)
  • Solid understanding of how the internet works - from client to server, and everything in-between. HAProxy, nginx, and Varnish are not foreign to you
  • Cultivation of various monitoring platforms, such as Datadog, Nagios, Ganglia, Icinga, Pingdom, Catchpoint, Cedexis and others
  • Exposure to cloud environments and systems like AWS, GCE, Azure, Softlayer
  • Data analysis using tools and systems like Deepfield, MySQL, Google BigTable
  • An innate curiosity and inquisitiveness
  • 3+ years of direct involvement with network and/or systems engineering in a web-scale production environment
  • 3+ years working with at least one of: Perl, Python, Go, Ruby
  • B.A. or B.S. in an engineering or computer-related field of study or equivalent on-the-job training