As an SRE, we know that you are passionate about seamless uptime. You delight in building tools to automate routine tasks and constantly seek new ways to improve system performance. If you also want to join a hyper-growth, $billion startup, work with exceptional people, and play with some really cool tech, then you found the right place.
CORE RESPONSIBILITIES:
We’re a CentOS shop in the data centres running on high end Intel servers. Our core technology is Java on Linux using open source technology throughout the stack. The Java engine runs and stores all data in RAM for super high performance while staying safe with transaction logs and auto recovery. The office is Macs with a few Windows holdouts. You decide which works best for you.
In order to manage these systems you will:
Desired Skills and Experience
- Utilise your skills in automation, replication and scaling to manage our worldwide data centres
- Write scripts in Ruby, Python, Perl, etc. to build custom tools for automation, replication and scaling
- Build tools to monitor and provide metrics on our systems
- Perform Linux system administration (DNS, NFS, RPM, Apache, Raid, etc.)
- Be able to take a bare-metal server/hardware to fully functional app servers
- Lead Release deployments and participate in revising software design to scale and prevent against failures
- Participate in on-call rotation
- Automation – Currently using Chef
- Java applications including JVM performance and tuning
- Linux administration – dns, nfs, rpm, apache, raid, etc.
- Programming in any of Ruby, Python, Perl, etc.
- Multi Data centre management, replication, scaling.
- Middleware software such as Nginx, HA Proxy, Consul, terraform or equivalent architectures
- Metrics and monitoring – writing custom tools and familiar with open source options.
- MySQL – replication, backups, some light querying
- Networking – Switches, routers, firewalls, vpn, etc
- Amazon EC2, EFS and related AWS technologies