Principal Site Reliability Engineer at RideCell (San Francisco, CA)
Principal Site Reliability Engineer:
Technical:
- Deep understanding of cloud infrastructure such as AWS and Google Compute
- Designed and scaled highly available multi-tenant services
- Works with the engineering team on performance, scalability, and reliability projects
- Experience deploying microservices in Kubernetes and Docker containers
- Strong understanding of networking fundamentals, security (SSL/TLS), and HTTP
- Fluent in SQL and NoSQL databases
- Experience deploying database sharding and Memcached/Redis
- Deployed Hadoop clusters
- Experience with load balancers and zero-downtime deployments
- Conducts security reviews and overall site audits
- Manages general maintenance such as periodic data backups
Non-Technical:
- Strong written and verbal communication
- Budget planning for infrastructure
- Manages contractors and vendors
- Team player with a strong sense of ownership