Site Reliability Engineer
Description
The Site Reliability Engineer will help elevate the Performance Engineering, Monitoring and Systems Engineering disciplines across the Cox Automotive Media Group. Site Reliability Engineering helps keep our technology humming and is essential to our ongoing evolution and growth. These engineers use their knowledge of technology and operational best practices to deliver available, scalable and reliable customer experiences. They are hybrid systems and software engineers who take ownership of the scaling, automation, and performance of their respective applications. This person will understand the challenges of rapidly creating, scaling and managing distributed applications and will collaborate with talented engineers across multiple disciplines to address those challenges. We aim to continuously improve the reliability of our services with a focus on automation, process improvement and operational velocity toward our continuous delivery objectives.
This position requires an individual with a programming background, analytical abilities, and communication and multitasking skills who can work in a collaborative environment to resolve complex problems.

Responsibilities
• Analyze application processes to improve the availability, scalability, latency, and efficiency of Cox Auto Media services.
• Evaluate production performance issues effectively, with the goal of automating to prevent problem recurrence.
• Identify and analyze architecture gaps, standards and methods for large-scale distributed systems.
• Engage in service capacity planning and demand forecasting, software performance analysis and system tuning.
• Collaborate with other leads across IT to align principles and operating procedures for the respective disciplines.
• Improve the capabilities of the support/delivery teams by providing pattern analysis and recommendations on potential risks for planned changes on platforms.
• Identify performance and resiliency recommendations for architects, project teams, and developers based on results and code analysis.
• Automate collection, aggregation and synthesis processes to increase productivity by collaborating with other engineering teams.
• Build a repository of historical performance results for the purpose of plotting trends over time.

Qualifications
Required:
• Bachelor’s degree in Computer Science, Information Systems or other related degree, or equivalent work experience.
• 3+ years’ experience supporting web-based applications built on UNIX/Linux systems.
• 3+ years’ experience automating tasks via scripting languages such as Python, Perl or similar.
• 3+ years of software development experience (Java, .NET, C++, etc.).
• 3+ years of software performance or systems engineering experience in a large-scale environment.
• 3+ years’ experience optimizing website performance, with strong knowledge of networks, browsers, content delivery, etc.
• Demonstrable knowledge of TCP/IP, HTTP, OSI layers, load balancing, and web application security, and experience supporting multi-tier web application architectures.
• Experience working with UNIX/Linux systems from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.

Preferred:
• Ability to identify system bottlenecks, architectural risks, and issues is a plus.
• Experience with Gomez, WebPageTest, Splunk or other monitoring tools is a plus.
• Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
• Knowledge of functional automation testing and tools.
• Experience with diagnostic tools such as Dynatrace, AppDynamics or Wily.
• Systematic problem-solving approach, coupled with a strong sense of ownership and drive.

Organization: Cox Automotive
Primary Location: US-GA-Atlanta-3003 Summit Blvd
Employee Status: Regular
Job Level: Individual Contributor
Shift: Day Job
Travel: No
Schedule: Full-time
Unposting Date: Ongoing