Site Reliability Engineers are directly responsive for the availability of the NetSuite’s customer facing solutions. They monitor the applications, react to problems, proactively address issues before they become problems and build tools to constantly improve availability, performance, uptime and response time. Site Reliability Engineering is a global team ensuring NetSuite exceeds its Service Level Commitment 24x7x365.Responsibilities:Keep the customer facing site runningOwner of all alerts and escalations in customer facing production environmentAutomate manual tasksUse SRE toolset to identify, resolve or escalate issues in productionBuild effective monitoring that evolves with the productWork closely with development engineers who build the productInterface with Customer SupportBuild, test and run Disaster Recovery proceduresGain familiarity with NetSuite solutions and customer needsWork to constantly increase the number of issues resolved directly by SREMinimum Qualifications:Experience with Unix or LinuxExperience with networkingDatabase knowledge is desirable3-4 years experience working in a large scale production operations environment providing mission critical services to customersComputer Science DegreeShell scriptingGood troubleshooting skillsWork quickly and accurately under pressure in time critical situationsA self starter who takes pride in job ownership and is always thinking of innovative ways to improve efficiency and effectiveness
Desired Skills and Experience
See application page for details