Module Lead - Systems
With Mphasis in Hyderabad - IN
Posted on March 18, 2020
About this job
Job type: Full-time
java, spring, agile
Step into the world of Mphasis!
Welcome to a world of customer-centric transformation. Of digital, cognitive, and intelligent technologies that are always a few paces ahead. A world where you redefine and co-create paradigms of business success every day. Where you keep alive your inquisitive mind to deliver innovation and wow customers.
A leading applied technology services company, Mphasis has sprinted exciting laps of transformation to stand tall on the edge of NEXT and positive disruption. With our agile business processes and innovations, we anticipate the future of applied technology and predict tomorrow’s trends to keep our clients at the summit of an ever-changing marketplace. Our future-proof expertise brings faster, more innovative IT solutions to next-gen customers across countless industry segments and micro-verticals.
Role : SRE Lead
Location : Bangalore/Hyderabad
Who are we looking for?
The SRE Lead will manage a critical environment support and proactive monitoring engagement for one of our biggest clients in the banking and capital markets domain. The individual should be passionate about technology and experienced in developing and managing cutting-edge environment monitoring solutions.
Change Management; Stewardship for SRE initiative
Oversee SRE activities, provide direction in terms of business priorities.
Enable SRE team interaction / integration with other stakeholders (Development, Infrastructure, Info Sec…)
Lead discussions and negotiate Error Budgets with appropriate stakeholders
Ability to design and deliver all Operations/SRE services and processes, including managing L2 Environment Support
You will serve as a leader and coach of a team of SREs responsible for automated infrastructure deployment, ongoing operation and monitoring of our Cloud infrastructure, working closely with the development teams.
Analyse reliability challenges and develop automated solutions for incident resolution
Work with development teams to improve applications’ operational features to reduce MTTD and MTTR and enable automatic recovery
The job requires getting your hands dirty, troubleshooting infrastructure, and architecting data centers, using your existing knowledge and toolkits.
Continuously analyze the current Site Reliability capabilities and identify areas for improvement
Identify, define, and implement new tools and technologies to improve the quality and efficiency of the distributed platform.
You will drive the reliability and supportability aspects of the Cloud service, including change management, triage of customer escalations, remediation plans, playbooks and automations.
Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation and refinement.
9+ years of overall environment support experience, with 5+ years of experience as an SRE
At least 1 year as SRE Lead
5+ years of experience building monitoring solutions using AppDynamics, Splunk, set up and test proactive monitoring s
Experience developing monitoring solutions across the Core Java, Web API, services and database layers
Have strong coding skills in at least one programming language, primarily in Core Java, Spring, and a desire to pick up more. You’ll run into several other languages on a regular basis as well. (If you’re good with web technologies that’s a plus!)
3 years of experience in Spring
Solid knowledge of Atlassian tools (Bitbucket, Bamboo etc.) and monitoring tools (5+ years)
Knowledge of cloud and virtualization technology a plus.
Experience with algorithms, data structures, complexity analysis and software design.
Big Data experience (Hadoop, Hive, Spark SQL, HDFS, YARN)
Experience in Agile SDLC practices with strong focus on continuous integration and continuous delivery pipeline automation and tooling, DevOps, distributed version control system, and agile methodologies.
Comfortable with large-scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.
Experience designing and executing large-scale systems automation projects with strong autonomy
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
Interest in designing, analyzing and troubleshooting large-scale distributed systems
Strong Understanding of Core Java
Expertise in a monitoring tool such as AppDynamics or Splunk
Solid knowledge of Atlassian tools (Bitbucket, Bamboo etc.)