hireejobs
Hyderabad Jobs
Banglore Jobs
Chennai Jobs
Delhi Jobs
Ahmedabad Jobs
Mumbai Jobs
Pune Jobs
Vijayawada Jobs
Gurgaon Jobs
Noida Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

SRE Engineer

4.00 to 7.00 Years   Bangalore   12 Nov, 2019
Job LocationBangalore
EducationNot Mentioned
SalaryNot Disclosed
IndustryBanking / Financial Services
Functional AreaGeneral / Other Software
EmploymentTypeFull-time

Job Description

J.P. Morgan is a leader in financial services, offering innovative and intelligent solutions to clients in more than 100 countries with one of the most comprehensive global product platforms available. We have been helping our clients to do business and manage their wealth for more than 200 years and we keep their interests foremost in our minds at all times. This combination of product strength, intellectual capital and character sets us apart as an industry leader. J.P. Morgan is part of J.P. Morgan Chase & Co. (NYSE: JPM), a global financial services firm with assets of $2.0 trillion.The Chief Technology Office (CTO) aims to deliver technology efficiently and effectively with the right capabilities and the best talent for the firm, while removing friction that slows delivery. The AI/Machine learning group within the Chief Technology Office owns the Strategy and Pattern for AI and Machine Learning for the firm. We enable advanced analytics for the Lines of Business through the use of AI and Machine Learning design patterns and common services.We are seeking an experienced software engineer in our global Site Reliability Engineering (SRE) team supporting our AI/ML platform. This individual will be expected to work with functional application development teams, partner with infrastructure engineers and production support analysts to determine requirements for designing and developing automation, SDLC and development environment testing & integration tools. The toolsets developed must pass the rigor of JPMC s cyber security standards.The SRE team runs, maintains and improves the AI/ML Platform against established Service Level Objectives by applying software engineering practices. It is responsible for the availability, performance, change management, monitoring, and capacity management of their services, with special emphasis being placed on the automation of the processes/workload in support of the above. The SRE team is also responsible for the operational support of the AI/ML infrastructure, with emphasis being placed on the ability to submit outage/issue/incident data into a design and SDLC feedback loop to ensure maximum automation and outage avoidance.Responsibilities

  • Ensure application / platform uptime and quality, providing operational and development expertise in making our systems have proactive monitoring, fail rarely and automatically fix when they do fail
  • Key contributor to SRE, core infrastructure and functional development teams throughout the life cycle to help support software for reliability and scale, ensuring minimal refactoring or changes
  • Own day-to-day health, uptime, monitoring, reliability of services & server infrastructure, performance improvements, change management and capacity management of the services supported
  • Identify and/or analyze patterns of incidents/problem, conduct flawless post-mortems, develop permanent remediation plans, implement automation to prevent future incidents from re-occurring again
  • Troubleshoots priority incidents, conducts blameless post-mortems and ensures permanent closure of the incidents
  • Apply company standard s fpr change management, incident management and problem management principles
  • Works with open source software and experienced in packaging / distribution techniques like anaconda and pip
  • Engages with development team throughout the life cycle to help develop software for reliability
  • Contributes to the definition of the strategic roadmap and its execution; inclusive of R&D of emerging industry trends
  • Work with Cyber team to ensure systems are safe and resolve / prioritize vulnerability fixes
  • Applies analytics on the past data like incidents and usage patterns for predicting issues and takes proactive actions
  • Defines and drives adoption of a best in class monitoring frameworks to accomplish end to end flow monitoring and noiseless alerting
  • Deploys the software and product upgrades
  • Manages the effort split between manual operational work and engineering work
  • Be part of the 24x7 support coverage as needed
Qualifications:
  • B S or MS degree in computer science
  • 4 + years of experience architecting integrated stack solutions (storage, network, compute) within an enterprise scale production environment
  • 4 + years of experience in performance engineering and monitoring using tools such as AppDynamics, Splunk, Apica, Jmete r, data dog etc.
  • 4 + years of incident management, change management and problem management experience in an large scale operations environment
  • Experience in Anaconda, Jupyter, open source framework.
  • Experience in conda packaging of python libraries and experience with python distribution.
  • Cloud computing: Amazon Web Service, Azure, Docker, Kubernetes.
  • Familiarity with Python programming language
  • Experience working in an Agile Development environment
  • Experience in setting CI/CD pipeline.
  • Proven ability to understand and troubleshoot complex problems under pressure
  • Familiarity with AWS ML/Sagemaker , EC2, EMR, S3, ASG , would be preferred.
  • Experience in big data technologies like Hadoop, map reduce, spark etc
  • Hands on experience of GIT, BitBucket, Jenkins, SONAR, SPLUNK, Maven, AIM and/ or Continuous Delivery tools
,

Keyskills :
esign patterns flow monitoring computer science support analysts big data machine learning change management open source software agile development

SRE Engineer Related Jobs

© 2019 Hireejobs All Rights Reserved