hireejobs
Hyderabad Jobs
Banglore Jobs
Chennai Jobs
Delhi Jobs
Ahmedabad Jobs
Mumbai Jobs
Pune Jobs
Vijayawada Jobs
Gurgaon Jobs
Noida Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

SaaS Site Reliability Engineer (SRE)

3.00 to 8.00 Years   Bangalore   29 Dec, 2021
Job LocationBangalore
EducationNot Mentioned
SalaryNot Disclosed
IndustryTelecom / ISP
Functional AreaNetwork / System Administration
EmploymentTypeFull-time

Job Description

Job DescriptionThe Nokia CNS SaaS SRE Operations department is looking for a strong and motivated Site Reliability Engineer (SRE). This role is designed around a lead position within the team that will help shape processes, tools, and capabilities and take the SaaS SRE team to world class quality and capability!This position will be a part of the CNS SaaS SRE center of excellence that will be responsible for:

  • 24x7x365 Service Assurance for SaaS applications (use cases) deployed across all public cloud hyperscaler providers that CNS SaaS will have
  • L1/L2 Site Reliability Engineering Operations (event & incident management, change management and execution, security and privacy compliance remediation and mitigation)
  • Auto-recovery DevOps for continuous service improvement
  • L2/L3/L3 Application support integration with BU and Product teams
  • Request fulfillment
  • Change management
  • MOP/SOP planning and execution
  • SLA/SLO commitment and attainment
  • Infra and Ops SREs
  • Disaster Recovery Plan execution and testing (meeting RPO/RTO targets)
Software Engineering:a.) Services can range from production code changes to alerting and monitoring adjustments.b.) Includes tasks like building proprietary tools from the scratch to mitigate weaknesses in incident management or software delivery.Troubleshooting Support Escalation:
  • Should fully know critical issues to route support escalation incidents to concerned teams.
  • On-Call Process Optimization
  • Add automation for improved collaborative response in real-time, besides updating documentation, runbook tools, and modules to ready teams for incidents.
Documenting Knowledge:
  • To ensure a seamless flow of information between teams, site reliability engineer job may require documenting the knowledge gained.
  • Optimizing SDLC (Software Development Life Cycle)
  • Based on post-incident reviews, site reliability engineers will need to optimize the Software Development Life Cycle (SDLC) to boost service reliability.
  • This SRE lead position will be a key role in the ongoing success of the SaaS business and protecting customer Annual Recurring Revenue by assuring service reliability and instilling absolute confidence in service quality and security.
Knowledge:
  • Middle Ware Administrator - Worked either on Weblogic, Jboss any Java Containers.
  • Web Server knowledge - Apache, Nginx.
  • Basic Unix/Linux and Networking skills.
  • Good hands on day-to-day Unix commands on Prod Middleware maintenance/administration work.
  • AWS + Containerized knowledge: Docker + Kubernetes.
  • Other Cloud Technologies - GCP and Azure is an add-on.
  • Application software upgrades and migrations
  • Application configuration and management
  • Monitoring applications using Prometheus, Grafana, etc., and other services.
  • Contribute to design enhancements of application infrastructure architecture to support business growth.
  • CI/CD deployment of new applications base installations.
  • First line of response to troubleshooting new and existing systems/applications.
  • Reporting the performance levels of the application, databases and Unix systems.
Job Responsibilities & Competencies
  • Help build and maintain SRE centers of excellence that are best-in-class at service assurance and service quality
  • Provide technical and operational leadership over Agile DevOps practices including documentation, iteration, planning, scheduling, coordinating and executing
  • Help devise and execute strategies for accomplishing service assurance improvements using creative and cost-effective means and methods
  • Encourage and foster SRE contribution and input/participation in continuous service improvements both technical and procedural
  • Collaborate with team members and peers/partner organizations to determine and define best practices that bring benefits to SRE Operations and the SaaS organization
  • Work with Product Managers and R&D teams of SaaS applications (use cases) to determine and support service-level agreements (SLAs), service-level indicators (SLIs) and service-level objectives (SLOs)
  • Understand that 100% reliability of component services is not expected, and failure is planned for and accepted/accounted for
  • Partner with R&D teams of SaaS applications (use cases) and SaaS Delivery & Operations Framework DevOps to improve quality, reliability and resilience of overall SaaS capabilities
  • Recover, restore, and build self-recovery capability for cloud-native services and components (AWS, GCP and Azure)
  • Create, organize, and adhere to documentation for processes, procedures and maintenance
  • Assure accuracy of and actively populate Knowledge Management contents and documentation
  • Participate in 24x7x365 on-call rotation schedule (taking shifts in a follow-the-sun support model)
Qualifications
  • 5 or more years of operations, support, SRE, DevOps or related experience
  • Strong communication skills, including ability to create presentations or dashboards that contain enough detail to be consumed without accompanying narrative, yet brief enough not to confound or confuse the audience either
  • Experience with Incident and Event Management (tools, processes, KPIs, regular status reporting, post mortems, RCAs)
  • Capable of working in a diverse, global environment
  • Self-directed, can interpret and extrapolate requirements and priorities from shared contents or communications
  • Experience or familiarity with DevOps technologies (examples: Azure DevOps, public cloud native services and components, GitHUB, Terraform/Terragrunt, etc)
  • Experience or familiarity with Kubernetes and related technologies (docker, helm, k8s API)
  • Experience or familiarity with public cloud native services and components (AWS, GCP, Azure)
  • Experience with ticketing systems like SF.com, Zendesk, Jira including process and even API integrations
  • Experience with documentation management using Confluence, SharePoint and MS Teams
Imagine creating technology that has the potential to change the world. Working with us, you will have a positive impact on people s lives and help to overcome some of the world s most pressing challenges. We act inclusively and respect the uniqueness of people. At Nokia, employment decisions are made regardless of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disability, protected veteran status or other characteristics protected by law. Nokia culture welcomes people as their true selves. Come create technology that helps the world act together.Additional Information
  • For US job positions only: Vaccination Requirements: As a federal contractor & pursuant to Presidential Exec. Order 14042, Nokia mandates for all employees a COVID-19 vaccination or an approved religious or medical accommodation.
,

Keyskills :
javaacademicsacpalgorithmsandroidsoftware development life cyclecenter of excellencelife cyclecore networkservice qualityglobal servicesevent managementnetwork serviceschange managementservice assurance

SaaS Site Reliability Engineer (SRE) Related Jobs

© 2019 Hireejobs All Rights Reserved