Site Reliability Engineer -II

5.00 to 10.00 Years Hyderabad 30 Mar, 2021

Job Location	Hyderabad
Education	Not Mentioned
Salary	Not Disclosed
Industry	IT - Software
Functional Area	General / Other Software
EmploymentType	Full-time

Job Description

Job Role:

We are looking for a Site Reliability Engineer (SRE) , initially focused on production AppOps, who can build scalable systems, using best practices around automation, that improve reliability, velocity and enable monitoring of the operational health of stacks throughout their life-cycle including metrics collection, aggregation, and visualization.

As a member of the SRE team you will support NCR s Financial Services business unit, product and technology teams to improve the design and operation of systems, focusing on making them scalable, reliable, and efficient while ensuring production performance and high availability of products/services primarily residing in the cloud. You will influence the development and implementation of reliable production systems and services to address emerging business needs (such as Cloud-based SaaS). SRE s pride themselves on the resiliency and stability of production systems, yet at the same time is committed to innovation and operational improvement through the application of software engineering practices to operations.

The SRE will facilitate innovation and operational improvement through the application of software engineering practices to operations. You will make our products easier to adopt and use by making improvements to the product, tools, processes and documentation. You are someone who strives for six 9 s or better in availability/uptime!

You will be responsible for maintaining and scaling production services and servers for complex and high throughput cloud services.
You will bridge and own the union between development, quality, security and operations
You will improve scalability, service reliability, capacity, and performance.
You will write automation code for provisioning and operating infrastructure at massive scale.
You are not just an operator, you re an experienced software engineer focused on operations.
You will initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development
You will use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
You will participate in disaster recovery planning and execution
You will be responsible for maintaining / patching servers supporting SaaS products. This includes Windows Servers, Linux Servers running in in-house Datacenters and/or using cloud PaaS providers (GCP & Azure)

You ll work hand-in-hand with all teams to ship our code to production using Continuous Integration / Continuous Deployment (CI/CD) and AppSec tooling.

You will collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs
You will provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems (You will be on call for periods of time)

You will develop monitoring architecture, implementing monitoring agents, build dashboards, manage escalations and alerts
You will participate in incident management and driving root cause analysis (RCA) and risk management processes
You will participate in a rotating on-call schedule during off-hours where you may periodically need to remote-in to systems if a production outage occurs.

IDEAL TECHNICAL AND PROFESSIONAL SKILLS:

BS degree in Computer Science or related technical field or 5 years prior relevant experience.
Extensive experience in a DevOps / SRE role with demonstrable experience in deploying and managing large scale production environments in GCP, AWS or Azure and Multi Datacenter environment.
Experience developing and debugging code (i.e. one or more of the following: Java, C, C++, .NET, Python, Ruby, Go, Shell, Perl, JavaScript)
2+ years deploying and supporting high traffic, scalable web applications/services
2+ years with cloud virtualization and PaaS
2+ years with AWS/GCP/Azure
2+ years with Docker, Kubernetes and early versions of OpenShift
Experience with Linux, Shell Scripting, PKI TLS/SSL, Network, firewalls, load balancers and backup
Experience in designing, analyzing and running large-scale distributed systems
Experience hosting and solving problems with public-facing services securely in Azure, AWS or GCP
Experience with orchestration, automation, and configuration management tools like git, Fabric and Ansible (or Puppet, Chef, Terraform, Helm or related technology)
Excellent analysis, debugging, root-cause identification, and troubleshooting skills
Experience with Kubernetes, system virtualization, on-prem and/or hybrid cloud computing, cloud Identity and security system, cloud monitoring and logging, and/or local/cloud storage.
Experience with one or more CI tools Jenkins, Artifactory, Harness, CloudBuild
Experience with application disaster recovery, migration, roll-back plans, expansion, routine deployments, and system upgrades
Experience with log management, including aggregation, alerting, and graphing (i.e Sensu / StackDriver / Prometheus / ELK / TICK stacks)
Bonus points for experience with Cassandra, Elasticsearch or Kafka
Extra bonus points for Cloud certifications and exposure to Harness

Keyskills :
javaacademicsacpalgorithmsandroidroot cause analysiseuropean works councilsdisaster recovery planningsoftware engineering practicesroot causedata centerhybrid cloudhigh trafficlog managementdebugging codecloud computingrisk managementshel

APPLY NOW

Site Reliability Engineer -II Related Jobs

We Hiring for E/M ,HOME HEALTH, Surgery, Ipdrg Coder, Trainer, QA

Axis Services

1.00 to 6.00 Years Hyderabad 02 May, 2024

Keyskills :
medical codingcpcccsbchhc

View & Apply
Senior Analyst

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

6.00 to 9.00 Years Hyderabad 02 May, 2024

Keyskills :
siliconprocess documentationanalyticalconsultingpythonmachine learning

View & Apply
Program Manager

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

4.00 to 9.00 Years Hyderabad 02 May, 2024

Keyskills :
advanced excellogistic regressioncommunication skillsqlikviewstatistical modelingpythonmachine learning

View & Apply
Product Designer

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

5.00 to 8.00 Years Hyderabad 02 May, 2024

Keyskills :
software engineeringrtmsqldata mungingproduct developmentsoftware development life cyclesdlcproduct design

View & Apply
Dynamic Mechanical Engineer

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

3.00 to 6.00 Years Hyderabad 02 May, 2024

Keyskills :
spibasiccadvalvessolid modelingengineering designproduct designsheet metal

View & Apply
Urgent Hiring AEM Developer MSRcosmos Group

MSR COSMOS IT LLP

4.00 to 9.00 Years Hyderabad 02 May, 2024

Keyskills :
damdigital asset managementadobe experience manager

View & Apply
Business Analyst

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

4.00 to 9.00 Years Hyderabad 02 May, 2024

Keyskills :
safety trainingprogrammingmanagementhospitalityscheduling

View & Apply
Web Programmer

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

4.00 to 8.00 Years Hyderabad 02 May, 2024

Keyskills :
numpyawsmatplotlibgitdjangopythonsciencemongodbnode

View & Apply
Program Manager

MACKENZIE MODERN IT SOLUTIONS PRIVATE LIMITED

3.00 to 8.00 Years Hyderabad 02 May, 2024

Keyskills :
machine learningcommunication skillslogistic regressionstatistical modelingqlikviewpythonadvanced excel

View & Apply
Technical Manager

SUNITA AGRI EXPORTS PRIVATE LIMITED

8.00 to 12.00 Years Hyderabad 02 May, 2024

Keyskills :
programminggoodmanagementevaluationtechnicalproblemcommunicationcontentengineeringcreationprojectsolvingsocialskills

View & Apply

Site Reliability Engineer -II

Job Description

Site Reliability Engineer -II Related Jobs

We Hiring for E/M ,HOME HEALTH, Surgery, Ipdrg Coder, Trainer, QA

Senior Analyst

Program Manager

Product Designer

Dynamic Mechanical Engineer

Urgent Hiring AEM Developer MSRcosmos Group

Business Analyst

Web Programmer

Program Manager

Technical Manager

Jobs By Category

Jobs By Skills

Jobs By Location

Main Menu

Jobseekers

Employers