hireejobs

Site Reliability Engineer

1.00 to 5.00 Years   Pune   13 Sep, 2022
Job Location: Pune
Education: Not Mentioned
Salary: Not Disclosed
Industry: IT - Hardware / Networking
Functional Area: General / Other Software, Network / System Administration
Employment Type: Full-time

Job Description

- Co-develop and participate in the full lifecycle development of cloud platform services, from inception and design through deployment, operation and improvement, by applying scientific principles.
- Increase the effectiveness, reliability and performance of cloud platform technologies by identifying and measuring key indicators, making changes to the production systems in an automated way and evaluating the results.
- Support the cloud platform team before technologies are pushed for production release through activities such as system design, capacity planning, automation of key deployments, building a strategy for production monitoring and alerting, and participating in the testing/verification process.
- Ensure that the cloud platform technologies are maintained properly by measuring and monitoring availability, latency, performance and system health.
- Advise the cloud platform team on improving the reliability of the systems in production and scaling them based on need.
- Participate in the development process by supporting new features, services and releases, and hold an ownership mindset for the cloud platform technologies.
- Develop tools and automate processes for large-scale provisioning and deployment of cloud platform technologies.
- Participate in the on-call rotation for cloud platform technologies.
- During incidents, lead the incident response and help write detailed postmortem analysis reports that are brutally honest and blameless.
- Propose improvements and drive efficiencies in systems and processes related to capacity planning, configuration management, scaling services, performance tuning, monitoring, alerting and root cause analysis.

Requirements

- Fresher or 1+ years of relevant experience in running distributed systems at scale in production.
- Expertise in one of the following programming languages: Java, Python or Go.
- Proficient in writing bash scripts.
- Good understanding of SQL and NoSQL systems.
- Good understanding of systems programming (network stack, file system, OS services).
- Understanding of network elements such as firewalls, load balancers, DNS, NAT, TLS/SSL, VLANs, etc.
- Skilled in identifying performance bottlenecks, identifying anomalous system behavior, and determining the root cause of incidents.
- Knowledge of JVM concepts like garbage collection, heap, stack, profiling, class loading, etc.
- Knowledge of best practices related to security, performance, high availability, and disaster recovery.
- Demonstrate a proven record of handling production issues, planning escalation procedures, conducting post-mortems, impact analysis, risk assessments and other related procedures.
- Able to drive results and set priorities independently.
- BS/MS degree in Computer Science, Applied Math or a related field.

Bonus Points if you have:

- Experience with managing large scale deployments of search engines like Elasticsearch.
- Experience with managing large scale deployments of message-oriented middleware such as Kafka.
- Experience with managing large scale deployments of RDBMS systems such as Oracle.
- Experience with managing large scale deployments of NoSQL databases such as Cassandra.
- Experience with managing large scale deployments of in-memory caching using Redis, Memcached, etc.
- Experience with container and orchestration technologies such as Docker, Kubernetes, etc.
- Experience with monitoring tools such as Graphite, Grafana and Prometheus.
- Experience with HashiCorp technologies such as Consul, Vault, Terraform and Vagrant.
- Experience with configuration management tools such as Chef, Puppet or Ansible.
- In-depth experience with continuous integration and continuous deployment pipelines.
- Exposure to Maven, Ant or Gradle for builds.

EEO Employer/Vet/Disabled

Keyskills:
java, academics, acp, algorithms, android, large scale deployments, root cause, file system, system design, drive results, search engines

© 2019 Hireejobs All Rights Reserved