Hyderabad Jobs |
Banglore Jobs |
Chennai Jobs |
Delhi Jobs |
Ahmedabad Jobs |
Mumbai Jobs |
Pune Jobs |
Vijayawada Jobs |
Gurgaon Jobs |
Noida Jobs |
Hyderabad Jobs |
Banglore Jobs |
Chennai Jobs |
Delhi Jobs |
Ahmedabad Jobs |
Mumbai Jobs |
Pune Jobs |
Vijayawada Jobs |
Gurgaon Jobs |
Noida Jobs |
Oil & Gas Jobs |
Banking Jobs |
Construction Jobs |
Top Management Jobs |
IT - Software Jobs |
Medical Healthcare Jobs |
Purchase / Logistics Jobs |
Sales |
Ajax Jobs |
Designing Jobs |
ASP .NET Jobs |
Java Jobs |
MySQL Jobs |
Sap hr Jobs |
Software Testing Jobs |
Html Jobs |
Job Location | Bangalore |
Education | Not Mentioned |
Salary | Not Disclosed |
Industry | Recruitment Services |
Functional Area | General / Other Software |
EmploymentType | Full-time |
As a member of the HPC as a service team HPCaaS, you will be responsible for establishing and executing on strategic objectives focused on improving the effective utilization of the compute resources while meeting or exceeding customer service level agreements for job prioritization, job concurrency, and job throughput in our EDA compute clusters. This includes leading architectural innovation and path finding efforts to create and implement Western Digital s next generation Grid computing environment. As a member of the team you will be expected to not only deliver on technical requirements and solutions but also be able to present your solutions to senior management. Responsibilities include but are not limited to working as individual contributor, a team member and a technical team lead to explore, define and pilot new solutions with little supervision. Develop solutions, scripts and/or processes to automate management of services and tools as required. In this role, you will be collaborating closely with EDA and hardware design team stakeholders to define and deliver workload efficiency improvements in Western Digital s EDA HPC infrastructure globally.What you ll be doing:Support multi-site, high performance compute infrastructure and services for the global engineering product development organizationsDesign, create, deliver and support the deployment of Ansible automation within HPC and Unix environmentsIdentify and propose solutions and new services for the distributed ASIC and GPU computing clustersPerform troubleshooting and root cause analysis of HPC clusters and file system related issuesDevelop and maintain documentation for all aspects of the HPC infrastructureImprove root cause analysis and corrective action for problems large and small identify patterns and propose how we can automate repetitive tasksRecommend and implement solutions to improve performance of workloadsSupport diverse Engineering Design Automation environmentWhat we need to see:Bachelor s degree in computer science or equivalent experience7+ years Linux systems administration experience specifically in managing or supporting RedHat and/or Centos Linux in production environmentsExperience with configuration management tools: Ansible, Puppet, ChefAbility to technically lead a project through the lifecycleScripting skills: highly skilled in a at least two typical scripting languages (shell/bash, perl, python, ruby)Excellent problem solving, multitasking, troubleshooting skills and attention to details are required to work in this challenging and dynamic environmentVery strong interpersonal, customer service, result oriented and team building skillsWays To Stand Out From The Crowd:Experience with IBM Platform LSF, Grid Engine, NC or similar technologiesExperience with Quest QASIn depth knowledge of EDA design flows and supporting EDA toolsExperience with AWS infrastructures as code (IaC) using Terraform or CloudformationFamiliarity with Splunk and Check-MK for troubleshooting and monitoringExperience with source / version control systems,
Keyskills :
root cause analysishigh performance computingroot causefile systemplatform lsfpath findingteam buildingservice levelgrid computingproblem solving