Principal DevOps Engineer

Nvidia

Santa Clara, CA

Job posting number: #7276154 (Ref:JR1986486)

Posted: August 28, 2024

Job Description

NVIDIA is the platform upon which every new AI-powered application is built. We are looking for a Principal DevOps Engineer to join our mission to continue improving our CI/CD infrastructure. Our team builds and operates sophisticated infrastructure to enable business critical services. You will be working with a team of passionate and skilled engineers that are continuously working to provide better tools to build and manage this infrastructure. Ideal candidate is strong in software development, designing and creating reliable distributed systems, and has the ability to implement well thought out long term maintenance strategy.
 

What you'll be doing:

  • Define best in class automation workflows and methodologies to improve upon existing DevOps and SRE practices.

  • Drive technical engagements with different organizations including product demonstrations and proof-of-concept with solution proposal and hands-on support.

  • Compile and standardize test suites to accelerate acceptance and health check cycle times.

  • Drive successful adoption of automation platform and tools.

  • Work closely with engineering team to drive design features and product roadmap to address solution gaps.

  • Automation and Scripting: Develop and maintain scripts for automation of various tasks, including build, deployment, and monitoring.

  • Work closely with software engineers, QA, and solution architects to ensure smooth deployment and operation of software. Mentor junior team members and provide guidance on industry standards.

  • Resolve issues in development and production environments.

What we need to see:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field or equivalent experience.

  • Minimum of 15 years of experience in a DevOps or SRE role.

  • Self driven, excellent in communication and partnership, stays on top of tasks, mission focused and outcome oriented.

  • Strong knowledge of bash/shell and Python for scripting and automation.

  • Deep knowledge of at least one distribution of Linux (Ubuntu, Centos, Amazon Linux, RHEL, Debian, etc.)

  • Experience with Bazel and Terraform for build and CI/CD processes.

  • Solid understanding of Git for version control and source code management.

  • Proficiency with Docker for containerization and in Kubernetes for orchestration and management of containerized applications.

Ways to stand out from the crowd:

  • Experience with implementing automation platforms such as Shoreline.

  • Familiarity with cloud platforms such as AWS, Azure, or GCP.

  • Familiarity with accelerated computing.

  • Experience with observability tools such as Prometheus or Grafana.

The base salary range is 248,000 USD - 391,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





Apply Now

Please mention to the employer that you saw this ad on CollegeJobs.com

Job posting number:#7276154 (Ref:JR1986486)
Application Deadline:Open Until Filled
Employer Location:Nvidia
Santa Clara,California
United States
More jobs from this employer