Solutions Architect, AI Infrastructure
Job Description
NVIDIA is looking for an experienced systems and network infrastructure Solutions Architect Engineer. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in the field? We are looking for a compute and networking savvy Solution Architect to join the NVIDIA Solution Architecture Engineering (SA) team focused on supporting accelerated computing applications.
As part of the NVIDIA SA organization, you will be driving our end-to-end technology solutions integration with some of NVIDIA's most strategic technology customers, as well as offering recommendations to business and engineering teams on our product technology.
What you'll be doing:
Working with NVIDIA Consumer Internet and IT Services customers on data center GPU server and networking infrastructure deployments as solution architect. Guide customer discussions on network topologies, compute/storage and support bring up of server/network/cluster deployments. You will need to visit customer data center during bring up phase.
Identifying new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the Systems/Network Engineering, Product management and Sales teams
Work as customer trusted advisor conducting regular technical customer meetings for product roadmap, cluster debug, feature discussions and introduction to new technology solutions
Building custom product demonstrations and POCs for solutions that address critical business needs of our customers
Analyzing and debugging compute/network performance issues
What we need to see:
BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
This role is for an individual with the motivation and skills to drive the data center engineering process. Ideal candidate has 5+ years of Solution Engineering (or similar Engineering roles) experience
System level understanding of server architecture, NICs, Linux, system software and kernel drivers
Practical knowledge of Networking - switching & routing for Ethernet/Infiniband, and Data Center infrastructure (power/cooling)
Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes
Effective time management and capable of balancing multiple tasks
Ability to communicate your ideas/code clearly through documents, presentation etc
Ways to stand out from the crowd:
External customer facing skill-set and background
Experience with bringup and deployment of large clusters
Systems engineering, coding, and debugging skills including experience with C/C++, Linux kernel and drivers
Hands-on experience with NVIDIA systems/SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g., DPU, RoCE, InfiniBand), and/or ARM CPU solutions
Familiarity with virtualization technology concepts
We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site visit to customers and industry events. We are open to remote work location and look forward to have you join our team!
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 148,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.