- Meta (Bellevue, WA)
- … fabric, host networking, communication libraries, and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. ... software, leveraging software defined networking principles. 14. Understanding of AI technologies and associated network technologies (IB/RDMA/RoCE) **Preferred… more
- Meta (Menlo Park, CA)
- … fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of GPUs together. In… more
- Meta (Washington, DC)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
- Cisco (Research Triangle Park, NC)
- …and communicate advanced technical concepts. A talented and passionate engineer comfortable working in high-pressure, large-scale enterprise environments. What You ... and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in...* 7+ years of previous experience deploying and administrating HPC clusters * Familiar with GPU resource scheduling managers… more
- Samsung SDS America (Ridgefield Park, NJ)
- …Summary: We are seeking a highly skilled and experienced Data Center Storage Engineer with exposure to High Performance Computing ( HPC ) and GPU Infrastructure. ... Frost & Sullivan for our expertise in Managed Cloud Services, Cloud Security, and AI innovation. We're proud to play a pivotal role in the digital transformation… more
- NVIDIA (Durham, NC)
- We are seeking a motivated Senior HPC Support Engineer - Ethernet, passionate about data center and networking technologies, to provide comprehensive solutions ... (multi-distro), with the focus on NVIDIA Ethernet Switching technologies and our AI End-to-End Solutions. + Responding to customer product support inquiries via… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial ... Intelligence ( AI ) hardware and software technologies to production in customer...GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design,… more
- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for AI Infrastructure, you will join a dedicated team that is passionate ... equivalent experience. + 5+ years of experience. + Proficiency in Python and C++ for AI and HPC applications. + Experience using large scale multi node GPU… more
- Meta (Austin, TX)
- …Meta Silicon hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Responsibilities: 1. Lead the bring-up, validation, and ... ASIC productization in datacenter applications. 3. Utilize experience in accelerator and network ASIC architecture, AI workloads/ML models to design and… more
- Meta (Menlo Park, CA)
- …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Enabling reliable ... this role, you will be a member of the AI Networking Software team and part of the bigger...Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and… more