• AI / HPC Network

    Meta (Menlo Park, CA)
    … fabric, host networking, communication libraries, and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. ... software, leveraging software defined networking principles. 14. Understanding of AI technologies and associated network technologies (IB/RDMA/RoCE) **Preferred… more
    Meta (12/03/24)
    - Related Jobs
  • Solutions Architect - HPC AI

    NVIDIA (CA)
    …can make a lasting impact on the world. NVIDIA Infrastructure Specialists team seeks an HPC / AI Infiniband Network Engineer to help customers realize ... to the world's largest and most sophisticated data centers and supercomputers. HPC Network Engineers deliver the technologies, solutions and services customers… more
    NVIDIA (01/09/25)
    - Related Jobs
  • Senior HPC AI Cluster…

    NVIDIA (Santa Clara, CA)
    …an experienced HPC Engineer to join the E2E software verification HPC / AI Infrastructure team. We are building supercomputers and HPC clusters based ... be doing: + Designing, implementing and maintaining large scale HPC / AI clusters with monitoring, logging and alerting...of resources + Deploying monitoring solutions for the servers, network and storage + Troubleshooting and fixing, bottom up… more
    NVIDIA (01/27/25)
    - Related Jobs
  • Senior Observability Engineer , AI

    NVIDIA (Santa Clara, CA)
    Engineer to help architect and implement our distributed observability systems for AI and HPC clusters. We serve and collaborate directly with NVIDIA's ... retrieval, and visualization to spectacularly improve efficiency, performance, and productivity of AI and HPC workloads. You will develop, deploy, and operate… more
    NVIDIA (01/31/25)
    - Related Jobs
  • AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Lead ... 5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques. **Minimum… more
    Meta (01/26/25)
    - Related Jobs
  • AI / HPC Systems Performance…

    Meta (Austin, TX)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
    Meta (01/30/25)
    - Related Jobs
  • AI Infrastructure Engineer

    Cisco (Research Triangle Park, NC)
    …and communicate advanced technical concepts. A talented and passionate engineer comfortable working in high-pressure, large-scale enterprise environments. What You ... and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in...* 7+ years of previous experience deploying and administrating HPC clusters * Familiar with GPU resource scheduling managers… more
    Cisco (11/17/24)
    - Related Jobs
  • Senior Product Architect, HPC and AI

    NVIDIA (Santa Clara, CA)
    …harness your infrastructure expertise to create reference designs for the world's most powerful AI clusters. As an AI / HPC Product Architect at NVIDIA, you'll ... experience with benchmarking systems and analyzing performance bottlenecks in large-scale AI / HPC infrastructure + Exceptional communication skills, with the… more
    NVIDIA (01/23/25)
    - Related Jobs
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing… more
    Amazon (12/20/24)
    - Related Jobs
  • Senior HPC Technical Support…

    NVIDIA (Durham, NC)
    We are seeking a motivated Senior HPC Support Engineer - Ethernet, passionate about data center and networking technologies, to provide comprehensive solutions ... (multi-distro), with the focus on NVIDIA Ethernet Switching technologies and our AI End-to-End Solutions. + Responding to customer product support inquiries via… more
    NVIDIA (02/05/25)
    - Related Jobs