• AI/ HPC Systems Performance…

    Meta (Austin, TX)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
    Meta (03/22/25)
    - Related Jobs
  • HPC Systems Engineer

    US Tech Solutions (Houston, TX)
    …Description:** + A minimum of 5 years' experience working in a large HPC enterprise environment comprising thousands of servers, large storage solutions, tape and ... Proficient in the installation, configuration and management of Linux based operating systems , preferably using RHEL, CentOS, Rocky Linux. + Experience with IBM's… more
    US Tech Solutions (03/14/25)
    - Related Jobs
  • HPC Platform Engineer I

    The University Of Texas At Dallas (Dallas, TX)
    …Job Description: Reporting to the Director of HPC Operations. This is a systems engineer with a background in a High Performance Computing environment and ... HPC ) resources and related research services. The engineer will demonstrate a customer service mindset and interact...support efforts, products and technologies. + Current knowledge of HPC best practice and systems deployment and… more
    The University Of Texas At Dallas (02/15/25)
    - Related Jobs
  • HPC /AI - Kubernetes Engineer

    Deloitte (Austin, TX)
    HPC /AI Engineer (Federal) Job Summary: The HPC...AI and HPC workloads on NVIDIA GPUs. + HPC Systems Support: Implement and manage on-premise ... will be responsible for managing the day-to-day operations of the High-Performance Computing ( HPC ) and AI infrastructure, ensuring all systems meet or exceed… more
    Deloitte (04/25/25)
    - Related Jobs
  • HPC Engineer - Hybrid

    Caris Life Sciences (Irving, TX)
    …ensuring fair allocation of computing resources. + Implementing security measures to protect HPC systems and data from unauthorized access. + Diagnosing and ... **Position Summary** An HPC (High Performance Computing) Engineer is...responsible for implementing, and maintaining a High Performance Computing ( HPC ) systems primarily running on Linux operating… more
    Caris Life Sciences (03/25/25)
    - Related Jobs
  • High-Performance Computing Systems

    Texas A&M University System (Kingsville, TX)
    Job Title High-Performance Computing Systems Engineer Agency Texas A&M University - Kingsville Department I Tech Proposed Minimum Salary Commensurate Job ... manages the High-Performance Computing cluster administration, unit coordination, maintaining HPC systems , strategic planning for the University's HPC more
    Texas A&M University System (02/19/25)
    - Related Jobs
  • Advanced Technology Senior Software…

    Wells Fargo (Westlake, TX)
    **About this role:** We are seeking a High-Performance Computing ( HPC ) Engineer with experience in Machine Learning to optimize and scale AI/ML workloads. The ... this role, you will:** + Design, develop, and optimize HPC solutions for large-scale ML workloads. + Optimize data...domains + Assure quality, security and compliance for supported systems and applications + Serve as a technical resource… more
    Wells Fargo (04/23/25)
    - Related Jobs
  • Hardware Systems Engineer , NPI AI

    Meta (Austin, TX)
    **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI Training and ... to Meta Silicon hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Responsibilities: 1. Lead the bring-up, validation,… more
    Meta (04/24/25)
    - Related Jobs
  • Hardware Systems Engineer , AI NPI

    Meta (Austin, TX)
    …approach to the new product introduction (NPI) phase. **Required Skills:** Hardware Systems Engineer , AI NPI Responsibilities: 1. Drive and execute end-to-end ... validation strategy (hardware and software), with a focus on various AI/ HPC hardware systems in datacenter applications. 2. Lead the bring-up, validation, and… more
    Meta (02/05/25)
    - Related Jobs
  • System Engineer - Interconnect

    Meta (Austin, TX)
    …Meta's custom AI hardware 2. Collect requirements and develop specifications for Rackscale AI/ HPC systems . 3. Develop and maintain code, to collect, analyze, and ... **Summary:** The Accelerator Reference Design Team is looking for a System Engineer to design, implement, and maintain hardware designs for custom AI hardware… more
    Meta (04/19/25)
    - Related Jobs