Ai Engineer
Company:

Diverse Lynx


Details of the offer

AI+HPC infra requirement

looking for someone with Architectural and design experience also along with experience in handling 1000+ nodes.

Technical/Functional Skills -
Proficiency in RoCEv2, K8s, KVM, Ubuntu, Python, Shell, Go, Rust, GPU drivers, and Cluster interconnect with 200G/400G networking.
Managing GPU clusters optimizing GPU-based services/tools/software

Roles & Responsibilities -

Develop, implement, and maintain GPU-based clusters of 10 to 1000 nodes, ensuring optimal performance and availability.
Administer Client/AI platforms - Distributed Client services, LLMs, Vector-DB and AI inferencing, by managing deployments, resource allocation, monitoring, and security.
Collaborate with cross-functional teams to address AI infrastructure requirements, support AI-related projects, and provide technical expertise.
Monitor and evaluate the performance of AI systems and clusters, ensuring that they adhere to industry best practices and meet company standards.
Compile reports, document procedures, and publish recommendations for improving AI infrastructure and solutions.
Use AI/Client to continuously improve internal processes and tools that are used in end-to-end delivery of your services in this team

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.
#J-18808-Ljbffr


Source: Grabsjobs_Co

Job Function:

Requirements

Ai Engineer
Company:

Diverse Lynx


Sr. Cloudsec Engineer

Job Summary:The Cybersecurity Engineer position is a hands-on role that involves evaluating and enforcing cybersecurity and compliance controls. This positio...


From Iherb - California

Published 19 days ago

Senior Engineer - Frontend (Platform Team)

Who We Are At OKX, we believe our future is reshaped with technology. Founded in 2017, OKX is one of the world’s leading cryptocurrency spot and derivativ...


From OKX - California

Published 19 days ago

Mechanical Engineer — Associate Program (Fall 2024)

Mechanical Engineer — AssociateAssociate Engineer positions typically last for twelve weeks, and are salaried roles designed for students who have already re...


From Astranis - California

Published 21 days ago

Lead Architect, Identity And Access Management

What you'll work on: Gather, share and coach industry best-practices regarding implementation of customer identity and access in both frontend and backend im...


From Circle - California

Published 19 days ago

Built at: 2024-05-23T23:15:01.148Z