Senior Engineer, Kubernetes Infrastructure

Senior Engineer, Kubernetes Infrastructure
Company:

Coreweave


Details of the offer

About the role:An engineering practice is only as healthy as its foundational dependencies and CoreWeave’s Kubernetes Infrastructure Team supports the platform and tools that underpin nearly every part of the cloud. Responsible for our internal Kubernetes-on-metal clusters in each datacenter, engineers on this team have the mission to manage and scale Kubernetes in one of one of the fastest growing clouds in the world. The domain of bare-metal day-0+ reliability engineering offers unique and rewarding challenges in orchestration, fleet operations, testing, observability and automation and every team member will have opportunities to develop their skills with Kuberenetes in an environment unique to being a cloud-builder, not just a cloud-consumer.We are seeking a Senior Engineer to join the Kubernetes Infrastructure team and help us grow our orchestration platforms in scale, reliability, and featureset. This individual will join a team of 4-6 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Infrastructure Team, you would have the opportunity to:Design and implement solutions to fascinating problems of scale for provisioning and managing (many) bare-metal Kubernetes clusters in a hands-free, growing environment.Develop a toolchain and program for testing and developing against a complex cloud environment at a scale that remains agile.Create custom Kubernetes interfaces, gateways, and orchestrators all managed using Gitops tools such as Argo CD and Helm.Improve the performance, security, and reliability of our internal Kubernetes platforms and participate in the Kubernetes Infrastructure on-call rotation.Build dashboards, alerts, and insights into the customer experience using Grafana and Prometheus ecosystem tools.Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.Wondering if you’re a good fit?We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match.Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk. You have four or more years of experience in a software or infrastructure engineering industryYou have experience operating services in production and at scale.You have some experience using Kubernetes with a conceptual understanding of its major components and/or have administered unmanaged (eg, not EKS/GKE) Kubernetes clusters with some form of automation such as KubeSpray.You’re comfortable with the idea of using Go as your primary programming language.You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000 to $200,000 annually. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.Hybrid WorkplaceSuccessful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.


Source: Greenhouse

Requirements

Senior Engineer, Kubernetes Infrastructure
Company:

Coreweave


JIRA Developer

CoreWeave is looking for a JIRA Developer to join our growing team. In this role, you will be part of a team that is responsible for managing and optimizing ...


From Coreweave - New Jersey

Published 18 days ago

HPC Network Developer

About the Role:Our HPC Network teams have a maniacal focus on delivering world-class network infrastructure by way of top notch automation on top of modern a...


From CoreWeave - New Jersey

Published 14 days ago

Hardware Engineer, GPU Infrastructure

CoreWeave is seeking a highly skilled and motivated Infrastructure/Hardware Engineer, focusing on GPU and PCIe troubleshooting, to join our Hardware Engineer...


From CoreWeave - New Jersey

Published 12 days ago

Product Support Specialist I

Company Description At Xplor, we believe that helping people make the most of each day is the most rewarding way to spend ours.We give small and medium-sized...


From Xplor - Utah

Published 19 days ago

Built at: 2024-06-16T14:13:00.538Z