Senior Director Of Platform And Site Reliability Engineering

Senior Director Of Platform And Site Reliability Engineering






Senior Director Of Platform And Site Reliability Engineering

Details of the offer

Site reliability and Platform Engineering are key functions in IT and this highly visible senior leadership role will be responsible for Infrastructure platforms and application support strategy, roadmap, and technical implementation of the IT Transformation programs
Manage Compute Platform as a service with end:to:end responsibility for delivering and supporting the on:prem and cloud compute platforms ( GCP, AWS), VMWARE, Kubernetes, Terraform, Ansible, CI/CD, Artifactory etc for continuously deploying applications.
Own automation for delivery of Platform services using Infrastructure as Code. Build standard playbooks for Platform which can be consumed across multiple teams in the organization.
Lead delivery of Cloud Infrastructure strategies aligned with business objectives with a focus on mass Application movements into the Cloud involving design, implementation and Infrastructure automation.
Build a high performing team of Cloud Platform SMEs and platform leads while mentoring traditional platform SMEs on cloud computing best practices, technology, and adoption.
Build and manage an SRE function that owns application availability and performance and manage it through automation and proactive/predictive alerts by having a strong data analytical tool set to identify areas of improvement
Implement comprehensive service monitoring to ensure uptime and performance, including synthetic, real user, system, application performance, dashboards etc.
Define, measure, and meet key Service Level Objectives including availability, performance, incidents and chronic problems
Own end:to:end availability and performance of mission critical services and build automation to prevent problem recurrence; eventually automate response to all non:exceptional service conditions.
Partner with application and business stakeholders to ensure high quality product is developed and released into production. Establish and periodically update the Release Policy which governs the release process and details release categories, release activities, role and responsibilities, exception, etc.
Work closely with Enterprise Architecture and Information Security to specify and document solutions and practices.
Keep abreast with evolving threats/risks, industry trends and work to implement best practices in the organization.

BA/BS degree in Computer Science or related technical field, or equivalent practical experience.
10+ years of hands:on technical experience combined with strong management and communication skills.
Solid understanding of Windows, Linux, Networking, TCP:IP, Routing, Switching, Firewalls, Load balancers and other infrastructure components
Solid understanding of modern cloud technologies and developer family of products: GKE, Istio, Serverless, Cloud Build, Monitoring and Logging, as well as the Microservices, DevSecOps etc.
Experience running revenue generating applications in a public cloud and IaaS, including real world experience with at least one public cloud provider: AWS, Google Cloud or Microsoft Azure
Experience building, scaling, and running production operations for heterogeneous applications.
Strong troubleshooting experience and skillset to resolve incidents across multiple domains.
Ability to nurture and support a strong operations culture: customer/service focus excellent technology; high quality implementations; self:motivated innovation and problem:solving.
Demonstrated ability of establishing and maintaining metrics:based process improvement
Demonstrated ability to develop strong alliances with those outside of your immediate organization
Experience in building and managing strong technical teams
Excellent communications, organization, and time management skills

Source: Tiptopjob_Xml



Senior data engineer

Optum is a company that's on the rise. We're expanding in multiple directions, across borders and, most of all, in the way we think. Here, innovation isn't...

From Unitedhealth Group - New York

Published a month ago

Software engineer, android framework

Company DescriptionSquare builds common business tools in unconventional ways so more people can start, run, and grow their businesses. When Square started, it...

From Square - California

Published a month ago

Sr. software engineer

Company Description As the world's leader in digital payments technology, Visa's mission is to connect the world through the most creative, reliable and secure...

From Visa - Texas

Published a month ago

Senior java/full

At Accenture, we’re building something great – not just an innovative, next-generation threat intelligence platform, but just as importantly we’re building a...

From Accenture - Washington

Published a month ago