SRE Manager

4 weeks ago


Dublin, Ireland Apple Inc. Full time

The Apple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Join the Apple Services Engineering Cloud Service Infrastructure team as a Site Reliability Engineering Manager to help support and scale cloud services for millions of Apple users. We build and support new and existing critical infrastructure systems and frameworks that provide services such as structured and unstructured storage, caching, queueing, and search at hyperscale. These form the platform upon which many iCloud and other backend systems at Apple are built. The team is responsible for the next-generation platform that will power Apple’s infrastructure services. These services operate at extremely large scale and store exabytes of data. The platform will support a variety of services based on open-source software, such as Kubernetes, Cassandra, ZooKeeper, Kafka, and Redis, alongside internally developed services. This is a hands-on role: you will establish SRE practices for a private cloud service and accelerate our ability to deliver thousands of applications reliably and consistently. You will lead a team of Site Reliability Engineers who thrive in a fast-paced workplace, where drive and collaboration are the keys to success.

Description

The Apple Services Engineering Cloud Services SRE organization is looking for a strong, hands-on leader to head a platform-focused SRE team and take responsibility for the reliability of the platform. The platform serves workloads that provide our organisation and our customers with their favourite applications, services, and tools. We are domain experts in fleet management, systems, and software engineering. We build automations, instrument reliability tools, and respond to alerts and incidents that may pose a risk to the reliability of the platform. The team’s focus is on infrastructure capabilities and processes, improving the reliability and efficiency of the systems at scale.

Responsibilities include:

1. Act as the Service Owner, designing and mapping key performance indicators to achieve the organization’s mission
2. Lead the definition of requirements, priorities and planning of engineering deliverables
3. Implement structured engineering and operations processes
4. Lead the team in daily agile SRE practices, ensuring proper team focus on priorities, achievements, and deliverables
5. Optimise velocity and efficiency of delivery, and drive continuous improvement

Success depends on a strong understanding of SRE principles and practices, combined with a track record of resolving issues in a live production environment and implementing strategies to minimize them, while driving clear action plans for the team. The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. As a leader, they are responsible for coaching and mentoring their team members, helping them achieve service goals and build career paths aligned with those goals. It’s imperative for the leader to empower their team by providing appropriate context and timely feedback. The leader will not only own the service, but will also collaborate with other teams within Apple. They will build trust with stakeholders and partners through diplomacy, discussion, and follow-through. This is a broad, cross-organisation role with high visibility, collaborating with multiple teams. They are expected to invest in and build good relations with key partners. Their collaboration with internal customers, product engineering, and development groups is critical to success.

Minimum Qualifications

- Experience with critical, large-scale distributed systems, combining hardware, operating systems, and software
- Experience building and leading engineering teams; ideally SRE or Production Engineering
- Strong emphasis on SRE as an engineering subject area, with proficiency in at least one of the following languages: Golang, Rust, Python, Swift
- Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil through code and process improvements
- Superb interpersonal skills, capable of working with multi-functional technical and business teams and varying levels of management, influencing decision making
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent experience

Preferred Qualifications

- Experience working with large bare-metal infrastructure and release management
- Experience with large-scale server provisioning, fleet management, and maintenance
- Experience with development within the Kubernetes ecosystem, including the operator framework, controllers, and CRDs
- Hardware bootstrap and associated security (PXE, BIOS, TPM, secure boot, trusted computing)
- Automating operations processes via services and tools
- Configuration management and fleet orchestration via Puppet, Chef, Ansible, or others



  • SRE Manager

    6 days ago


    Dublin, Dublin City, Ireland TN Ireland Full time

    Abbott in Cherrywood (Dublin) is looking for an SRE (Site Reliability Engineer) Manager to join our Cardiac Rhythm Management (CRM) Business. This is a newly created role for an SRE professional who is passionate about joining a healthcare leader and shaping the future of Health-Tech (also known as Med-Tech). As an SRE Manager, you will lead the SRE Team and...

  • SRE Manager

    7 days ago


    Dublin, Dublin City, Ireland Apple Inc. Full time

    Apple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineering Manager, to help support and scale cloud services for millions of Apple users. We are building and supporting new and existing...


  • Dublin, Dublin City, Ireland ENGINEERINGUK Full time

    Job Summary: We are seeking a highly skilled DevOps and SRE Team Manager to lead our team of engineers in delivering high-quality cloud infrastructure services to our customers. Responsibilities: Lead and manage a team of DevOps and SRE engineers to design, implement, and operate cloud infrastructure solutions. Develop and maintain technical strategies and...

  • SRE Data Engineer

    4 weeks ago


    Dublin, Ireland IBM Computing Full time

    Introduction This role is responsible for designing, deploying, and maintaining our infrastructure, data processing application and CI/CD pipelines with large volumes of data. The position will be part of the SRE Data team and will primarily work with R&D, SRE and Data analysts. The work includes designing, building and deploying high availability, robust,...


  • Dublin, Dublin City, Ireland Google Full time

    Software Engineer III, SRE, Display Ads Targeting SRE Company: Google Location: Dublin, Ireland Mid-Level Experience: Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Minimum Qualifications: Bachelor's degree in Computer Science, a related field, or equivalent...


  • Dublin, Dublin City, Ireland MongoDB Full time

    About Us: MongoDB empowers innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our Mission: We help organizations build and run modern...


  • Dublin, Ireland The Recruitment Company Full time

    Location: Dublin Job type: 12 Month Contract Salary: €450 per day The Purpose of This Role We seek a self-driven engineer to scale our client's public cloud presence. As a Cloud SRE, you'll work with platform teams to ensure reliable runtimes for business-critical workloads. Ideal candidates have a background in software or systems engineering...


  • Dublin, Dublin City, Ireland Apple Inc. Full time

    About This Role: As a Site Reliability Engineering Manager, you will be responsible for leading a team of Site Reliability Engineers who thrive in a fast-paced workplace. You will be part of a team that builds and supports new and existing critical infrastructural systems and frameworks. This includes services like structured and unstructured storage, caching,...


  • Dublin, Dublin City, Ireland Avature Full time

    This role is responsible for designing, deploying, and maintaining our cloud-based infrastructure, data processing application, and CI/CD pipelines with large volumes of data. The position will be part of the SRE Data team and will primarily work with R&D, SRE, and Data analysts. The work includes designing, building, and deploying high availability, robust,...

  • SRE Engineer

    4 weeks ago


    Dublin, Dublin City, Ireland Cpl Full time

    · Dublin – Hybrid · Long Term Rolling Contract · Negotiable Day Rates Global consultancy is looking for an experienced SRE/DevOps Engineer to join their team. This is a hybrid role with 1 day on site in Dublin City Centre. · AWS Architecture - Deep understanding of AWS and automation concepts, practices, and procedures. Deep understanding of...


  • Dublin, Dublin City, Ireland Avature Full time

    We are seeking an experienced SRE Data Engineer to join our Avature team. As a key member of our SRE Data team, you will design, deploy, and maintain our cloud-based infrastructure, data processing application, and CI/CD pipelines. You will work closely with R&D, SRE, and Data analysts to develop and implement high availability, robust, resilient, and...


  • Dublin, Dublin City, Ireland Google Full time

    Software Engineering Manager II, Cloud SQL SRE Google Dublin, Ireland Minimum Qualifications: Bachelor's degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with data structures or algorithms. 5 years of experience with software development in one or more programming languages. 3 years of...


  • Dublin, Dublin City, Ireland MongoDB Full time

    MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB...


  • Dublin, Dublin City, Ireland Apple Inc. Full time

    About Us: Apple Inc. has a long history of pushing boundaries in the tech industry. Our Services Engineering team is one of the most exciting examples of this passion for innovation. In this role, you will be part of a team that builds and supports new and existing critical infrastructural systems and frameworks. This includes services like structured and...

  • SRE Engineer

    4 days ago


    Dublin, Ireland Cpl Full time

    · Dublin – Hybrid · Long Term Rolling Contract · Negotiable Day Rates Global consultancy is looking for an experienced SRE/DevOps Engineer to join their team. This is a hybrid role with 1 day on site in Dublin City Centre. · AWS Architecture - Deep understanding of AWS and automation concepts, practices, and procedures. Deep understanding of...

  • Innovative SRE Leader

    16 hours ago


    Dublin, Dublin City, Ireland Adecco UK Limited Full time

    About the Role: We are looking for an experienced Principal or Senior Research Engineer AI Ops to join our team in Dublin. As a key member of our AIOps team, you will be responsible for designing and developing innovative analytics solutions for observability, incident response, and change management within Cloud. You will work closely with researchers,...


  • Dublin, Ireland Avature Full time

    This role is responsible for designing, deploying, and maintaining our infrastructure, data processing application and CI/CD pipelines with large volumes of data. The position will be part of the SRE Data team and will primarily work with R&D, SRE and Data analysts. The work includes designing, building and deploying high availability, robust, resilient and...


  • Dublin, Ireland Google Inc. Full time

    Software Engineer II, Site Reliability Engineering, Bandwidth SRE Corporate: Google Location: Dublin, Ireland Early Experience completing work as directed, and collaborating with teammates; developing knowledge of relevant concepts and processes. Minimum Qualifications: - Bachelor’s degree in Computer Science, a related field, or equivalent practical...


  • Dublin, Ireland Avature Full time

    This role is responsible for designing, deploying, and maintaining our infrastructure, data processing application and CI/CD pipelines with large volumes of data. The position will be part of the SRE Data team and will primarily work with R&D, SRE and Data analysts. The work includes designing, building and deploying high availability, robust, resilient and...

  • SRE DevOps

    4 weeks ago


    Dublin, Ireland Tiger Resourcing Group Full time

    REDevops (SRE + Automation + DevOps) - 2 Positions Role: Permanent Location: Dublin, Ireland Job description: The Role Plan, manage, and oversee all aspects of a Production Environment. Define strategies for Application Performance Monitoring, Optimization in Prod environment. Respond to Incidents and improvise platform based on...