Sr. Infrastructure Reliability Engineer, Infrastructure Reliability

1 month ago


Dublin, Ireland Amazon Data Services Ireland Limited Full time
As an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure equipment (Example: Air Handling Units, LV Generator, MV Transformers, LV SWGR, Breakers, UPS, Chillers etc.). You will also be responsible for root cause analysis of critical equipment failures and drive the continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and outside partners including suppliers to drive key aspects of product specification, risk identification plan and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment.

Our Snr Reliability Engineers have experience in using Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during product design, manufacture as well as deployment stages. They drive AWS application-specific requirements in carrying out both lifecycle environmental and operational stress driven risk analysis, including thermal, electrical, chemical and mechanical stresses so to identify overstress and fatigue-related product weaknesses. Evaluate product design quality/reliability risks and assess electronics manufacture process related quality/reliability issues.

They drive critical component identification and the associated vendor selection and qualification requirements. Using their knowledge of process capability for electronic component production as well as system-level performance requirements to establish critical to quality and reliability metrics, they develop datacenter system level reliability model and related reliability quantification and risk analysis for datacenter configuration optimization.

During sustaining stage, you will be responsible for monitoring product performance in the field and will be responsible to drive root cause analysis of any critical failures and the associated corrective and preventive actions. You will drive effective vendor auditing and quarterly review process to drive the continuous improvements of datacenter availability.

As an SME in the reliability engineering field and product reliability leadership, as well as business negotiations and program management, you will conduct problem analysis and solve as well as communicate with vendors.

In this role, you will be required to travel within EMEA and internationally.

About the team
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.

You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

We are open to hiring candidates to work out of one of the following locations:

Dublin, D, IRL

BASIC QUALIFICATIONS

- Bachelor's or Master’s degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or related field
- 8+ years of Reliability Engineering work experience in high reliability industry
- 5+ years of experience with failure analysis activities and root cause analysis
- 5+ years of experience with accelerated life testing, stress analysis and finite element analysis

PREFERRED QUALIFICATIONS

- Ph.D. in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or a related field.
- 10+ years of work experience in reliability risk identification and assessment from component to system level applying analytical, experimental and statistical approaches to evaluate product design and manufacture quality/reliability levels
- Experience with proactive and effective reliability approaches in a cost-effective manner throughout product design, manufacture and deployment stages
- Proven experience in working with external design and manufacturing supply chain partners.
- Familiarity with major data center infrastructure equipment reliability performance
- Ability in managing multiple qualification activities and development schedules



  • Dublin, Ireland Google Inc. Full time

    Site Reliability Manager, Technical Infrastructure corporate_fare Google place Dublin, Ireland Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. Experience with data structures or algorithms. Experience with software development in one or more programming languages. Experience managing people or...


  • Dublin, Dublin City, Ireland Circle Full time

    Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that...


  • Dublin, Ireland CIRCLE Full time

    Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that...


  • Dublin, Ireland Distilled Full time

    What’s the Opportunity You’ll be part of an experienced Site Reliability Team where you'll collaborate closely with software and quality engineers. We value everyone's input and lean on our team's collective experience to continuously enhance both our processes and platforms. We're on the lookout for a Site Reliability Engineer. In...


  • Dublin, Ireland Workato Inc Full time

    Responsibilities As a Senior Infrastructure Engineer, you will be responsible for deploying, scaling, and maintenance of services at the core of the Workato Platform. You will closely work with Data Engineers, and Developers as a part of a small, flexible team and will have a direct impact on the process of modernization and maturation of the platform...


  • Dublin, Ireland Daft Media Limited Full time

    What’s the OpportunityYou’ll be part of an experienced Site Reliability Team where you'll collaborate closely with software and quality engineers. We value everyone's input and lean on our team's collective experience to continuously enhance both our processes and platforms.We're on the lookout for a Site Reliability Engineer. In this role, you'll thrive...

  • Reliability Engineer

    3 hours ago


    Dublin, Dublin City, Ireland Cpl Full time

    SK biotek Ireland is actively looking for a Reliability Engineer to join their Maintenance Department in Swords, Co. Dublin on a 12-month temporary basis facilitated by CPL.Key Responsibilities:Act as a key technical person responsible for resolving repetitive failures and long-term issuesDefine the reliability strategy within the area of expertiseIdentify...


  • Dublin, Ireland Workato Full time

    About Workato Workato is the only integration and automation platform that is as simple as it is powerful and because its built to power the largest enterprises, it is quite powerful.  Simultaneously, its a low-code/no-code platform. This empowers any user (dev/non-dev) to painlessly across any apps and databases. Were proud to be named a leader...


  • Dublin, Dublin City, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Ireland Adobe Full time

    Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...


  • Dublin, Ireland Google Inc. Full time

    Staff Software Engineer, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. Candidates will typically have 5 years of experience with software development in one or more programming languages. Typically 8 years of experience with...


  • Dublin, Ireland Whatnot Full time

    Whatnot is a livestream shopping platform and marketplace backed by Andreessen Horowitz, Y Combinator, and CapitalG. We’re building the future of ecommerce, bringing together community, shopping and entertainment. We are committed to our values , and as a remote-first team, we operate out of hubs within the US, Canada, UK, Ireland, and Germany today. ...


  • Dublin, Dublin City, Ireland Cpl Full time

    SK biotek Ireland are seeking to recruit a Maintenance & Reliability Engineer to join the Maintenance Department based in Swords, Co. Dublin on a temporary 12 month basis through CPL. Key responsibilitiesThe Reliability Engineer act as a key technical person responsible for resolution of repetitive failures and long term issues and to define the reliability...


  • Dublin, Ireland Whatnot Full time

    ???? Whatnot Whatnot is a livestream shopping platform and marketplace backed by Andreessen Horowitz, Y Combinator, and CapitalG. We’re building the future of ecommerce, bringing together community, shopping and entertainment. We are committed to our values , and as a remote-first team, we operate out of hubs within the US, Canada, UK, and Germany...


  • Dublin, Ireland Google Inc. Full time

    Software Engineering Manager II, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. Candidates will typically have 8 years of experience with data structures or algorithms. Typically 5 years of experience with software...


  • Dublin, Ireland Whatnot Full time

    Whatnot Whatnot is a livestream shopping platform and marketplace backed by Andreessen Horowitz, Y Combinator, and CapitalG. We’re building the future of ecommerce, bringing together community, shopping and entertainment. We are committed to our values, and as a remote-first team, we operate out of hubs within the US, Canada, UK, Ireland, and...


  • Dublin, Ireland Google Inc. Full time

    Principal Engineer, AI, Trust, Security, Site Reliability Engineering link Copy link corporate_fare Google place Dublin, Ireland bar_chart Director+ Apply link Copy link Bachelor's degree in Computer Science, similar technical field, or equivalent practical experience. Experience in technical leadership and setting technical direction for...


  • Dublin, Dublin City, Ireland Cpl Full time

    Infrastructure Operations Manager will oversee and optimize infrastructure operations services across multiple data centers and sites in EMEA. This role involves direct interaction with managed service providers, ensuring the strategic direction and efficient management of operational activities. The incumbent will serve as a key liaison between my client...


  • Dublin, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Ireland Workday Limited Full time

    About the Role Are you a creative SRE looking for more opportunities to improve reliability and enjoy building solutions to reduce toil and manual effort? As a Senior Associate Site Reliability Engineer at Workday Inc, you will have the chance to work on a diverse range of projects and tools, and contribute to the flawless operation of our world-class...