Sr. Infrastructure Reliability Engineer, Infrastructure Reliability

3 weeks ago


Dublin, Dublin City, Ireland Amazon Full time
Sr. Infrastructure Reliability Engineer, Infrastructure Reliability & Quality

Job ID: | Amazon Data Services Ireland Limited

As an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure equipment (Example: Air Handling Units, LV Generator, MV Transformers, LV SWGR, Breakers, UPS, Chillers etc.). You will also be responsible for root cause analysis of critical equipment failures and drive the continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and outside partners including suppliers to drive key aspects of product specification, risk identification plan and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment.

Our Snr Reliability Engineers have experience in using Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during product design, manufacture as well as deployment stages. They drive AWS application-specific requirements in carrying out both lifecycle environmental and operational stress driven risk analysis, including thermal, electrical, chemical and mechanical stresses so to identify overstress and fatigue-related product weaknesses. Evaluate product design quality/reliability risks and assess electronics manufacture process related quality/reliability issues.

They drive critical component identification and the associated vendor selection and qualification requirements. Using their knowledge of process capability for electronic component production as well as system-level performance requirements to establish critical to quality and reliability metrics, they develop datacenter system level reliability model and related reliability quantification and risk analysis for datacenter configuration optimization.

During sustaining stage, you will be responsible for monitoring product performance in the field and will be responsible to drive root cause analysis of any critical failures and the associated corrective and preventive actions. You will drive effective vendor auditing and quarterly review process to drive the continuous improvements of datacenter availability.

As an SME in the reliability engineering field and product reliability leadership, as well as business negotiations and program management, you will conduct problem analysis and solve as well as communicate with vendors.

In this role, you will be required to travel within EMEA and internationally.

About the team
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we're looking for talented people who want to help.

You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.

Why AWS?
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture
Here at AWS, it's in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth
We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve in the cloud.

We are open to hiring candidates to work out of one of the following locations:

Dublin, D, IRL

BASIC QUALIFICATIONS

Bachelor's or Master's degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or related field
- 8+ years of Reliability Engineering work experience in high reliability industry
- 5+ years of experience with failure analysis activities and root cause analysis
- 5+ years of experience with accelerated life testing, stress analysis and finite element analysis

PREFERRED QUALIFICATIONS

Ph.D. in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or a related field.
- 10+ years of work experience in reliability risk identification and assessment from component to system level applying analytical, experimental and statistical approaches to evaluate product design and manufacture quality/reliability levels
- Experience with proactive and effective reliability approaches in a cost-effective manner throughout product design, manufacture and deployment stages
- Proven experience in working with external design and manufacturing supply chain partners.
- Familiarity with major data center infrastructure equipment reliability performance
- Ability in managing multiple qualification activities and development schedules

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice ) to know more about how we collect, use and transfer the personal data of our candidates.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need an adjustment during the application and hiring process, including support for the interview or onboarding process, please contact the Applicant-Candidate Accommodation Team (ACAT), Monday through Friday from 7:00 am GMT - 4:00 pm GMT. If calling directly from the United Kingdom, please dial tel: If calling from Ireland, please dial tel:

Posted: January 13, 2024 (Updated about 4 hours ago)

Posted: April 14, 2024 (Updated about 4 hours ago)

Posted: March 21, 2024 (Updated 1 day ago)

Posted: March 19, 2024 (Updated 1 day ago)

Posted: February 29, 2024 (Updated 1 day ago)

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

#J-18808-Ljbffr

  • Dublin, Dublin City, Ireland Token, Inc. Full time

    We're looking for an experienced Site Reliability Engineer (SRE) to help drive forward the platform at Our SRE team work closely with client facing teams and internal Engineering to make the service highly reliable and scalable. Here's what you get to do Design, develop, implement and own products and solutions to improve the security, reliability, and...


  • Dublin, Dublin City, Ireland Token, Inc. Full time

    We're looking for an experienced Site Reliability Engineer (SRE) to help drive forward the platform at Our SRE team work closely with client facing teams and internal Engineering to make the service highly reliable and scalable. Here's what you get to do Design, develop, implement and own products and solutions to improve the security, reliability, and...


  • Dublin, Dublin City, Ireland Daft Media Limited Full time

    What's the OpportunityYou'll be part of an experienced Site Reliability Team where you'll collaborate closely with software and quality engineers. We value everyone's input and lean on our team's collective experience to continuously enhance both our processes and platforms.We're on the lookout for a Site Reliability Engineer. In this role, you'll thrive in...


  • Dublin, Dublin City, Ireland Reddit Inc Full time

    Reddit SRE is rapidly innovating and leading the company on a mission to meet Redditor's user-experience expectations. Our teams are working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the...


  • Dublin, Dublin City, Ireland TikTok Full time

    About TikTokTikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.Why Join UsCreation is the core of TikTok's purpose. Our...


  • Dublin, Dublin City, Ireland Reperio Human Capital Full time

    Site Reliability Engineer 96880Desired skills: AWS, Containerisation, Terraform, Ansible, DevOpsThis is a hybrid position based in Dublin, Ireland.Requirements:Highly experienced across AWS technologies & servicesHands-on experience with containerisation technologies (Docker, Kubernetes)Scripting skills (Java, Python, Go)Exposure to Infrastructure as Code...


  • Dublin, Dublin City, Ireland MongoDB Full time

    Infrastructure Engineering TeamThe Infrastructure Engineering team is responsible for building and maintaining a self-service internal development platform that enables MongoDB engineering teams to reliably deploy and operate their own production services and products. We work with numerous engineering teams across the company to understand their...


  • Dublin, Dublin City, Ireland MongoDB Full time

    The worldwide data management software market is massive (According to IDC, the worldwide database software market, which it refers to as the database management systems software market, was forecasted to be approximately $82 billion in 2023 growing to approximately $137 billion in 2027. This represents a 14% compound annual growth rate). At MongoDB we are...

  • Reliability Engineer

    3 weeks ago


    Dublin, Dublin City, Ireland Cpl Healthcare Full time

    SK biotek Ireland are seeking to recruit a Reliability Engineer to join the Maintenance Department based in Swords, Co. Dublin on a temporary 12 month basis through CPL.Key responsibilitiesThe Reliability Engineer act as a key technical person responsible for resolution of repetitive failures and long term issues and to define the reliability strategy...


  • Dublin, Dublin City, Ireland dbt Labs Full time

    Since 2016, dbt Labs has been on a mission to help analysts create and disseminate organizational knowledge. dbt Labs pioneered the practice of analytics engineering, built the primary tool in the analytics engineering toolbox, and has been fortunate enough to see a fantastic community coalesce to help push the boundaries of the analytics engineering...


  • Dublin, Dublin City, Ireland Adobe Full time

    Site Reliability Engineer, Adobe Stock page is loaded Site Reliability Engineer, Adobe Stock Apply locations Dublin Remote Northern Ireland Remote Denmark Remote Ireland Maidenhead time type Full time posted on Posted Yesterday job requisition id R145266 Our Company Changing the world through digital experiences is what Adobe's all about. We give...

  • Reliability Engineer

    3 weeks ago


    Dublin, Dublin City, Ireland Cpl Full time

    SK biotek Ireland - Reliability EngineerSK biotek Ireland is looking for a Reliability Engineer to join the Maintenance Department in Swords, Co. Dublin on a temporary 12-month basis through CPL.Key Responsibilities:Resolve repetitive failures and long-term issues to define a reliability strategy.Identify engineering solutions to enhance operational...


  • Dublin, Dublin City, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Dublin City, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Dublin City, Ireland Whatnot Full time

    About UsWhatnot Whatnot is a livestream shopping platform and marketplace backed by Andreessen Horowitz, Y Combinator, and CapitalG. We're building the future of ecommerce, bringing together community, shopping, and entertainment. We are committed to our values, and as a remote-first team, we operate out of hubs within the US, Canada, UK, Ireland, and...


  • Dublin, Dublin City, Ireland Millennium Management Full time

    Infrastructure Software Developer We are looking for an Infrastructure Software Developer to join our WorldQuant Aligned Infrastructure team. The successful candidate will be responsible for delivering high-quality, reliable, and scalable infrastructure solutions and self-service tools for our rapidly growing internal users. The team is comprised of...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Staff Software Engineer, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor's degree in Computer Science, a related field, or equivalent practical experience.Candidates will typically have 5 years of experience with software development in one or more programming languages.Typically 8 years of experience with data...


  • Dublin, Dublin City, Ireland Whatnot Full time

    Whatnot is a livestream shopping platform and marketplace backed by Andreessen Horowitz, Y Combinator, and CapitalG. We're building the future of ecommerce, bringing together community, shopping and entertainment. We are committed to our values , and as a remote-first team, we operate out of hubs within the US, Canada, UK, Ireland, and Germany today. We're...


  • Dublin, Dublin City, Ireland Millennium Management LLC Full time

    Infrastructure Software DeveloperWe are looking for an Infrastructure Software Developer to join our WorldQuant Aligned Infrastructure team.The successful candidate will be responsible for delivering high-quality, reliable, and scalable infrastructure solutions and self-service tools for our rapidly growing internal users.The team is comprised of...


  • Dublin, Dublin City, Ireland Reperio Human Capital Full time

    IT Infrastructure Engineer 100736Desired skills: IT Infrastructure Engineer, Dublin, Permanent, Infrastructure, Cloud, ITIT Infrastructure EngineerDublin, IrelandSalary: 60K+Permanent, Full-timeWe are currently looking to speak with experienced Infrastructure Engineers for our client based in Dublin. This role would be a leadership role meaning we would be...