Senior Research Scientist, AWS Incident Tooling

2 weeks ago


Dublin, Dublin City, Ireland Amazon Full time
Senior Research Scientist, AWS Incident Tooling & Response

AWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for talented people who want to help.

You'll join a diverse team of software, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety, security, and availability. You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

AWS Incident Response is at the heart of the high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by driving large scale event and incident response. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the tooling and automation. We also provide manual incident management for AWS and other Amazon groups, directing the resolution of an issue with service teams, and diving deep into those events to drive improvements to the tooling. It's an exciting time to join our team as we are growing and expanding our offerings.

Key job responsibilities:

1. You will own the organization strategy relative to the usage of ML, GenAI and propose the best technology to advance our ability to better detect, faster root cause, and correlate to prior incidents to shorten customer facing AWS incidents.
2. Your work will enable us to identify gaps in our current strategy, learnings from past incidents.
3. You will contribute to shortening incident response through deep analysis and introduction of new technology.

Minimum Qualifications:

1. Masters degree (or European advanced degree equivalent) or PhD in Computer Science, or related technical, math, economics, or scientific field.
2. Several years of relevant experience in developing large scale machine learning or deep learning models and/or systems in a production environment.
3. Experience in using Python, R or Matlab or other statistical/machine learning software language.
4. Several years experience specifically with deep learning (e.g., CNN, RNN, LSTM, etc.).
5. Experience hiring or mentoring more junior colleagues.

Preferred Qualifications:

1. PhD degree in computer science, engineering, mathematics, economics, or related technical/scientific field.
2. Hands on experience building models with deep learning frameworks like PyTorch, or similar.
3. Experience with machine learning, time series, NLP and CV solutions.
4. Proven communication skills, presentation skills, and attention to detail.
5. Comfortable working in a fast paced, highly collaborative, dynamic work environment.
6. Scientific thinking and the ability to invent, a track record of thought leadership and contributions that have advanced the field.

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build.

Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/content/en/how-we-hire/accommodations.

#J-18808-Ljbffr

  • Dublin, Dublin City, Ireland ENGINEERINGUK Full time

    Senior Research Scientist, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and we're...


  • Dublin, Dublin City, Ireland Amazon Full time

    Senior Research Scientist, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and possible...


  • Dublin, Dublin City, Ireland Amazon Full time

    Senior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...


  • Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full time

    AWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for talented people who want to help.You'll join a diverse...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Manager, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and...


  • Dublin, Dublin City, Ireland ENGINEERINGUK Full time

    Software Development Manager, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...


  • Dublin, Dublin City, Ireland ENGINEERINGUK Full time

    Software Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...


  • Dublin, Dublin City, Ireland TN Ireland Full time

    Social network you want to login/join with:Software Development Engineer, AWS Incident Tooling & Response, DublinClient:Amazon Data Services Ireland LimitedLocation:Dublin, IrelandJob Category:OtherEU work permit required:YesJob Reference:ff1326e23bd5Job Views:3Posted:22.03.2025Expiry Date:06.05.2025Job Description:Amazon Web Services is the largest consumer...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...


  • Dublin, Dublin City, Ireland Amazon Full time

    Software Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...


  • Dublin, Dublin City, Ireland Amazon Full time

    Amazon never asks for fees or deposits in any form during recruitment process.Please click here to learn more and safeguard yourself from potential frauds.Software Development Engineer, AWS Incident Tooling & ResponseJob ID: | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge...


  • Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full time

    The AWS Incident Response team is responsible for ensuring the high availability of Amazon Web Services. As a Support Engineer, you will play a key role in providing large-scale event and incident management.You will lead projects to improve the tooling and automation, and provide manual incident management for AWS and other Amazon groups. This includes...


  • Dublin, Dublin City, Ireland Amazon Data Services Ireland Limited Full time

    Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet. We work closely with the...


  • Dublin, Dublin City, Ireland Amazon Full time

    AWS Incident Tooling Overview">AWS Incident Tooling is at the heart of the high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by detecting early large-scale events and providing the tooling to enable fast mitigation.

  • Graphics Researcher

    20 hours ago


    Dublin, Dublin City, Ireland Huawei Ireland Research Center Full time

    Huawei Ireland Research Center is seeking a highly skilled Graphics Researcher & Scientist to join our team in Dublin. As a key member of our Game Rendering Lab, you will be responsible for researching and developing new graphics algorithms for mobile games.The ideal candidate will have a strong passion for Computer Graphics and a drive to deliver innovative...


  • Dublin, Dublin City, Ireland Huawei Ireland Research Center Full time

    About Huawei Ireland Research CentreHuawei Ireland Research Centre is a leading hub for research and innovation in AI, Cloud reliability, and Infrastructure efficiency. Our goal is to provide cutting-edge solutions to support various businesses at a global scale.We strive to deliver exceptional user experiences through our intelligent solutions.About the...