Senior Research Scientist, AWS Incident Tooling
7 days ago
You'll join a diverse team of software, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security and availability. You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
AWS Incident Response is at the heart of the high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by driving large scale event and incident response. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the tooling and automation. We also provide manual incident management for AWS and other Amazon groups, directing the resolution of an issue with service teams, and diving deep into those events to drive improvements to the tooling. It's an exciting time to join our team as we are growing and expanding our offerings.
Key job responsibilities
You will own the organisation strategy relative to the usage of ML, GenAI and propose the best technology to advance our ability to better detect, faster root cause ,and correlate to prior incidents to shorten customer facing AWS incidents. Your work will enable us to identify gaps in our current strategy, learnings from past incidents. You will contribute to shortening incident response through deep analysis and introduction of new technology.
About the team
AWS Infrastructure Services (AIS)
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we're looking for talented people who want to help.
You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it's in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve.
BASIC QUALIFICATIONS
- Masters degree (or European advanced degree equivalent) or PhD in Computer Science, or related technical, math, economics, or scientific field- Several years of relevant experience in developing large scale machine learning or deep learning models and/or systems in a production environment
- Experience in using Python, R or Matlab or other statistical/machine learning software language
- Several year experience specifically with deep learning (e.g., CNN, RNN, LSTM, etc.)
- Experience hiring or mentoring more junior colleagues
PREFERRED QUALIFICATIONS
- PhD degree in computer science, engineering, mathematics, economics, or related technical/scientific field- Hands on experience building models with deep learning frameworks like PyTorch, or similar
- Experience with machine learning, time series, NLP and CV solutions
- Proven communication skills, presentation skills, and attention to detail
- Comfortable working in a fast paced, highly collaborative, dynamic work environment
- Scientific thinking and the ability to invent, a track record of thought leadership and contributions that have advanced the field.
-
Senior Research Scientist, AWS Incident Tooling
4 weeks ago
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSenior Research Scientist, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and we're...
-
Senior Research Scientist, Aws Incident Tooling
4 weeks ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and possible...
-
Senior Research Scientist, AWS Incident Tooling
2 weeks ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...
-
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Manager, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and...
-
Dublin, Dublin City, Ireland Amazon Full time**Job Description**As a Senior Research Scientist on our team, you will own the organization's strategy for using Machine Learning (ML) and General Artificial Intelligence (GenAI). You will propose the best technologies to advance our ability to detect incidents, identify root causes, and correlate them to prior events. Your work will enable us to shorten...
-
Dublin, Dublin City, Ireland Amazon Full timeJoin a diverse team of software, security experts, operations managers, and other vital roles to help deliver the highest standards for safety, security, and availability across AWS Services. Collaborate with people across AWS to work on the most challenging problems, with constant new services and possible failure modes to prevent.Key...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Dublin, Dublin City, Ireland Amazon Full timeAmazon never asks for fees or deposits in any form during recruitment process.Please click here to learn more and safeguard yourself from potential frauds.Software Development Engineer, AWS Incident Tooling & ResponseJob ID: | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge...
-
Dublin, Dublin City, Ireland Amazon Data Services Ireland Limited Full timeAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet. We work closely with the...
-
AWS Incident Response Lead
4 days ago
Dublin, Dublin City, Ireland Amazon Full timeAbout AWS Incident ToolingAWS Incident Tooling plays a critical role in ensuring the high availability of Amazon Web Services (AWS). Our team is responsible for detecting and resolving issues within AWS infrastructure, leveraging automated tooling to minimize downtime and optimize recovery times.As a Software Development Manager on our team, you will lead...
-
Research Scientist for AI Innovation
16 hours ago
Dublin, Dublin City, Ireland Huawei Ireland Research Center Full timeAbout Huawei Ireland Research CentreHuawei Ireland Research Centre is a leading hub for research and innovation in AI, Cloud reliability, and Infrastructure efficiency. Our goal is to provide cutting-edge solutions to support various businesses at a global scale.We strive to deliver exceptional user experiences through our intelligent solutions.About the...
-
Senior Scientist for Pharmaceutical Research
24 hours ago
Dublin, Dublin City, Ireland Fastnet Full timeJob Description:Fastnet seeks a talented Senior Process Development Scientist to drive innovation in pharmaceutical manufacturing and biotechnology research.Your Key Responsibilities:Develop and execute research strategies, ensuring alignment with technical workflows.Lead teams of scientists, engineers, and analysts, fostering a culture of collaboration and...
-
AWS Incident Resolution Specialist
5 days ago
Dublin, Dublin City, Ireland Amazon Full timeA day in the life of an AWS Support Engineer is a dynamic and challenging experience.As part of our team, you will be at the forefront of delivering best-in-class incident management to Amazon businesses worldwide.Your primary responsibility will be to identify and resolve complex technical systems issues, utilizing your analytical skills and scripting...