Software Development Engineer, AWS Incident Tooling

9 hours ago


Dublin, Ireland ENGINEERINGUK Full time

Software Development Engineer, AWS Incident Tooling & Response

DESCRIPTION

Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.

The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet. We work closely with the teams who own the largest AWS products, building systems to detect and mitigate operational issues before they impact customers. We are looking for a knowledgeable and experienced software development engineer to help us succeed in this mission.

As a Software Development Engineer at AWS Incident Response Systems, you will join the team in the design and implementation of systems which automate fault containment, problem diagnosis, and issue resolution across multiple hugely-distributed, always-on architectures. These systems will take metric and dependency data from multiple sources and analyse them, correlating them with customer impact to determine root cause of an issue without human intervention. They will create engagements, facilitate communication and coordination of the response and mitigation. As the scale and complexity of AWS grows, this is the best way that we can offer our customers a stable and reliable cloud computing platform. We succeed once these systems detect, diagnose, and repair operational defects without customer impact or human intervention.

You will work with teams across AWS to drive adoption of the software that has been built by the team, and influence systems development practices for new and existing products. You will define availability goals for service teams across AWS, and strategies to make these goals attainable with minimal effort. Your goal will be to remove human-error from the day-to-day operations of the massive, always-on, distributed systems which make up AWS.

Within your first year on the AWS Incident Response Systems team, you will have met with senior technical leaders from across AWS, designed and implemented at least one new system, and you will have dived deep into the causes of at least one historic external customer impacting event, and determined how to prevent a similar event from ever happening again. As your career continues to develop, you will influence the growth and direction not only of the Incident Response Systems team, but of the AWS group as a whole.
If this sounds like the right challenge for you, then please apply today

Key job responsibilities
- Write well-tested, maintainable code.
- Design, contribute to, and maintain systems which solve customer problems.
- Work with team-mates to improve code quality, system architecture and team processes.
- Learn about the incident management processes supported by the team's system to identify improvement opportunities.

BASIC QUALIFICATIONS
- Experience (non-internship) in professional software development.
- Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems.
- Experience programming with at least one software programming language.

PREFERRED QUALIFICATIONS
- Bachelor's degree in computer science or equivalent.
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice to know more about how we collect, use and transfer the personal data of our candidates.

#J-18808-Ljbffr



  • Dublin, Ireland Amazon Full time

    AWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help. You’ll join a...


  • Dublin, Ireland Amazon Full time

    AWS Incident Response is at the heart of high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by providing large scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the...


  • Dublin, Ireland Amazon Development Centre Ireland Limited Full time

    - Experience (non-internship) in professional software development - Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems - Experience programming with at least one software programming language Amazon Web Services is seeking a talented and passionate Software Development Engineer to join our Load...


  • Dublin, Ireland Amazon Development Centre Ireland Limited Full time

    - 3+ years of non-internship professional software development experience - 2+ years of non-internship experience designing or architecting new and existing systems (including design patterns, reliability and scaling) - Experience programming with at least one software programming language - Successful applicants must have the legal right to work in Ireland....


  • Dublin, Ireland ENGINEERINGUK Full time

    Sr. Technical Program Manager Automation and Tooling, AWS Trust & Safety DESCRIPTION Amazon Web Services (AWS) is built on the core principles of security, privacy, compliance, and transparency, which form the foundation of our trusted cloud infrastructure. We are seeking individuals who are dedicated to upholding these principles daily. Are you interested...


  • Dublin, Ireland ENGINEERINGUK Full time

    You will need to login before you can apply for a job. Software Developer Engineer, Platform - AWS Load Balancing Sector: Engineering, Technology Role: Professional Contract Type: Permanent Hours: Full Time DESCRIPTION Developers all over the world rely on AWS Load Balancing services to ensure their applications and services are highly available. The...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, AWS Security Job ID: 2816061 | Amazon Data Services Ireland Limited Join our team and help build innovative security services that actively detect and mitigate cyber threats across Amazon's cloud infrastructure and customer services in real-time. You'll work alongside data scientists, security engineers, and software...


  • Dublin, Ireland Amazon Full time

    Join us to drive high-impact innovation that secures our cloud by building solutions that enable an ecosystem of services to protect against sophisticated threats. The Informatics team owns the security telemetry mission in AWS. We collect, enrich and vend massive volumes of security related data from millions of hosts across globally distributed...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, Lambda Runtimes, AWS Job ID: 2823870 | Amazon Development Centre Ireland Limited AWS Lambda’s goal is nothing less than to simplify and improve the experience of computing in the cloud for every developer on the planet, from start-ups to the largest Fortune 100 companies. Serverless computing is rapidly changing how every...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, Lambda Runtimes, AWS Job ID: 2823870 | Amazon Development Centre Ireland Limited AWS Lambda’s goal is nothing less than to simplify and improve the experience of computing in the cloud for every developer on the planet, from start-ups to the largest Fortune 100 companies. Serverless computing is rapidly changing how every...


  • Dublin, Ireland ENGINEERINGUK Full time

    DESCRIPTION Are you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualized services via Amazon Web Services. We support systems at massive, and ever-growing, scale. We rapidly integrate new technologies to...


  • Dublin, Ireland Amazon Full time

    AWS Systems Security Engineer , AWS Trust & Safety Job ID: 2809224 | Amazon Data Services Ireland Limited AWS Trust and Safety (T&S) Risk & Response (R&R) is seeking a motivated Security Engineer with a strong background in incident response, threat investigation, and developing solutions to security issues. As a Security Engineer in R&R, you will employ...


  • Dublin, Ireland Amazon Full time

    AWS Fault Injection Service is a fully managed service for running fault injection experiments on AWS. FIS makes it easier to improve an application’s performance, observability, and resiliency. Fault injection experiments are used in chaos engineering, which is the practice of stressing an application in testing or production environments by creating...


  • Dublin, Ireland ENGINEERINGUK Full time

    DESCRIPTION This is a 12-week internship role starting in June/July 2025. Amazon Web Services (AWS), the largest consumer cloud offering in the world, is looking for Network Development Engineer Interns to join, learn and grow with our Dublin engineering team! AWS is the world leader in providing a highly reliable, scalable, low-cost infrastructure...


  • Dublin, Ireland Amazon Full time

    **This is a 12-week internship role starting in June/July 2025** Amazon Web Services (AWS), the largest consumer cloud offering in the world, is looking for Network Development Engineer Interns to join, learn and grow with our Dublin engineering team! AWS is the world leader in providing a highly reliable, scalable, low-cost infrastructure platform in the...


  • Dublin, Ireland Amazon Full time

    Join us in leading a team that builds innovative services protecting AWS from security threats! As a Software Engineering Manager in AWS Security, you’ll lead a team in building and managing innovative services that detect and automate the mitigation of cyber threats across all of Amazon’s infrastructure. You’ll manage software development engineers,...


  • Dublin, Ireland Amazon Full time

    In Enterprise Engineering, we build the software, services, and infrastructure that enable Amazon services across the world to build and deliver for customers. We represent the intersection of technology and the need for traditional IT solutions at Amazon’s scale. We’re an AWS-focused, builder-centric organization, focused on using AWS technology to...


  • Dublin, Ireland Amazon Full time

    AWS customers build their businesses on top of our network and they expect it to be indistinguishable from perfect. We empower Amazon’s Network Services team to automate millions of daily operations for the most powerful network in the world. A safe, efficient, automated operations platform is essential for managing Amazon’s next-generation networks. We...


  • Dublin, Ireland Amazon Full time

    Job ID: 2840409 | Amazon Development Centre Ireland Limited Are you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualised services via Amazon Web Services. We support systems at massive, and ever-growing,...


  • Dublin, Ireland ENGINEERINGUK Full time

    DESCRIPTION The Network Alerts team in AWS is looking for System Development Engineers to help build systems that monitor the AWS network, one of the world's largest and most complex networks. Tens of millions of customers rely on this network for using our retail websites, accessing content on their Kindles, and building applications and businesses on top...