Software Development Manager, AWS Incident Tooling

2 days ago


Dublin, Ireland Amazon Full time

Software Development Manager, AWS Incident Tooling & Response

Job ID: 2830638 | Amazon Development Centre Ireland Limited

AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety, security, and availability. You’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
AWS Incident Tooling is at the heart of the high availability of Amazon Web Services. We make customer-impacting events shorter and less frequent by detecting early large-scale events and providing the tooling to enable fast mitigation. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact. Our engineer time is spent on projects to improve the tooling and automation. We also provide our solutions for other AWS groups to manage their own events. It's an exciting time to join our team as we are growing and expanding our offerings.

As a Software Development Manager on the team, you will manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure. You will work closely with the team managing the incident response and with leadership to gather new requirements. Based on learning from past incidents, you will drive further improvements into our automation, tooling, and processes so that the next event is shorter or avoided entirely. You will coordinate across project teams to expand the use of our tooling to additional areas across Amazon. If you're looking for a team with great growth potential and an opportunity to make a huge impact, this is the team to join.

Key job responsibilities

1. Define and Deliver Business Priorities: You will be a key contributor and owner of the direction of the AWS Incident Management team. You will define, plan, track, and deliver on strategic goals for the team while ensuring that the team remains unblocked and focused.
2. Cross-Site, Cross-Team Coordination: You will be responsible for coordinating with your counterparts and sister teams to ensure that a clear communication channel exists between AWS Incident tooling and Response teams. You will also work closely with the alarming systems to create and maintain a proper end-to-end experience from detecting, alarming to mitigating incidents.
3. Performance Management/Team Health: You will own all facets of performance and career management for the team. You will ensure the operational load of your team remains manageable and as minimal as possible.

BASIC QUALIFICATIONS

- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations.
- Experience in engineering team management.
- Experience in engineering.
- Experience in leading the definition and development of multi-tier web services.
- Experience partnering with product and program management teams.

PREFERRED QUALIFICATIONS

- Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy.
- Experience in recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and make them more effective, product software engineers.
- Experience managing a team of high-caliber Software Engineers developing complex, world-class, scalable software systems that have been successfully delivered to customers.

Posted: November 13, 2024 (Updated about 14 hours ago)

Posted: October 4, 2024 (Updated about 17 hours ago)

Posted: October 4, 2024 (Updated about 17 hours ago)

Posted: September 27, 2024 (Updated about 22 hours ago)

Posted: September 27, 2024 (Updated about 22 hours ago)

#J-18808-Ljbffr



  • Dublin, Ireland ENGINEERINGUK Full time

    Software Development Manager, AWS Incident Tooling & Response DESCRIPTION AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...


  • Dublin, Ireland ENGINEERINGUK Full time

    Software Development Engineer, AWS Incident Tooling & Response DESCRIPTION Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies. The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, AWS Incident Tooling & Response Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies. The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, AWS Incident Tooling & Response Job ID: 2795181 | Amazon Data Services Ireland Limited Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies. The AWS Incident Response Systems team is building systems to ensure these...


  • Dublin, Ireland ENGINEERINGUK Full time

    Senior Research Scientist, AWS Incident Tooling & Response DESCRIPTION AWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...


  • Dublin, Ireland Amazon Full time

    AWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help. You’ll join a...


  • Dublin, Ireland Amazon Full time

    AWS Incident Response is at the heart of high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by providing large scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the...


  • Dublin, Ireland Amazon Full time

    Incident Management Engineer, AWS Incident Detection and Response Job ID: 2882806 | Amazon Web Services EMEA SARL (Irish Branch) ABOUT US Amazon has built a reputation for excellence with a mission to be the earth’s most customer-centric company. Amazon Web Services (AWS) is carrying on that tradition while leading the world in cloud technologies. The...


  • Dublin, Ireland Amazon Full time

    ABOUT US Amazon has built a reputation for excellence with a mission to be the earth’s most customer-centric company, a company that customers from all over the globe will recognize, value, and trust for both our products and our service. Amazon Web Services (AWS) is carrying on that tradition while leading the world in cloud technologies. The AWS...


  • Dublin, Ireland ENGINEERINGUK Full time

    Sr. Technical Program Manager Automation and Tooling, AWS Trust & Safety DESCRIPTION Amazon Web Services (AWS) is built on the core principles of security, privacy, compliance, and transparency, which form the foundation of our trusted cloud infrastructure. We are seeking individuals who are dedicated to upholding these principles daily. Are you interested...


  • Dublin, Ireland Amazon Full time

    Sr. Technical Program Manager Automation and Tooling, AWS Trust & Safety Job ID: 2814847 | Amazon Web Services EMEA SARL (Irish Branch) - G50 Amazon Web Services (AWS) is built on the core principles of security, privacy, compliance, and transparency, which form the foundation of our trusted cloud infrastructure. We are seeking individuals who are...


  • Dublin, Ireland Amazon Development Centre Ireland Limited Full time

    - Experience (non-internship) in professional software development - Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems - Experience programming with at least one software programming language Amazon Web Services is seeking a talented and passionate Software Development Engineer to join our Load...


  • Dublin, Ireland Amazon Development Centre Ireland Limited Full time

    - 3+ years of non-internship professional software development experience - 2+ years of non-internship experience designing or architecting new and existing systems (including design patterns, reliability and scaling) - Experience programming with at least one software programming language - Successful applicants must have the legal right to work in Ireland....


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, AWS Alameda Job ID: 2855471 | Amazon Development Center U.S., Inc. AWS Alameda is shaping the future of how Control Planes for AWS Services will be offered. The Alameda team builds innovative and secure technologies on a massive scale that manage the control planes for AWS services and keep them secure and scalable for their...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer, AWS Demand Planning Job ID: 2794149 | Amazon Development Center U.S., Inc. The Demand Planning team builds software to forecast the amount of hardware needed to keep EC2 running as the world's favorite elastic cloud. Our work ensures that AWS customers never run out of cloud computing capacity! We are looking for a Software...


  • Dublin, Ireland Amazon Full time

    Software Development Engineer II, AWS Aurora Looking to be part of a team building hyper-scale database services in the cloud? Do you want to revolutionize the way people manage vast volumes of data in the cloud where you have direct and immediate impact on hundreds of thousands of users who use AWS database services? Aurora is a distributed,...


  • Dublin, Ireland Amazon Full time

    Are you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualized services via Amazon Web Services. We support systems at massive, and ever-growing, scale. We rapidly integrate new technologies to expand our...


  • Dublin, Ireland ENGINEERINGUK Full time

    Software Development Engineer, AWS Security DESCRIPTION Join us to drive high-impact innovation that secures our cloud by building solutions that enable an ecosystem of services to protect against sophisticated threats. The Informatics team owns the security telemetry mission in AWS. We collect, enrich and vend massive volumes of security related data from...


  • Dublin, Ireland Amazon Full time

    Job ID: 2877396 | Amazon Data Services Ireland Limited Are you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualized services via Amazon Web Services. We support systems at massive, and ever-growing,...


  • Dublin, Ireland ENGINEERINGUK Full time

    You will need to login before you can apply for a job. Software Developer Engineer, Platform - AWS Load Balancing Sector: Engineering, Technology Role: Professional Contract Type: Permanent Hours: Full Time DESCRIPTION Developers all over the world rely on AWS Load Balancing services to ensure their applications and services are highly available. The...