Software Development Manager, AWS Incident Tooling
6 days ago
Job ID: 2830638 | Amazon Development Centre Ireland Limited
AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for talented people who want to help.
You'll join a diverse team of software, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety, security, and availability. You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
AWS Incident Tooling is at the heart of the high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by detecting early large-scale events and providing the tooling to enable fast mitigation. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact. Our engineer time is spent on projects to improve the tooling and automation. We also provide our solutions for other AWS groups to manage their own events. It's an exciting time to join our team as we are growing and expanding our offerings.
As a Software Development Manager on the team, you will manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure. You will work closely with the team managing the incident response and with leadership to gather new requirements. Based on learning from past incidents, you will drive further improvements into our automation, tooling, and processes so that the next event is shorter or avoided entirely. You will coordinate across project teams to expand the use of our tooling to additional areas across Amazon. If you're looking for a team with great growth potential and an opportunity to make a huge impact, this is the team to join.
Key job responsibilities
- Define and Deliver Business Priorities: You will be a key contributor and owner of the direction of the AWS Incident Management team. You will define, plan, track, and deliver on strategic goals for the team, while ensuring that the team remains unblocked and focused.
- Cross-Site, Cross-Team Coordination: You will be responsible for coordinating with your counterparts and sister teams to ensure that a clear communication channel exists between AWS Incident tooling and Response teams. You will also work closely with the alarming systems to create and maintain a proper end-to-end experience from detecting, alarming to mitigating incidents.
- Performance Management/Team Health: You will own all facets of performance and career management for the team. You will ensure the operational load of your team remains manageable and as minimal as possible.
- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations.
- Experience in engineering team management.
- Experience in engineering.
- Experience in leading the definition and development of multi-tier web services.
- Experience partnering with product and program management teams.
- Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy.
- Experience in recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and make them more effective product software engineers.
- Experience managing a team of high-caliber Software Engineers developing complex, world-class, scalable software systems that have been successfully delivered to customers.
Posted: November 13, 2024 (Updated about 14 hours ago)
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
#J-18808-Ljbffr-
Software Development Manager, AWS Incident Tooling
12 hours ago
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Manager, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Dublin City, Ireland TN Ireland Full timeSocial network you want to login/join with:Software Development Engineer, AWS Incident Tooling & Response, DublinClient:Amazon Data Services Ireland LimitedLocation:Dublin, IrelandJob Category:OtherEU work permit required:YesJob Reference:ff1326e23bd5Job Views:3Posted:22.03.2025Expiry Date:06.05.2025Job Description:Amazon Web Services is the largest consumer...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers...
-
AWS Incident Tooling Expert
2 days ago
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeAbout the RoleThis is an exciting opportunity to join our AWS Incident Tooling team as a Software Development Manager. You will be responsible for managing automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure.Key AccountabilitiesAutomated Tooling DeliveryYou will ensure timely and high-quality delivery...
-
Senior Research Scientist, AWS Incident Tooling
2 weeks ago
Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full timeAWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for talented people who want to help.You'll join a diverse...
-
Dublin, Dublin City, Ireland Amazon Data Services Ireland Limited Full timeAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet. We work closely with the...
-
Senior Research Scientist, Aws Incident Tooling
7 hours ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and possible...
-
AWS Incident Response Engineer
5 days ago
Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full timeThe AWS Incident Response team is responsible for ensuring the high availability of Amazon Web Services. As a Support Engineer, you will play a key role in providing large-scale event and incident management.You will lead projects to improve the tooling and automation, and provide manual incident management for AWS and other Amazon groups. This includes...
-
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseJob ID: 2834651 | Amazon Development Centre Ireland LimitedAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and...
-
Senior Research Scientist, AWS Incident Tooling
3 weeks ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...
-
Senior Research Scientist, AWS Incident Tooling
2 weeks ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...
-
Dublin, Dublin City, Ireland TN Ireland Full timeSocial network you want to login/join with:Senior Research Scientist, AWS Incident Tooling & Response, DublinClient: Amazon Development Centre Ireland LimitedLocation: Dublin, IrelandJob Category: OtherEU work permit required: YesJob Reference: 28fea5a7e581Job Views: 1Posted: 26.03.2025Expiry Date: 10.05.2025Job Description:AWS Resilience owns service to...
-
Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full timeAbout UsAt Amazon Web Services (AWS), we are the pioneers in cloud computing. Our mission is to provide a comprehensive and broadly adopted cloud platform that powers businesses globally. We strive for excellence in innovation, customer satisfaction, and employee growth.Job DescriptionWe are seeking a highly skilled Software Development Manager to join our...
-
Dublin, Dublin City, Ireland Amazon Development Centre Ireland Limited Full timeAbout This RoleThis is an exciting opportunity to join our AWS Incident Tooling team as a Software Development Manager. As a key contributor to our team, you will manage automated tooling roadmaps and delivery for issue detection and resolution within AWS infrastructure. Your expertise will help us drive further improvements into our automation, tooling, and...