Software Development Manager, Aws Incident Tooling
2 days ago
AWS Resilience owns service that prevent and respond to availability and security issues for all AWS Services.
In other words, we're the people who keep the cloud running.
We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for talented people who want to help.You'll join a diverse team of software, security experts, operations managers, and other vital roles.
You'll collaborate with people across AWS to help us deliver the highest standards for safety and security and availability.
You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.AWS Incident Tooling is at the heart of the high availability of Amazon Web Services.
We make customer impacting events shorter and less frequent by detecting early large-scale events and providing the tooling to enable fast mitigation.
Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact.
Our engineer time is spent on projects to improve the tooling and automation.
We also provide our solutions for other AWS groups to manage their own events.
It's an exciting time to join our team as we are growing and expanding our offerings.As a Software Development Manager on the team, you will manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure.
You will work closely with the team managing the incident response and with leadership to gather new requirements.
Based on learning from past incidents you will drive further improvements into our automation, tooling, and processes so that the next event is shorter or avoided entirely.
You will coordinate across project teams to expand use of our tooling to additional areas across Amazon.
If you're looking for a team with great growth potential and an opportunity to make a huge impact, this is the team to join.Key job responsibilitiesDefine and Deliver Business PrioritiesYou will be a key contributor and owner of the direction of the AWS Incident Management team.
You will define, plan, track and deliver on strategic goals for the team, while ensuring that the team remains unblocked and focused.Cross-Site, Cross-Team CoordinationYou will be responsible for coordinating with your counterparts and sister teams to ensure that a clear communication channel exists between AWS Incident tooling and Response teams.
You will also work closely with the alarming systems to create and maintain a proper end to end experience from detecting, alarming to mitigating incidents.Performance Management/Team HealthYou will own all facets of performance and career management for the team.
You will ensure the operational load of your team remains manageable and as minimal as possible.
About the teamAWS Infrastructure Services (AIS)AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure.
In other words, we're the people who keep the cloud running.
We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on.
We work on the most challenging problems, with thousands of variables impacting the supply chain — and we're looking for talented people who want to help.
You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles.
You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers.
And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.About AWSDiverse ExperiencesAWS values diverse experiences.
Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply.
If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.
Why AWS?Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform.
We pioneered cloud computing and never stopped innovating — that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it's in our nature to learn and be curious.
Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences.
Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Mentorship & Career GrowthWe're continuously raising our performance bar as we strive to become Earth's Best Employer.
That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life BalanceWe value work-life harmony.
Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture.
When we feel supported in the workplace and at home, there's nothing we can't achieve.
BASIC QUALIFICATIONS- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations- Experience in engineering team management- Experience in engineering- Experience in leading the definition and development of multi tier web services- Experience partnering with product and program management teamsPREFERRED QUALIFICATIONS- Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy- Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers- Experience managing a team of high calibre Software Engineers developing complex, world class, scalable software systems that have been successfully delivered to customers
-
Dublin, Ireland Amazon Development Centre Ireland Limited Full timeAWS Resilience owns service that prevent and respond to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help.You’ll join a...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Manager, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Dublin City, Ireland Amazon Full timeJoin a diverse team of software, security experts, operations managers, and other vital roles to help deliver the highest standards for safety, security, and availability across AWS Services. Collaborate with people across AWS to work on the most challenging problems, with constant new services and possible failure modes to prevent.Key...
-
Dublin, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & Response Job ID: 2830638 | Amazon Development Centre Ireland Limited AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new...
-
Dublin, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services...
-
Dublin, Ireland Engineeringuk Full timeSoftware Development Manager, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...
-
Dublin, Ireland Amazon Full timeSoftware Development Manager, AWS Incident Tooling & ResponseJob ID: 2830638 | Amazon Development Centre Ireland LimitedAWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services...
-
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Dublin, Dublin City, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Dublin, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Ireland Engineeringuk Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseDESCRIPTIONAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & ResponseJob ID: 2795181 | Amazon Data Services Ireland LimitedAmazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.The AWS Incident Response Systems team is building systems to ensure these AWS...
-
Senior Research Scientist, AWS Incident Tooling
2 weeks ago
Dublin, Dublin City, Ireland ENGINEERINGUK Full timeSenior Research Scientist, AWS Incident Tooling & ResponseDESCRIPTIONAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and we're...
-
Senior Research Scientist, Aws Incident Tooling
2 weeks ago
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseJob ID: | Amazon Development Centre Ireland LimitedAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services.In other words, we're the people who keep the cloud running.We work on the most challenging problems, with constant new services and possible...
-
Support Engineer
2 days ago
Dublin, Ireland Amazon Development Centre Ireland Limited Full timeAWS Incident Response is at the heart of high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by providing large scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the...
-
Dublin, Dublin City, Ireland Amazon Full timeSenior Research Scientist, AWS Incident Tooling & ResponseAWS Resilience owns service to prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we're looking for...