15h Left: Software Development Engineer, AWS Incident Tooling
1 day ago
Software Development Engineer, AWS Incident Tooling & Response
Job ID: 2795181 | Amazon Data Services Ireland Limited
Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies.
The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability, lowest-latency cloud platform on the planet. We work closely with the teams who own the largest AWS products, building systems to detect and mitigate operational issues before they impact customers. We are looking for a knowledgeable and experienced software development engineer to help us succeed in this mission.
As a Software Development Engineer at AWS Incident Response Systems, you will join the team in the design and implementation of systems which automate fault containment, problem diagnosis, and issue resolution across multiple hugely-distributed, always-on architectures. These systems will take metric and dependency data from multiple sources and analyse them, correlating them with customer impact to determine root cause of an issue without human intervention. They will create engagements, facilitate communication and coordination of the response and mitigation. As the scale and complexity of AWS grows, this is the best way that we can offer our customers a stable and reliable cloud computing platform. We succeed once these systems detect, diagnose, and repair operational defects without customer impact or human intervention.
You will work with teams across AWS to drive adoption of the software that has been built by the team, and influence systems development practices for new and existing products. You will define availability goals for service teams across AWS, and strategies to make these goals attainable with minimal effort. Your goal will be to remove human-error from the day-to-day operations of the massive, always-on, distributed systems which make up AWS.
Within your first year on the AWS Incident Response Systems team, you will have met with senior technical leaders from across AWS, designed and implemented at least one new system, and you will have dived deep into the causes of at least one historic external customer impacting event, and determined how to prevent a similar event from ever happening again. As your career continues to develop, you will influence the growth and direction not only of the Incident Response Systems team, but of the AWS group as a whole.
If this sounds like the right challenge for you, then please apply today
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Key job responsibilities
1. Write well-tested, maintainable code.
2. Design, contribute to, and maintain systems which solve customer problems.
3. Work with team-mates to improve code quality, system architecture and team processes.
4. Learn about the incident management processes supported by the team’s system to identify improvement opportunities.
A day in the life
As a Software Development Engineer on the AWS Incident Response Systems team, you will spend time each day writing code, reviewing code, creating documentation and responding to operational issues in the team’s systems. You will have conversations with technical leaders which will help you grow your career. You will get to know your customers and figure out better ways to solve their problems. You will contribute to the long term direction for your team and for the Incident Response Systems organisation.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
About the team
Our team has a wide and diverse set of backgrounds and experiences - we have engineers and managers who have been involved in Incident Response at Amazon for many years, people with a more traditional software engineering background, and a range in between. This breadth of experience makes for a vibrant and creative team, and we collaborate to build high-quality software systems which solve our customers’ problems.
BASIC QUALIFICATIONS
- Experience (non-internship) in professional software development
- Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems
- Experience programming with at least one software programming language
PREFERRED QUALIFICATIONS
- Bachelor's degree in computer science or equivalent
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Posted: January 10, 2025 (Updated 2 days ago)
Posted: December 12, 2024 (Updated 3 days ago)
Posted: December 11, 2024 (Updated 3 days ago)
Posted: January 10, 2025 (Updated 3 days ago)
Posted: October 8, 2024 (Updated 3 days ago)
#J-18808-Ljbffr
-
Dublin, Ireland ENGINEERINGUK Full timeSoftware Development Manager, AWS Incident Tooling & Response DESCRIPTION AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent - and...
-
Dublin, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Incident Tooling & Response DESCRIPTION Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies. The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, AWS Incident Tooling & Response Amazon Web Services is the largest consumer cloud offering in the world, powering cutting edge science, rapidly growing start-ups and industry-leading companies. The AWS Incident Response Systems team is building systems to ensure these AWS customers can rely on the highest-availability,...
-
Dublin, Ireland Amazon Full timeAWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we’re the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to prevent — and we’re looking for talented people who want to help. You’ll join a...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, Lambda Runtimes, AWS Job ID: 2823870 | Amazon Development Centre Ireland Limited AWS Lambda’s goal is nothing less than to simplify and improve the experience of computing in the cloud for every developer on the planet, from start-ups to the largest Fortune 100 companies. Serverless computing is rapidly changing how every...
-
Dublin, Ireland Amazon Full timeAWS Incident Response is at the heart of high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by providing large scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineer time is spent on projects to improve the...
-
Dublin, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, Lambda - Experience DESCRIPTION AWS Lambda's goal is nothing less than to simplify and improve the experience of computing in the cloud for every developer on the planet, from startups to the largest Fortune 100 companies. Serverless computing is rapidly changing how every company thinks about building and delivering software...
-
Dublin, Ireland ENGINEERINGUK Full timeSoftware Development Engineer, AWS Fault Injection Service DESCRIPTION AWS Fault Injection Service is a fully managed service for running fault injection experiments on AWS. FIS makes it easier to improve an application's performance, observability, and resiliency. Fault injection experiments are used in chaos engineering, which is the practice of stressing...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, Lambda Runtimes, AWS Job ID: 2823870 | Amazon Development Centre Ireland Limited AWS Lambda’s goal is nothing less than to simplify and improve the experience of computing in the cloud for every developer on the planet, from start-ups to the largest Fortune 100 companies. Serverless computing is rapidly changing how every...
-
Dublin, Ireland Amazon Development Centre Ireland Limited Full time- Experience (non-internship) in professional software development - Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems - Experience programming with at least one software programming language Amazon Web Services is seeking a talented and passionate Software Development Engineer to join our Load...
-
▷ (15h Left) Research Engineer, AWS AI
4 days ago
Dublin, Ireland Amazon Full timeJob ID: 2783177 | Amazon Development Center (Tel Aviv) AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a...
-
Dublin, Ireland Amazon Full timeAWS Fault Injection Service is a fully managed service for running fault injection experiments on AWS. FIS makes it easier to improve an application’s performance, observability, and resiliency. Fault injection experiments are used in chaos engineering, which is the practice of stressing an application in testing or production environments by creating...
-
Dublin, Ireland Amazon Development Centre Ireland Limited Full time- 3+ years of non-internship professional software development experience - 2+ years of non-internship experience designing or architecting new and existing systems (including design patterns, reliability and scaling) - Experience programming with at least one software programming language - Successful applicants must have the legal right to work in Ireland....
-
Dublin, Ireland ENGINEERINGUK Full timeSr. Technical Program Manager Automation and Tooling, AWS Trust & Safety DESCRIPTION Amazon Web Services (AWS) is built on the core principles of security, privacy, compliance, and transparency, which form the foundation of our trusted cloud infrastructure. We are seeking individuals who are dedicated to upholding these principles daily. Are you interested...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, AWS Demand Planning Job ID: 2794149 | Amazon Development Center U.S., Inc. The Demand Planning team builds software to forecast the amount of hardware needed to keep EC2 running as the world's favorite elastic cloud. Our work ensures that AWS customers never run out of cloud computing capacity! We are looking for a Software...
-
Dublin, Ireland Amazon Full timeSoftware Development Engineer, GCNA Network Availability Engineer AWS operates one of the world’s largest and most highly available networks which continues to grow rapidly in both size and complexity in response to customer demand. Many AWS customers run mission critical workloads that depend on our networks to be always on. Network Availability...
-
▷ 15h Left: Software Development Engineer
5 days ago
Dublin, Ireland Amazon Full timeSoftware Development Engineer - Enterprise Networking, Enterprise Networking Software & Automation Job ID: 2867767 | Amazon Support Services Pty Ltd AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers...
-
Dublin, Ireland ENGINEERINGUK Full timeDESCRIPTION Are you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualized services via Amazon Web Services. We support systems at massive, and ever-growing, scale. We rapidly integrate new technologies to...
-
Software Development Engineer, AWS DNS
3 days ago
Dublin, Ireland Amazon Full timeAre you ready to create systems to power one of the largest e-commerce companies in the world? Amazon.com has over 70 million customers, and developers all over the world rely on our storage, compute, and virtualized services via Amazon Web Services. We support systems at massive, and ever-growing, scale. We rapidly integrate new technologies to expand our...
-
Sr. Software Engineer
4 days ago
Dublin, Ireland Allegion Canada Inc. Full timeCreating Peace of Mind by Pioneering Safety and Security At Allegion, we help keep the people you know and love safe and secure where they live, work and visit. With more than 30 brands, 12,000+ employees globally and products sold in 130 countries, we specialize in security around the doorway and beyond. Additionally, in 2024 we were awarded the Gallup...