Lead Site Reliability Engineer

3 weeks ago


Dublin Pike, Ireland JPMorganChase Full time

As a Lead Site Reliability Engineer at JPMorgan Chase in the Commercial & Investment Bank's Digital & Platform Services division, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and conduct resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers. This role will involve designing, managing and maintaining tools to automate operational processes on AWS. You will also collaborate with team members to identify comprehensive service level indicators and work with stakeholders to establish reasonable service level objectives and error budgets with customers.

Job responsibilities

Manage incident response to swiftly mitigate business impacts by coordinating cross-functional teams.

Serve as the primary point of contact during major incidents, demonstrating the ability to quickly identify and resolve issues to prevent financial losses.

Oversee, track, and validate all changes to the Production and Disaster Recovery environments.

Automate security controls, governance processes, and compliance validation on AWS.

Lead initiatives to enhance the reliability and stability of team applications and platforms, utilizing data-driven analytics to improve service levels.

Document and share knowledge within the organization through internal forums and communities of practice.

Provide ongoing guidance, tools, and solutions to support the firm's growth.

Champion and demonstrate site reliability culture and practices, exerting technical influence throughout the team.

Exhibit a high level of technical expertise in one or more domains, proactively identifying and resolving technology-related bottlenecks.

Strive to become an expert on the applications and platforms under your purview, understanding their interdependencies and limitations.

Required qualifications, capabilities, and skills

Formal training or certification on software engineering concepts and proficient advanced experience.

Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform

Fluency in at least one programming language such as (e.g., Python, Java Spring Boot, Go, Shell Script, etc.)

Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines

Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.

Proficiency and experience in Cloud Platform (AWS) infrastructure and setting up monitoring / observability for application migrated to cloud platforms.

Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)

Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)

Experience with troubleshooting common networking technologies and issues

Ability to identify and solve problems related to complex data structures, algorithms and new technologies and if needed self-educate on new technology

Ability to expand and collaborate across different levels and stakeholder groups

Preferred qualifications, capabilities, and skills

Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team

Ability to initiate and implement ideas to solve business problem

Experience building dashboards with products such as Grafana

Prior experience in both Systems Engineering and Software Development

AWS certification as an Architect, DevOps is preferred

About Us
J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world's most prominent corporations, governments, wealthy individuals and institutional investors. Our first-class business in a first-class way approach to serving clients drives everything we do. We strive to build trusted, long-term partnerships to help our clients achieve their business objectives.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants/' and employees/' religious practices and beliefs, as well as mental health or physical disability needs. FAQS for more information about requesting an accommodation.

About the Team
J.P. Morgan's Commercial & Investment Bank is a global leader across banking, markets, securities services and payments. Corporations, governments and institutions throughout the world entrust us with their business in more than 100 countries. The Commercial & Investment Bank provides strategic advice, raises capital, manages risk and extends liquidity in markets around the world.

#J-18808-Ljbffr



  • Dublin, Ireland Jpmorgan Chase & Co. Full time

    As aLead Site Reliability Engineerat JPMorgan Chase in theCommercial & Investment Bank's Digital & Platform Servicesdivision, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them.Take lead and conduct resiliency design reviews, break up...


  • Dublin, Ireland JP Morgan Full time

    Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the International Consumer Bank, you hold a leadership role in your team, demonstrate strong knowledge...


  • Dublin, Ireland Reperio Human Capital Full time

    Site Reliability Engineer 190995 Desired skills: SRE, Azure, SaaS, Dublin Site Reliability Engineer - Azure | SaaS | Dublin (Hybrid - 2 Days Onsite)A leading software company is looking for a Site Reliability Engineer (SRE) to join their growing team supporting a complex SaaS platform hosted in Microsoft Azure. This is a traditional SRE role focused on...


  • Dublin Pike, Ireland Black Nova Group Full time

    About us: At Protex AI, we are at the forefront of AI-driven computer vision, building a safer, smarter industrial workplace with an intelligent operating system that redefines how facilities operate. Backed by top-tier global investors, we recently secured a $36 million Series B to accelerate our mission. Industry leaders like DHL, Amazon, and Tesla trust...


  • Dublin Pike, Ireland ESW Full time

    Join to apply for the Manager, Site Reliability Engineering role at ESW The Opportunity ESW is seeking a Site Reliability Engineering Manager to lead a high-performing SRE team focused on building resilient, scalable, and secure systems in Azure. In this role, you’ll collaborate with senior engineering and product leaders, champion SRE best practices,...


  • Dublin, Ireland G Treasury Ss, Llc Full time

    Site Reliability Engineer (Dublin, Hybrid)DevOps - Dublin 2 (Hybrid) The mission, should you choose to accept it, is to pioneer and scale GTreasurys system and application observability efforts, and reduce toil amongst our operational workstreams. You will work across a global set of hard-driving engineering, support, and technical operations teams that care...


  • Dublin, Ireland Elwood Roberts Full time

    Job: site Reliability Engineer Location: North Dublin Rate: 450-500 per day Type: Contract (12 months+) Working arrangement: Hybrid (1-2 days onsite per week) We have an excellent role for a combined Site Reliability Engineering (SRE) and Observability Engineering role to oversee and ensuring that complex software systems are reliable, scalable, making sure...


  • Dublin Pike, Ireland Reddit, Inc. Full time

    Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. Reddit is one of the internet’s largest sources of information. Reddit SRE is rapidly innovating and our...


  • Dublin Pike, Ireland Susquehanna International Group Full time

    Overview As a Site Reliability Engineer at Susquehanna, you’ll be working alongside experienced engineers to solve real problems, being responsible for designing, supporting, maintaining and improving infrastructure across virtual and physical environments, applying a DevOps approach. This role is aimed at graduates and early career professionals...


  • Dublin Pike, Ireland Crone Corkill Full time

    Crone Corkill have partnered with a technology consultancy who are searching for a Site Reliability Engineer to join a client in their Dublin office on a permanent basis. Expertise with Apache Kafka within a production environment is absolutely key here, with strong knowledge and experience across Kafka architecture, security, clusters, stream processing and...