Senior Staff Site Reliability Engineer, Monitoring

6 days ago


Dublin, Dublin City, Ireland Socotra, Inc. Full time

About usAt Udemy, we're on a mission to transform lives through learning.
Through our intelligent skills platform and a global community of instructors, we've helped over 70 million learners and 16,000 organizations achieve their goals.
Come join us in ensuring everyone, everywhere has access to the skills they need to unlock their potential and create possibilities for themselves and others.

Hybrid workUdemy is headquartered in San Francisco with global offices in Australia, India, Ireland, Türkiye, and other US locations.
Our robust hybrid work model spans San Francisco, Denver, Ankara, Dublin, and Melbourne.
This hybrid position requires two days per week in the office at the nearest hub.
About youYou are a motivated, meticulous Engineer with a team-oriented approach and exceptional problem-solving skills.
You are organized and proactive and take the initiative to prioritize your own work and projects effectively.
You thrive in a collaborative environment and are eager to work with and learn alongside the best in Product, Design, and Engineering.
As a Monitoring & Observability Engineer, you'll be a key player in building and evolving our systems.
You know that complex systems are hard to measure and monitor, but you're driven to tackle these challenges head-on.
You have deep expertise in microservices and are passionate about optimizing the way we monitor, measure, and instrument them.
User experience is at the heart of your work, and you're always thinking about how our metrics impact the way people interact with our systems.
Linux is your natural environment, and you aren't afraid to dive deep into troubleshooting application, system, and network issues.
You've worked with industry-leading monitoring tools like Datadog, New Relic, and Honeycomb, and you're always eager to refine your skills and learn new ones.
Above all, you're a strong communicator in English and excel at collaborating with engineers and teams across the organization.
We care less about your formal education or mathematical expertise and more about your hands-on experience and your passion for monitoring.
If you're obsessed with building observability systems, automating repetitive tasks, and driving improvements across the board, we want you on our team.
Here's what you will be doing:Leading the evolution of our monitoring and observability strategy, making it a core pillar of how we workPartnering with engineering teams to enhance the visibility and reliability of our systems, ensuring that we build for long-term successDriving the standardization of SLIs + SLOs across all engineering teams, aligning on best practicesOwning and optimizing our current monitoring systems, including Datadog, Sentry, and other key toolsCollaborating with teams to proactively improve site availability, ensuring a seamless user experienceLeading incident analysis while fostering a Blameless Culture, ensuring that we learn from challenges and improvePromoting best practices for on-call and incident management, ensuring teams are always prepared and resilientContinuously improving developer happiness and productivity by automating manual tasks and creating processes that prevent surprisesAbout your skills:3+ years experience managing complex monitoring systems like Datadog, Honeycomb, or New RelicProficiency in programming languages such as Go (preferred), Python, Bash, or JavaExperience with incident management tools and processes, with at least 3 years on-call experienceHands-on experience with paging tools and incident response frameworksSolid understanding of Terraform, Kubernetes (K8s), and AWS for deployment and managementA knack for problem-solving, with the ability to think creatively and work collaboratively with peersExcellent communication skills and a desire to continuously learn and grow within a fast-paced environmentWe understand that not everyone will match each of the above qualifications.
However, we also realize that everyone has unique experiences that can add value to our company.
Even if you think your background might not perfectly align, we'd love to hear from you
Our Benefits Start with UOur benefits start with you and were built to provide you and your family with the protection and care you need, making it easy to access the right coverage when you need it most.
Benefits vary by region, and we encourage applicants to review our US Benefits and Ireland Benefits pages to get an understanding of some of the benefits we offer.
For details on region-specific benefits, please refer to the information provided during the hiring process.
Information regarding data privacy is available within the Udemy Careers Privacy Notice.

#J-18808-Ljbffr



  • Dublin, Dublin City, Ireland Socotra, Inc. Full time

    At Socotra, Inc., we're dedicated to revolutionizing the way organizations approach monitoring and observability.About UsWe're a team of passionate engineers driven by a shared vision of building scalable and reliable systems that empower our users.The RoleWe're seeking an exceptional Senior Staff Site Reliability Engineer, Monitoring to join our ranks. As a...


  • Dublin, Dublin City, Ireland Scopely Full time

    Description Scopely is looking for a Senior Site Reliability Engineer to join our new unannounced project in either Ireland, Spain, Portugal or the UK on a hybrid/remote basis. We can support with visa sponsorship and relocation assistance.At Scopely, we care deeply about what we do and want to inspire play, every day - whether in our work...


  • Dublin, Dublin City, Ireland Sojern Full time

    Position summary:Sojern is looking for a Staff Site Reliability Engineer in Dublin to collaborate with Software Engineering teams located primarily in our Dublin office. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform, and have strong experience running and securing workloads at scale on...


  • Dublin, Dublin City, Ireland Reperio Human Capital Full time

    Senior Site Reliability Engineer 98301 Desired skills: Senior Site Reliability Engineer, SRE, Senior, Ireland, Remote, AWS, Cloud, Infrastructure Senior Site Reliability Engineer Ireland Salary: 75K+ Full-time, Permanent We are currently looking for a Senior Site Reliability Engineer for our client in Dublin. The role would be...


  • Dublin, Dublin City, Ireland Prove Full time

    Title: Senior Site Reliability EngineerDepartment: Internal OperationsReports To: Senior Manager, Site ReliabilityFLSA Status: N/ALocation: IrelandJob Summary:The Senior Site Reliability Engineer is responsible for bringing a software engineering approach to Prove operations. Using software as a tool to manage systems, solve problems, and automate operations...


  • Dublin, Dublin City, Ireland realTime Recruitment Full time

    Job Opening Lead Site Reliability Engineer - SRE Permanent Dublin 09-08-2023 RealTime are looking for a Lead Site Reliability Engineer to lead a site reliability function to design, implement, & lead a team responsible for delivering on growth and industry-changing strategic objectives. A key responsibility will be...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Staff Software Engineer, Site Reliability Engineeringcorporate_fare Google place Dublin, IrelandApplyMinimum Qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.Candidates will typically have 5 years of experience with software development in one or more programming languages.Typically 8 years of...


  • Dublin, Dublin City, Ireland Fruition Group Ireland Full time

    My client based in Co Louth is currently recruiting for a Site Reliability Engineer to join a growing team. The role is hybrid working. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of cloud-based ERP solutions. You will work closely with software engineers, DevOps teams, and...


  • Dublin, Dublin City, Ireland Google Full time

    About the JobAs a Staff Site Reliability Engineer at Google, you will play a critical role in ensuring the reliability and uptime of our externally-visible systems.You will engage in all aspects of service lifecycle, from inception and design through deployment, operation, and refinement.This involves collaborating with cross-functional teams to drive system...


  • Dublin, Dublin City, Ireland realTime Recruitment Full time

    Job Opening Site Reliability Engineer - SRE Permanent Dublin 10-08-2023 RealTime are looking for a Site Reliability Engineer to help with the development and deployment of tooling, monitoring, control, self-service reporting, and analysis approach. You will be design & build, with a focus on monitor & traceability and remediation of security,...


  • Dublin, Dublin City, Ireland Pontoon Solutions Full time

    Get AI-powered advice on this job and more exclusive features.Direct message the job poster from Pontoon SolutionsPrincipal Recruiter (Tech) | Pontoon Solutions (Adecco)Job Title: Site Reliability EngineerContract Type: TemporaryLocation: Dublin (5 days a week onsite)Are you ready to elevate your career in the financial services industry? Our client, a...


  • Dublin, Dublin City, Ireland Griffin Gaming Partners Full time

    Scopely is looking for a Senior Site Reliability Engineer to join our new unannounced project in either Ireland, Spain, Portugal or the UK on a hybrid/remote basis. We can support with visa sponsorship and relocation assistance.At Scopely, we care deeply about what we do and want to inspire play, every day - whether in our work environments alongside our...


  • Dublin, Dublin City, Ireland Tbwa ChiatDay Inc Full time

    As the world moves to a mobile-first economy, businesses need to modernize how they acquire, engage with and enable consumers. Prove's phone-centric identity tokenization and passive cryptographic authentication solutions reduce friction, enhance security and privacy across all digital channels, and accelerate revenues while reducing operating expenses and...


  • Dublin, Dublin City, Ireland Reperio Human Capital Full time

    Site Reliability Engineer 97977 Desired skills: Terraform, Coding, Cypress, Datadog, Go/Golang Fully Remote - Ireland The Site Reliability Engineer will be responsible for building observability frameworks. This role naturally suits a candidate who is a true tech lover and is self-taught in several areas. The client requires the successful...


  • Dublin, Dublin City, Ireland Google Full time

    Staff Software Engineer, Turnup Site Reliability Engineeringcorporate_fare Google place Dublin, IrelandApplyMinimum Qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.8 years of experience with data structures or algorithms.5 years of experience with software development in one or more programming...


  • Dublin, Dublin City, Ireland Recruiters Full time

    This range is provided by Recruiters.ie. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Base pay rangeDirect message the job poster from Recruiters.iePrincipal IT Recruiter | Expert in Sourcing Top DAILY RATE CONTRACTORSWho is Our Client? Our client's technology division powers a secure, digital...


  • Dublin, Dublin City, Ireland Apple Inc. Full time

    Site Reliability Engineering Manager Job SummaryWe are seeking an experienced Site Reliability Engineering Manager to join our team at Apple Inc.As a Site Reliability Engineering Manager, you will be responsible for the reliability of the platform serving workloads that provide our organisation and our customers with their favourite applications, services,...


  • Dublin, Dublin City, Ireland Google Full time

    Staff Software Engineer, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain.Minimum Qualifications: Bachelor's degree in Computer Science, a related field, or equivalent practical...


  • Dublin, Dublin City, Ireland Recruiters Full time

    We are seeking an experienced Site Reliability Engineering Specialist to join our team in IT Services and IT Consulting, Financial Services, and Banking.About the RoleThe Site Reliability Engineering Specialist will be responsible for ensuring platform stability and health, focusing on automation, monitoring, and capacity planning.Key Responsibilities:Ensure...


  • Dublin, Dublin City, Ireland Tn Ireland Full time

    Job Description: We are seeking a seasoned Site Reliability Engineer to lead our site reliability function. As a key member of our team, you will design, implement, and lead a team responsible for delivering strategic objectives.You will play a critical role in monitoring and remediating systems, security, and network issues.A key responsibility will be...