AI Infrastructure Reliability Expert

2 weeks ago


Dublin, Dublin City, Ireland beBeeEngineer Full time €60,000 - €90,000
Senior Site Reliability Engineer

We are building the world's leading AI-first cloud infrastructure company.

Our vertically integrated, purpose-built AI infrastructure solutions are trusted by Fortune 500 companies to power their most advanced AI applications.

We are redefining AI cloud infrastructure with a mission to align computing with the future of the climate.

Our AI platform is recognized as the gold standard for reliability and performance.

Our data centers are optimized for AI workloads and powered by clean, renewable energy.

About This Role:

This role plays a pivotal part in ensuring the reliability and performance of our infrastructure.

SREs at our organization detect, analyze, and prevent issues so that we meet our Service Level Agreements (SLAs), which we track through Service Level Indicators (SLIs) and Service Level Objectives (SLOs).

We use automation and proactive remediation to resolve common errors automatically, and we advise engineering teams on building resilient code.

What You'll Be Working On:

You will begin each day by reviewing overnight alerts and system performance metrics to ensure everything is running smoothly.

Collaborate with your team in a morning stand-up meeting to discuss ongoing projects, recent incidents, and priorities for the day.

Tasks might include automating routine processes, analyzing system logs, and developing tools to enhance monitoring capabilities.

Engage in incident response drills, post-mortems, and root cause analysis sessions to learn from past issues and prevent future ones.

  • Drive meaningful innovation and make a tangible impact on sustainable technology.

Requirements:

To succeed in this role you must have:

  • 5+ years of professional SRE experience.
  • 3+ years of experience contributing to architecture and design of new and current systems.
  • Bachelor's Degree in Computer Science or related field, or 8+ years relevant work experience.
  • Solid understanding of infrastructure design, including operational trade-offs of various designs.
  • Experience writing high-quality code in at least one programming language.
  • Experience with Unix/Linux environments.
  • Experience with TCP/IP and network programming.
  • Experience with information security best practices.
  • Excellent communication skills.

Responsibilities Include:

As a Senior Site Reliability Engineer you will:

  • Closely collaborate with software engineers to advise on best practices for resilient code and review changes before deployment.
  • Meet or exceed our SLOs, as measured by SLIs, so that our infrastructure remains robust and reliable for customers.

Benefits:

We offer a competitive benefits package designed to support financial security, health, and overall well-being.

Compensation will be paid as a salary or at an hourly rate.


