Reliable Systems Engineer

3 days ago


Dublin, Dublin City, Ireland beBeeSRE Full time €80,000 - €100,000
Job Description

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our team. The successful candidate will be responsible for ensuring the reliability and performance of our infrastructure, detecting and analyzing issues, and developing tools to enhance monitoring capabilities.

The ideal candidate will have a solid understanding of infrastructure design, including the operational trade-offs of various designs, and experience writing high-quality code in at least one programming language (Python, Go, or similar). They will also have experience with Unix/Linux environments, TCP/IP, and network programming, as well as excellent communication skills.

Key Responsibilities:
  • Review overnight alerts and system performance metrics to ensure smooth operations.
  • Collaborate with your team in a morning stand-up meeting to discuss ongoing projects, recent incidents, and priorities.
  • Automate routine processes, analyze system logs, and develop tools to enhance monitoring capabilities.
  • Work closely with software engineers, advising on best practices for resilient code and reviewing changes before deployment.
Requirements

To be considered for this role, you will need to meet the following requirements:

Skills and Qualifications:
  • 5+ years of professional SRE experience.
  • 3+ years of experience contributing to the architecture and design of new and existing systems.
  • Bachelor's degree in Computer Science or a related field, or 8+ years of relevant work experience.
  • Solid understanding of infrastructure design, including operational trade-offs of various designs.
  • Experience writing high-quality code in at least one programming language (Python, Go, or similar).
  • Experience with Unix/Linux environments, TCP/IP, and network programming.
  • Excellent communication skills.
Desirable Skills:
  • Experience working with cutting-edge AI training & inference hardware and networks.
  • Experience running large, mission-critical storage systems or NVMe over Fabrics (NVMe-oF).
  • Experience building with modern infrastructure tools such as Docker, Kubernetes, Ansible, CloudFormation, and Terraform.
  • Experience building with modern CI/CD practices and build systems, such as GitLab CI/CD, CircleCI, GitHub Actions.
  • Experience with logging, monitoring, and alerting systems and tools.


  • Dublin, Dublin City, Ireland beBeeReliability Full time €80,000 - €120,000

    Reliability Engineering Specialist. Our team focuses on developing and maintaining large-scale, fault-tolerant systems. We aim to ensure high reliability, uptime suitable for customer needs, and a rapid pace of improvement. Key Responsibilities: Participate in the entire lifecycle of services from inception to deployment, operation, and refinement. Provide support...


  • Dublin, Dublin City, Ireland Crusoe Energy Systems LLC Full time

    Crusoe is building the World's Favorite AI-first Cloud infrastructure company. We're pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the...


  • Dublin, Dublin City, Ireland beBeeReliability Full time €60,000 - €100,000

    Reliability Engineer Role Overview: This is an exciting opportunity to join a team as a Reliability Engineer. The position involves working with aircraft manufacturers to improve reliability in our operations. Analyze maintenance data for components and systems, identify and resolve reliability alerts, and find solutions for repetitive delay causes. Conduct...


  • Dublin, Dublin City, Ireland beBeeEngineer Full time €90,000 - €105,000

    We are seeking a Senior Site Reliability Engineer to ensure the reliability, security, and scalability of our SaaS platform hosted on AWS. Key Responsibilities: Cloud Infrastructure & Operations: Manage and monitor AWS services (EC2, ECS/EKS, RDS, Lambda, S3, CloudFront, VPC, IAM, etc.). Ensure high availability, performance, and cost efficiency of cloud...


  • Dublin, Dublin City, Ireland beBeeReliabilityEngineer Full time €80,000 - €100,000

    Job Overview: As a reliability engineer, you will pioneer and scale system observability efforts. You will work across engineering teams to ensure excellent customer experiences. Key Responsibilities include leading system observability efforts using tools like New Relic, developing site reliability practices, implementing new tools and standards,...


  • Dublin, Dublin City, Ireland beBeeEngineer Full time €75,000 - €105,000

    Job Description: We are seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a key role in ensuring the reliability and efficiency of our systems. You will be responsible for supporting production systems, performing troubleshooting tasks, and providing relief and sustainable resolution to issues within our...


  • Dublin, Dublin City, Ireland beBeeSystemDeveloper Full time €90,000 - €120,000

    Job Overview: As a reliability engineer in our team, you will play a key role in building and running large-scale systems. Your primary responsibility will be to ensure the reliability, uptime, and performance of our services. In this position, you will write product or system development code using your expertise in coding, algorithms, complexity analysis, and...


  • Dublin, Dublin City, Ireland beBeeInfrastructure Full time €88,302 - €121,956

    Security Site Reliability Engineer. We are seeking highly skilled and motivated individuals to join our dynamic teams across Europe and the US. You will design, engineer, and run systems and infrastructure that support millions of customers. You will work closely with software developers to provide systems and infrastructure that fuel scalable services. The role...

  • Reliability Engineer

    4 weeks ago


    Dublin, Dublin City, Ireland Egis Group Full time

    Work Location: Dublin Tunnel Control Building, D03NH33 or Jack Lynch Tunnel, Cork. The Reliability Engineer is a critical member of the Asset Management team responsible for maximising the operational availability, performance, and lifecycle value of assets across the Dublin Tunnel (DT), Jack Lynch Tunnel (JLT), and...