
Reliable Systems Engineer
3 days ago
We are seeking a highly skilled and experienced SRE to join our team. The successful candidate will be responsible for ensuring the reliability and performance of our infrastructure, detecting and analyzing issues, and developing tools to enhance monitoring capabilities.
The ideal candidate will have a solid understanding of infrastructure design, including operational trade-offs of various designs, and experience writing high-quality code with at least one programming language (Python, Go, or similar). They will also have experience with Unix/Linux environments, TCP/IP, and network programming, as well as excellent communication skills.
Key Responsibilities:
- Review overnight alerts and system performance metrics to ensure smooth operations.
- Collaborate with your team in a morning stand-up meeting to discuss ongoing projects, recent incidents, and priorities.
- Automate routine processes, analyze system logs, and develop tools to enhance monitoring capabilities.
- Work closely with software engineers, advising on best practices for resilient code and reviewing changes before deployment.
To be considered for this role, you will need to meet the following requirements:
Skills and Qualifications:
- 5+ years of professional SRE experience.
- 3+ years of experience contributing to architecture and design of new and current systems.
- Bachelor's Degree in Computer Science or related field, or 8+ years relevant work experience.
- Solid understanding of infrastructure design, including operational trade-offs of various designs.
- Experience writing high-quality code with at least one programming language (Python, Go, or similar).
- Experience with Unix/Linux environments, TCP/IP, and network programming.
- Excellent communication skills.
- Experience working with cutting-edge AI training & inference hardware and networks.
- Experience running large, mission-critical storage systems/NVMe over Fabrics.
- Experience building with modern infrastructure tools such as Docker, Kubernetes, Ansible, CloudFormation, Terraform.
- Experience building with modern CI/CD practices and build systems, such as GitLab CI/CD, CircleCI, GitHub Actions.
- Experience with logging, monitoring, and alerting systems and tools.
-
Systems Reliability Engineer
2 days ago
Dublin, Dublin City, Ireland beBeeReliability Full time €80,000 - €120,000
Reliability Engineering Specialist
Our team focuses on developing and maintaining large-scale, fault-tolerant systems. We aim to ensure high reliability, uptime suitable for customer needs, and a rapid pace of improvement.
Key Responsibilities: Participate in the entire lifecycle of services from inception to deployment, operation, and refinement. Provide support...
-
Senior Site Reliability Engineer
2 weeks ago
Dublin, Dublin City, Ireland Crusoe Energy Systems LLC Full time
Crusoe is building the World's Favorite AI-first Cloud infrastructure company. We're pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the...
-
Senior Site Reliability Engineer
1 week ago
Dublin, Dublin City, Ireland Crusoe Energy Systems LLC Full time
Crusoe is building the World's Favorite AI-first Cloud infrastructure company. We're pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the...
-
System Reliability Specialist
5 days ago
Dublin, Dublin City, Ireland beBeeReliability Full time €60,000 - €100,000
Reliability Engineer Role Overview
This is an exciting opportunity to join a team as a Reliability Engineer. The position involves working with aircraft manufacturers to improve reliability in our operations.
Analyze maintenance data for components and systems, identify and resolve reliability alerts, and find solutions for repetitive delay causes. Conduct...
-
Reliable Systems Engineer
2 weeks ago
Dublin, Dublin City, Ireland beBeeEngineer Full time €90,000 - €105,000
We are seeking a Senior Site Reliability Engineer to ensure the reliability, security, and scalability of our SaaS platform hosted on AWS.
Key Responsibilities
Cloud Infrastructure & Operations: Manage and monitor AWS services (EC2, ECS/EKS, RDS, Lambda, S3, CloudFront, VPC, IAM, etc.). Ensure high availability, performance, and cost efficiency of cloud...
-
Cloud System Architect
2 weeks ago
Dublin, Dublin City, Ireland beBeeReliabilityEngineer Full time €80,000 - €100,000
Job Overview: As a reliability engineer, you will pioneer and scale system observability efforts. You will work across engineering teams to ensure excellent customer experiences.
Key Responsibilities include leading system observability efforts using tools like New Relic, developing site reliability practices, implementing new tools and standards,...
-
Reliable Systems Specialist
2 weeks ago
Dublin, Dublin City, Ireland beBeeEngineer Full time €75,000 - €105,000
Job Description
We are seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a key role in ensuring the reliability and efficiency of our systems.
You will be responsible for supporting production systems, performing troubleshooting tasks, and providing relief and sustainable resolution to issues within our...
-
Reliability Engineer
1 week ago
Dublin, Dublin City, Ireland beBeeSystemDeveloper Full time €90,000 - €120,000
Job Overview
As a reliability engineer in our team, you will play a key role in building and running large-scale systems. Your primary responsibility will be to ensure the reliability, uptime, and performance of our services.
In this position, you will write product or system development code using your expertise in coding, algorithms, complexity analysis, and...
-
Reliable Systems Architect
3 days ago
Dublin, Dublin City, Ireland beBeeInfrastructure Full time €88,302 - €121,956
Security Site Reliability Engineer
We are seeking highly skilled and motivated individuals to join our dynamic teams across Europe and the US. You will design, engineer, and run systems and infrastructure that support millions of customers. You will work closely with software developers to provide systems and infrastructure that fuel scalable services. The role...
-
Reliability Engineer
4 weeks ago
Dublin, Dublin City, Ireland Egis Group Full time
Work Location: Dublin Tunnel Control Building, D03NH33 or Jack Lynch Tunnel, Cork
The Reliability Engineer is a critical member of the Asset Management team responsible for maximising the operational availability, performance, and lifecycle value of assets across the Dublin Tunnel (DT), Jack Lynch Tunnel (JLT), and...