Senior Site Reliability Engineer

1 week ago


Dublin, Dublin City, Ireland DOCOsoft Full time

Get AI-powered advice on this job and more exclusive features.

As an Senior Azure Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, availability, and performance of our Vew SaaS platform hosted on Microsoft Azure. You will collaborate closely with development, operations, and infrastructure teams to design, implement, and maintain highly scalable and resilient systems. Your primary focus will be on automation, monitoring, incident response, and continuous improvement to enhance the overall reliability of our services.

Responsibilities
  • System Reliability: Implement and maintain highly available, scalable, and fault-tolerant systems on Azure.
  • Monitor system health and performance metrics to ensure reliability and proactively address issues.
  • Maintain a set of metrics and reporting to demonstrate the operational performance of the Incident & Problem Management processes.
    • Automation: Develop and maintain automation scripts and tools for provisioning, deployment, monitoring, and scaling of services.
    • Implement Infrastructure as Code (IaC) using tools like Azure Resource Manager templates to ensure consistent and reproducible environments.
    • Leverage AI-based automation to predict and prevent incidents before they impact customers.
      • Monitoring And Alerting: Configure and maintain monitoring solutions to provide real-time visibility into system health and performance.
      • Define and implement alerting strategies to detect and respond to incidents in a timely manner.
        • Incident Response: Respond to and resolve incidents, including root cause analysis, mitigation, and communication with stakeholders.
        • Develop and maintain incident response playbooks to streamline response processes.
        • Continue to develop robust Incident management processes that will enable effective management of our customers from an Incident & Problem perspective.
        • Ensure support issues are resolved within the contractual SLA's.
        • Conduct post-incident reviews and implement recommendations to prevent recurrence.
          • Security And Compliance: Ensure systems and infrastructure adhere to security best practices and compliance requirements.
          • Implement and maintain security controls, encryption, and access management mechanisms.
            • Continuous Improvement: Identify areas for optimization and implement solutions to improve system reliability, performance, and efficiency.
            • Participate in regular reviews and retrospectives to drive continuous improvement in processes and systems.
            • Drive continuous service improvement to work towards achieving operational excellence.
            • Maintain up-to-date knowledge of the latest technologies and best practices in application support.
            • Engaging with Development and Quality Assurance Teams on Support issues.
            Key Skills / Qualifications
            • Bachelor's degree in Computer Science, Engineering, or related field.
            • Proven experience as a Site Reliability Engineer or similar role, preferably in a SaaS environment.
            • Strong proficiency in Microsoft Azure services, including compute, networking, storage, and monitoring.
            • Experience with automation tools and scripting languages such as PowerShell
            • Solid understanding of containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.
            • Work with DevOps team to Improve CI/CD pipeline for reliability and deployment and efficiency.
            • Experience with Bicep/Terraform and ARM templates for Infrastructure as Code (IaC).
            • Hands-on experience with monitoring and logging tools such as Azure Monitor, Grafana, Prometheus, or Datadog
            • Knowledge of security best practices, compliance standards (e.g., ISO27001, SOC 2, GDPR), and relevant regulations.
            • Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
            • Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
            Preferred Qualifications
            • Azure certifications such as Azure Administrator Associate or Azure Solutions Architect Expert.
            • Familiarity with agile methodologies and agile development practices.
            • Knowledge of cloud-native architectures and microservices-based applications.
            • Experience with database technologies such as Azure SQL Database, Cosmos DB, or PostgreSQL.
            • Comfortable in a Client Facing Role as role, with the ability to join regular Service Desk Review meeting with Clients.
            • London Insurance Market Experience an advantage.
            Who We Are

DOCOsoft is a leading software and services provider to Lloyd's of London and the broader London insurance market. It was founded in 2008 and has since grown to become one of the leading insurance software specialists in the London Insurance Market. We are a growing team of approximately 95 with offices in London, Dublin, Tokyo, Portugal and Poland.

DOCOsoft aspires to be a market leader in the technology sector, and we are always looking for new ways to approach projects or improve existing content. We look to hire people that will help us achieve this with hard work, enthusiasm and an and expression of their own ideas.

What We Offer
  • The opportunity to impact our growing business - make your own stamp on the role/ company.
  • Exciting challenges to grow.
  • Exciting challenges to grow – we motivate and mentor junior members of the team, many of whom joined as interns and have progressed through the organisation.
  • A competitive salary.
  • Company pension.
  • Health Insurance.
  • Remote and flexible working.
  • 25 days annual leave.
Equal Opportunity Employer

DOCOsoft is committed to building an inclusive and diverse team that represents a variety of backgrounds, experiences and perspectives. We welcome applications from all suitably qualified candidates, and do not discriminate on the grounds of race, religion, gender, marital or family status, age, disability, sexual orientation, membership of the travelling community or any other basis as protected by applicable law. Should you require reasonable accommodations during any stage of the recruitment process, please let us know.

Other
  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Software Development

We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.


#J-18808-Ljbffr

  • Dublin, Dublin City, Ireland Fivetran Full time

    Join to apply for the Senior Site Reliability Engineer role at FivetranJoin to apply for the Senior Site Reliability Engineer role at FivetranGet AI-powered advice on this job and more exclusive features.From Fivetran's founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer...


  • Dublin, Dublin City, Ireland Fis Management Services Llc Full time

    Senior Site Reliability Engineer page is loadedSenior Site Reliability EngineerApply locations IRL DUBL 11-12 time type Full time posted on Posted 13 Days Ago job requisition id JR0300101Are you curious, motivated and forward-thinking?At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in financial services and...


  • Dublin, Dublin City, Ireland Microsoft Full time

    Join to apply for theSite Reliability Engineerrole atMicrosoftJoin to apply for theSite Reliability Engineerrole atMicrosoftGet AI-powered advice on this job and more exclusive features.Microsoft has been a leading company in computing for decades.We are a global service, relied on by governments, utilities, schools, and co-operatives to deliver the things...


  • Dublin, Dublin City, Ireland Crusoe Full time

    OverviewSenior Site Reliability Engineer at Crusoe in Dublin, Ireland. Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Be part of the AI revolution with sustainable technology at...


  • Dublin, Dublin City, Ireland Canonical Full time

    Senior Site Reliability / Gitops EngineerJoin to apply for the Senior Site Reliability / Gitops Engineer role at CanonicalSenior Site Reliability / Gitops Engineer1 day ago Be among the first 25 applicantsJoin to apply for the Senior Site Reliability / Gitops Engineer role at CanonicalCanonical is a leading provider of open source software and operating...


  • Dublin, Dublin City, Ireland Mason Alexander Full time

    OverviewWe're partnering with a leading SaaS platform company who are looking for a Senior Site Reliability Engineer to ensure the reliability, security, and performance of their AWS-hosted infrastructure.Base pay rangeDirect message the job poster from Mason AlexanderKey ResponsibilitiesManage and monitor AWS services (EC2, EKS, Lambda, RDS, etc.)Implement...


  • Dublin, Dublin City, Ireland eBay Full time

    Join to apply for the Site Reliability Engineer role at eBayJoin to apply for the Site Reliability Engineer role at eBayGet AI-powered advice on this job and more exclusive features.At eBay, we're more than a global ecommerce leader — we're changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190...


  • Dublin, Dublin City, Ireland Fivetran, Inc. Full time

    From Fivetran's founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer data arrives in their warehouses, canonical and ready to query, with no engineering or maintenance required. We're proud that more organizations continue to leverage our technology every day to become...


  • Dublin, Dublin City, Ireland GTreasury Full time

    Site Reliability Engineer (Dublin, Hybrid)Join to apply for the Site Reliability Engineer (Dublin, Hybrid) role at GTreasury.The mission, should you choose to accept it, is to pioneer and scale GTreasury's system and application observability efforts, and reduce toil amongst our operational workstreams. You will work across a global set of hard-driving...


  • Dublin, Dublin City, Ireland Google Full time

    Senior Software Engineer, Site Reliability EngineeringJoin to apply for the Senior Software Engineer, Site Reliability Engineering role at GoogleSenior Software Engineer, Site Reliability Engineering1 day ago Be among the first 25 applicantsJoin to apply for the Senior Software Engineer, Site Reliability Engineering role at GoogleGet AI-powered advice on...