Cloud Architect for Observability and Reliability

4 days ago


Dublin, Dublin City, Ireland beBeeObservability Full time €96,000 - €124,000
Job Description

We are seeking a highly experienced technical architect to lead the design, strategy, and implementation of observability and SRE frameworks for enterprise-scale, microservices-based applications.

The ideal candidate will bring deep technical knowledge of both the Splunk Observability stack and open-source tools such as OpenTelemetry, Prometheus, Grafana, and Jaeger, and will be capable of defining and executing architecture strategies for complex distributed systems.

This role requires hands-on ability to create architecture blueprints, lead technical teams, and work directly with stakeholders and platform owners to embed observability and reliability practices across the SDLC.

Responsibilities
  • Architecture & Blueprinting
    • Design and deliver end-to-end observability architecture (Metrics, Logs, Traces, Events) for cloud-native and hybrid environments.
    • Create technical architecture diagrams, data flow maps, and integration blueprints using tools like Lucidchart or Visio.
    • Lead the definition of SLIs, SLOs, and Error Budgets aligned with business KPIs and DORA metrics.
  • Toolchain Strategy & Implementation
    • Architect telemetry pipelines using OpenTelemetry Collector and Splunk Observability Cloud (SignalFx, APM, RUM, Log Observer).
    • Define tool adoption strategy and integration roadmap for OSS tools (Prometheus, Loki, Grafana, Jaeger) and Splunk-based stacks.
    • Guide teams on instrumentation approaches (automatic and manual) across languages and runtimes such as Java, Go, Python, and .NET.
  • Reliability Engineering Enablement
    • Lead adoption of SRE principles including incident management frameworks, resiliency testing, and runbook automation.
    • Collaborate with DevOps to integrate observability into CI/CD pipelines (e.g., Jenkins, ArgoCD, GitHub Actions).
    • Define health checks, golden signals, and SPoG (Single Pane of Glass) dashboards.
    • Exposure to AIOps, ML-based anomaly detection, or business observability is a plus.
  • Stakeholder Management & Governance
    • Serve as a technical liaison between leadership, SREs, developers, and infrastructure teams.
    • Run workshops and assessments, and evangelize an observability-first culture across teams.
    • Provide guidance on data retention, access control, cost optimization, and compliance (especially with Splunk ingestion policies).
  • Performance & Optimization
    • Continuously monitor and fine-tune observability data flows to prevent alert fatigue and ensure actionability.
    • Implement root cause analysis practices using telemetry correlation across metrics, logs, and traces.
    • Lead efforts to build self-healing systems using automated playbooks and AIOps integrations (where applicable).
Required Skills & Qualifications
  • 15+ years in IT, including 5+ years in observability/SRE architecture roles.
  • Proven experience designing architecture for microservices, containers (Docker, Kubernetes), and distributed systems.
  • Strong hands-on expertise with:
    • Splunk Observability Cloud (SignalFx, Log Observer, APM).
    • OpenTelemetry (SDKs + Collector).
    • Prometheus + Grafana.
    • Jaeger / Zipkin for distributed tracing.
    • CI/CD tools: Jenkins, GitHub Actions, ArgoCD.
  • Ability to build and present clear architecture diagrams and solution roadmaps.
  • Working knowledge of cloud environments (AWS, Azure, GCP) and container orchestration (K8s/OpenShift).
  • Familiarity with SRE and DevOps best practices (error budgets, release engineering, chaos testing).
Benefits

This role offers a unique opportunity to shape the future of observability and SRE at our organization. As a key member of our team, you will have the chance to:

  • Unify legacy and modern telemetry stacks.
  • Drive a reliability-first mindset and tooling.
  • Establish a scalable blueprint for production excellence.
About Us

We are a trusted global innovator of business and technology services, serving 75% of the Fortune Global 100. Our diverse experts in over 50 countries and robust partner ecosystem enable us to deliver innovative solutions that drive long-term success. We value collaboration, adaptability, and innovation in everything we do.


