SRE & Observability Architect

4 weeks ago


Dublin, Dublin City, Ireland NTT America, Inc. Full time

Overview
Job Title:
Technical Architect – Observability & SRE Frameworks
Position Title:
Technical Architect – Observability & Site Reliability Engineering (SRE)
Location:
Dublin, Ireland
Experience:
15+ years (including 5+ years in observability/SRE architecture)
Employment Type:
Full-time
Responsibilities
Architecture & Blueprinting
Design and deliver end-to-end observability architecture (Metrics, Logs, Traces, Events) for cloud-native and hybrid environments.
Create technical architecture diagrams, data flow maps, and integration blueprints using Lucidchart, Draw.io, or Visio.
Lead the definition of SLIs, SLOs, and Error Budgets aligned with business KPIs and DORA metrics.
Toolchain Strategy & Implementation
Architect telemetry pipelines using OpenTelemetry Collector and Splunk Observability Cloud (SignalFx, APM, RUM, Log Observer).
Define tool adoption strategy and integration roadmap for OSS tools (Prometheus, Loki, Grafana, Jaeger) and Splunk-based stacks.
Guide teams on instrumentation approaches (auto/manual) across languages like Java, Go, Python, .NET, etc.
Reliability Engineering Enablement
Lead adoption of SRE principles including incident management frameworks, resiliency testing, and runbook automation.
Collaborate with DevOps to integrate observability into CI/CD pipelines (e.g., Jenkins, ArgoCD, GitHub Actions).
Define health checks, golden signals, and SPoG (Single Pane of Glass) dashboards.
Exposure to AIOps, ML-based anomaly detection, or business observability.
Stakeholder Management & Governance
Serve as a technical liaison between client leadership, SREs, developers, and infrastructure teams.
Run workshops and assessments, and evangelize an observability-first culture across teams.
Provide guidance on data retention, access control, cost optimization, and compliance (especially with Splunk ingestion policies).
Performance & Optimization
Continuously monitor and fine-tune observability data flows to prevent alert fatigue and ensure actionability.
Implement root cause analysis practices using telemetry correlation across metrics, logs, and traces.
Lead efforts to build self-healing systems using automated playbooks and AIOps integrations (where applicable).
Required Skills & Qualifications
15+ years in IT, including 5+ years in Observability/SRE architecture roles
Proven experience designing architecture for microservices, containers (Docker, Kubernetes), and distributed systems
Strong hands-on expertise with:
Splunk Observability Cloud (SignalFx, Log Observer, APM)
OpenTelemetry (SDKs + Collector)
Prometheus + Grafana
Jaeger / Zipkin
CI/CD tools: Jenkins, GitHub Actions, ArgoCD
Ability to build and present clear architecture diagrams and solution roadmaps
Working knowledge of cloud environments (AWS, Azure, GCP) and container orchestration (K8s/OpenShift)
Familiarity with SRE and DevOps best practices (error budgets, release engineering, chaos testing)
Nice to Have
Splunk certifications: Core Consultant, Observability Specialist, Admin
Knowledge of ITIL and modern incident management frameworks (PagerDuty, OpsGenie)
Experience in banking or regulated enterprise environments
Soft Skills
Strong leadership and cross-functional collaboration
Ability to work in ambiguous, fast-paced environments
Excellent documentation and communication skills
Passion for mentoring teams and building best practices at scale
Why This Role Matters
The client is on a journey to mature its Observability and SRE ecosystem, and this role will be critical in:
Unifying legacy and modern telemetry stacks
Driving a reliability-first mindset and tooling
Establishing a scalable blueprint for production excellence
About NTT DATA
NTT DATA is a $30 billion trusted global innovator of business and technology services.
We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success.
As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies.
Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity.
NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future.
Visit us at us.nttdata.com
Application & EEO
NTT DATA is an equal opportunity employer.
Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.


