Assistant Lead Engineer - Observability Dynatrace (Ops Response)

Date: 24 Jun 2026

Location: SG

Company: Synapxe

Position Overview

We are looking for a skilled Dynatrace / Cloud Observability & SRE Engineer to support, maintain, and enhance our observability, monitoring, and site reliability practices. The ideal candidate should have strong hands-on experience with Dynatrace, be familiar with AWS cloud environments, understand SRE principles, and have some exposure to AI Engineering concepts to support modern cloud-native and AI-enabled applications

Role & Responsibilities

  • Maintain, configure, and optimize Dynatrace monitoring across applications, infrastructure, cloud services, containers, APIs, and integrations
  • Ensure proper observability coverage for application performance, infrastructure health, user experience, logs, metrics, traces, service dependencies, and alerts.
  • Set up and manage Dynatrace dashboards, alerts, synthetic monitoring, service-level objectives, anomaly detection, and root-cause analysis capabilities
  • Define, monitor, and report on SRE metrics such as Service Level Indicators, Service Level Objectives, availability, latency, error rates, throughput, and error budgets
  • Work with application, infrastructure, DevOps, security, and cloud teams to improve system reliability, scalability, performance, and resilience
  • Participate in on-call or production support activities where required, including incident response and service restoration
  • Improve alert quality by reducing noise, refining thresholds, implementing intelligent alerting, and ensuring alerts are actionable
  • Support reliability engineering practices such as capacity planning, resilience testing, performance tuning, failover validation, and disaster recovery readiness
  • Identify recurring operational issues and drive automation, self-healing, runbook improvements, and preventive engineering solutions
  • Monitor AWS workloads and provide observability and reliability support for services such as EC2, ECS, EKS, Lambda, RDS, CloudWatch, S3, API Gateway, Load Balancers, and related cloud services
  • Support Dynatrace integrations with AWS, CI/CD pipelines, ITSM platforms, incident management tools, and other enterprise monitoring solutions
  • Collaborate with DevOps teams to embed monitoring, reliability checks, and performance validation into deployment pipelines
  • Maintain documentation for monitoring standards, dashboards, alert rules, SLO definitions, runbooks, incident procedures, and operational best practices
  • Explore opportunities to apply AI-assisted observability, automation, predictive monitoring, and intelligent incident respons

Requirements

  • Hands-on experience maintaining and administering Dynatrace in an enterprise or cloud environment
  • Good understanding of application performance monitoring, distributed tracing, logs, metrics, dashboards, and alerting
  • Familiarity with AWS cloud services and cloud infrastructure monitoring
  • Basic understanding of DevOps practices, CI/CD pipelines, containers, Kubernetes, and microservices architecture
  • Experience with AWS CloudWatch, OpenTelemetry, Kubernetes, Docker, Terraform, or other infrastructure-as-code tools
  • Knowledge of AI Engineering concepts such as AI/ML model deployment, inference APIs, vector databases, prompt engineering, or MLOps
  • Familiarity with automation using Python, shell scripting, or API-based integrations
  • Experience with ITSM tools such as ServiceNow or Jira
  • Dynatrace certification or AWS certification would be an advantage

Apply Now

NOTE: It only takes a few minutes to apply for a meaningful career in HealthTech - GO FOR IT!!

#LI-SYNX40