Assistant Lead Engineer - Observability Dynatrace (Ops Response)

Date: 24 Jun 2026

Location: SG

Company: Synapxe

Position Overview

We are looking for a skilled Dynatrace / Cloud Observability and SRE Engineer to support, maintain, and enhance our observability, monitoring, and site reliability practices. The ideal candidate has strong hands-on experience with Dynatrace, is familiar with AWS cloud environments, understands SRE principles, and has some exposure to AI Engineering concepts in support of modern cloud-native and AI-enabled applications.

Role & Responsibilities

Maintain, configure, and optimize Dynatrace monitoring across applications, infrastructure, cloud services, containers, APIs, and integrations
Ensure proper observability coverage for application performance, infrastructure health, user experience, logs, metrics, traces, service dependencies, and alerts
Set up and manage Dynatrace dashboards, alerts, synthetic monitoring, service-level objectives, anomaly detection, and root-cause analysis capabilities
Define, monitor, and report on SRE metrics such as Service Level Indicators, Service Level Objectives, availability, latency, error rates, throughput, and error budgets
Work with application, infrastructure, DevOps, security, and cloud teams to improve system reliability, scalability, performance, and resilience
Participate in on-call or production support activities where required, including incident response and service restoration
Improve alert quality by reducing noise, refining thresholds, implementing intelligent alerting, and ensuring alerts are actionable
Support reliability engineering practices such as capacity planning, resilience testing, performance tuning, failover validation, and disaster recovery readiness
Identify recurring operational issues and drive automation, self-healing, runbook improvements, and preventive engineering solutions
Monitor AWS workloads and provide observability and reliability support for services such as EC2, ECS, EKS, Lambda, RDS, CloudWatch, S3, API Gateway, Load Balancers, and related cloud services
Support Dynatrace integrations with AWS, CI/CD pipelines, ITSM platforms, incident management tools, and other enterprise monitoring solutions
Collaborate with DevOps teams to embed monitoring, reliability checks, and performance validation into deployment pipelines
Maintain documentation for monitoring standards, dashboards, alert rules, SLO definitions, runbooks, incident procedures, and operational best practices
Explore opportunities to apply AI-assisted observability, automation, predictive monitoring, and intelligent incident response

Requirements

Hands-on experience maintaining and administering Dynatrace in an enterprise or cloud environment
Good understanding of application performance monitoring, distributed tracing, logs, metrics, dashboards, and alerting
Familiarity with AWS cloud services and cloud infrastructure monitoring
Basic understanding of DevOps practices, CI/CD pipelines, containers, Kubernetes, and microservices architecture
Experience with AWS CloudWatch, OpenTelemetry, Kubernetes, Docker, or infrastructure-as-code tools such as Terraform
Knowledge of AI Engineering concepts such as AI/ML model deployment, inference APIs, vector databases, prompt engineering, or MLOps
Familiarity with automation using Python, shell scripting, or API-based integrations
Experience with ITSM tools such as ServiceNow or Jira
Dynatrace certification or AWS certification would be an advantage

NOTE: It only takes a few minutes to apply for a meaningful career in HealthTech - GO FOR IT!!
