Assistant Lead Engineer - Observability Dynatrace (Ops Response)
Date: 24 Jun 2026
Location: SG
Company: Synapxe
Position Overview
We are looking for a skilled Dynatrace / Cloud Observability & SRE Engineer to support, maintain, and enhance our observability, monitoring, and site reliability practices. The ideal candidate should have strong hands-on experience with Dynatrace, be familiar with AWS cloud environments, understand SRE principles, and have some exposure to AI Engineering concepts to support modern cloud-native and AI-enabled applications
Role & Responsibilities
- Maintain, configure, and optimize Dynatrace monitoring across applications, infrastructure, cloud services, containers, APIs, and integrations
- Ensure proper observability coverage for application performance, infrastructure health, user experience, logs, metrics, traces, service dependencies, and alerts.
- Set up and manage Dynatrace dashboards, alerts, synthetic monitoring, service-level objectives, anomaly detection, and root-cause analysis capabilities
- Define, monitor, and report on SRE metrics such as Service Level Indicators, Service Level Objectives, availability, latency, error rates, throughput, and error budgets
- Work with application, infrastructure, DevOps, security, and cloud teams to improve system reliability, scalability, performance, and resilience
- Participate in on-call or production support activities where required, including incident response and service restoration
- Improve alert quality by reducing noise, refining thresholds, implementing intelligent alerting, and ensuring alerts are actionable
- Support reliability engineering practices such as capacity planning, resilience testing, performance tuning, failover validation, and disaster recovery readiness
- Identify recurring operational issues and drive automation, self-healing, runbook improvements, and preventive engineering solutions
- Monitor AWS workloads and provide observability and reliability support for services such as EC2, ECS, EKS, Lambda, RDS, CloudWatch, S3, API Gateway, Load Balancers, and related cloud services
- Support Dynatrace integrations with AWS, CI/CD pipelines, ITSM platforms, incident management tools, and other enterprise monitoring solutions
- Collaborate with DevOps teams to embed monitoring, reliability checks, and performance validation into deployment pipelines
- Maintain documentation for monitoring standards, dashboards, alert rules, SLO definitions, runbooks, incident procedures, and operational best practices
- Explore opportunities to apply AI-assisted observability, automation, predictive monitoring, and intelligent incident respons
Requirements
- Hands-on experience maintaining and administering Dynatrace in an enterprise or cloud environment
- Good understanding of application performance monitoring, distributed tracing, logs, metrics, dashboards, and alerting
- Familiarity with AWS cloud services and cloud infrastructure monitoring
- Basic understanding of DevOps practices, CI/CD pipelines, containers, Kubernetes, and microservices architecture
- Experience with AWS CloudWatch, OpenTelemetry, Kubernetes, Docker, Terraform, or other infrastructure-as-code tools
- Knowledge of AI Engineering concepts such as AI/ML model deployment, inference APIs, vector databases, prompt engineering, or MLOps
- Familiarity with automation using Python, shell scripting, or API-based integrations
- Experience with ITSM tools such as ServiceNow or Jira
- Dynatrace certification or AWS certification would be an advantage
Apply Now
NOTE: It only takes a few minutes to apply for a meaningful career in HealthTech - GO FOR IT!!
#LI-SYNX40