Site Reliability Engineer (Dynatrace Specialist)
Are you passionate about ensuring the reliability and performance of large-scale distributed systems? Apply Now!
Working with one of our top financial clients, the Site Reliability Engineer (Dynatrace Specialist) will play a crucial part in maintaining and enhancing the observability, stability, and efficiency of critical business applications. This position involves working with advanced monitoring tools, developing proactive solutions, and collaborating across teams to uphold high standards of system reliability. The role offers the opportunity to lead monitoring initiatives within a dynamic financial environment and to build expertise in SRE practices.
Responsibilities
- Monitor application transaction flows to detect anomalies and swiftly resolve errors.
- Leverage Dynatrace to enhance reporting on critical transactions and system performance.
- Ensure the accuracy and reliability of key business dashboards and service-level metrics.
- Configure and maintain Dynatrace components including OneAgent, ActiveGate, dashboards, synthetic monitoring, Real User Monitoring (RUM), and distributed tracing.
- Develop custom alerting rules, anomaly detection, and service-level dashboards aligned with SLOs and SLIs.
- Build comprehensive end-to-end observability solutions utilizing Dynatrace’s full-stack capabilities.
- Create performance baselines, analyze trends, and identify opportunities for system optimization.
- Integrate Dynatrace with CI/CD pipelines, incident management systems, and ticketing tools such as ServiceNow and JIRA.
- Conduct root cause analysis to diagnose issues and drive strategic remediation efforts.
- Define and establish service-level objectives and reliability standards for critical systems.
- Develop best practices and internal standards for monitoring, tracing, and performance engineering.
Desired Skill-Set
- 3-5 years of hands-on experience with Dynatrace.
- Expertise across all Dynatrace phases: Infrastructure Monitoring, Synthetic Monitoring, Real User Monitoring, OS, DB, and Incident Management.
- Strong familiarity with Dynatrace integration with ServiceNow and JIRA.
- Extensive experience utilizing Dynatrace Davis AI for anomaly detection and automated insights.
- Proficiency in Dynatrace Site Reliability Guardian (SRG) and workflows, including defining guardians and objectives and integrating them into deployment pipelines for go/no-go decisions.
- Deep understanding of Dynatrace WCCS framework (Gen-3 Dashboards) from a business-centric perspective to enable full E2E observability.
- Knowledge of SRE golden signals across the application, web, and database tiers, and of configuring alerts against SLOs, SLIs, and SLAs.
- Skilled in writing and using Dynatrace Query Language (DQL) for advanced data queries.
Nice to Have
- Experience working within financial institutions.
- Relevant technical degree.
- Certification in Dynatrace (Certified Associate or Professional).
BeachHead is an equal opportunity agency and employer. We advocate for our candidates and welcome applicants regardless of race, color, religion, national origin, sex, age, or physical or mental disability. BeachHead or our clients may use technology-enabled tools, including automation and artificial intelligence (AI), to support parts of the recruitment process such as resume screening, application management, and candidate matching. These tools assist our recruiters and our clients, and do not replace human decision-making. This job posting represents a current or anticipated vacancy. The position may be filled at any time, and the posting may be removed without notice once the role has been filled.