About the Company
At Sourcing Trust, we are committed to delivering innovative, reliable, and tailored technology solutions that empower businesses to succeed in a rapidly evolving digital landscape. With a focus on excellence, integrity, and collaboration, we build lasting partnerships by understanding our clients' unique needs and providing them with expert support. Our team is dedicated to fostering a positive and inclusive work environment where every employee's contribution is valued, encouraging continuous growth, learning, and shared success. Join us and be part of a passionate organization driven by innovation and excellence.
About the Role
We are looking for an Observability Specialist to administer and evolve Dynatrace (instrumentation, alerting, dashboards, tagging), implement alert governance to reduce noise and false positives, manage Grafana dashboards, operate Zabbix integrations, and build operational/executive dashboards with KPIs. The role involves collaboration with platform/automation teams for event correlation and continuous improvement in observability practices.
Main Responsibilities
Dynatrace administration and evolution: instrumentation, alerting, problem notifications, dashboards, tagging.
Implement/maintain alert governance: alert matrix (incident triggers, actions, owners), thresholds/windows/severity, noise reduction.
Grafana management/consolidation: dashboard governance, naming, owners, versioning.
Zabbix operation/evolution and signal integration with operational channels.
Integrations with external tools (e.g., OnePoint/ITSM/communication channels).
Build operational/executive dashboards with technical/service-health KPIs (business-aware when applicable).
Collaborate with Platform/Automation for event→action correlation and continuous improvement.
Deliverables
Dashboards by domain (critical apps, core infra, capacity, availability).
Alert matrix and observability governance policy.
Response runbooks by alert type (with ownership).
Effectiveness reporting: noise, coverage, incident quality, diagnosis times.
Requirements
Mandatory Professional Experience
5+ years in monitoring/observability/SRE/ops analytics.
2+ years hands-on with Dynatrace in production (admin/config/alerting/dashboards).
Proven experience reducing noise and improving incident quality.
Preferred Experience
Integrations with ITSM (ServiceNow) and event management.
Advanced Grafana/Zabbix and governance practices.
Dynatrace/SRE certifications.
Technical Skills
End-to-end observability, event correlation, basic infra/app troubleshooting.
Metrics modeling; experience with SLIs/SLOs is valued.
Soft Skills
Signal quality mindset, pragmatism, action focus.
Ability to work cross-functionally and communicate clearly.
Language Requirements
Fluent Portuguese and English.
