Vacancy: Platform Systems Engineer – Global Services Tools (Observability & Analysis)
Location: Centurion Eco Park, Gauteng
Job Type: Permanent, Hybrid
Introduction:
We are seeking a skilled, results-driven Platform Systems Engineer with a strong background in Linux environments and experience with monitoring, observability, and automation tools. The role sits in the Global Services (GS) Tools team within Business IT and focuses on the implementation, configuration, and lifecycle management of global IT operations tools.
You will play a pivotal role in standardizing and enabling observability platforms across global regions, integrating solutions into the Global Service Desk, and supporting the business in driving performance, visibility, and operational excellence.
Key Responsibilities:
Implementation & Operations
- Implement and configure monitoring and observability tools across new and existing projects.
- Productize platforms as cloud-native services (containerization, automation, orchestration).
- Integrate tools into the Global Service Desk and ensure alignment with architectural standards.
- Support global enablement through coaching, documentation, and training of regional teams.
Monitoring, Observability & Analysis
- Lead observability and monitoring tool deployments (e.g., Prometheus, Grafana, Icinga2, Nagios).
- Implement system performance monitoring and analysis mechanisms.
- Enable proactive detection and resolution of operational issues through proper capacity planning and reporting.
- Ensure adherence to monitoring protocols, policies, and blueprints.
Reporting & Documentation
- Produce regular and ad-hoc reports for internal stakeholders and management.
- Ensure performance metrics, SLAs, and operational objectives are met through reliable data analysis.
- Identify and address process exceptions, deviations, and risks via continual improvement initiatives.
- Develop and maintain SOPs, implementation guides, and troubleshooting documentation.
Collaboration & Support
- Work closely with cross-functional teams including service desk, DevOps, infrastructure, and project teams.
- Contribute to project planning, coordination, and execution of monitoring requirements.
- Support global initiatives such as the Global Solution Enablement ART, ensuring alignment with tool strategies.
Technical Requirements:
- Proficiency in Linux (required) and Windows (advantageous).
- Hands-on experience with monitoring tools such as Icinga2, Nagios, Prometheus, and Grafana.
- Scripting skills in Bash and Python for task automation.
- Familiarity with Docker, Kubernetes, and microservice architecture.
- Knowledge of network protocols (e.g., TCP/IP, SNMP).
- Understanding of the Software Development Lifecycle (SDLC) and IT operations best practices.
- Ability to manage monitoring agents, tune performance, and automate recurring processes.
Minimum Requirements:
- 3–5 years of experience in an IT monitoring or observability role.
- Proven experience in Linux-based environments.
- Experience supporting or deploying observability tools across enterprise-scale systems.
- ITIL Foundation Certification (preferred).
Nice to Have:
- Exposure to cloud-native monitoring tools (Azure Monitor, AWS CloudWatch).
- Experience in global IT environments or shared service models.
- Knowledge of ITSM platforms and integration with service desks.
- Previous involvement in training or enablement roles.
Desired Skills:
- Monitoring & Observability
- Linux & Scripting Automation
- Prometheus