Responsibilities:

  • Monitoring the performance and reliability of the company’s global online platforms
  • Monitoring the availability management, latency, efficiency, and change management
  • Monitoring the emergency response and capacity planning
  • Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests
  • Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues
  • Recording data and managing issues with a view to participation in reviews and blameless post-mortems
  • Exploring and delivering on opportunities to implement automation and scripting of services, environments and toolsets
  • Collaborating closely with the technology teams, stakeholders and wider teams to achieve their ambitious goals
  • Explaining complex technical details to non-technical stakeholders
  • Gaining exposure to their technical teams, working closely with software development, QA, Support and IT operations

Essential Requirements:

  • Experience with AWS supporting a production environment
  • Indepth, hands-on experience with Linux??
  • Strong experience with Docker
  • Nginx?experience
  • Experience building and maintaining CI/CD pipelines (preferably Jenkins)
  • Experience of infrastructure automation tooling (e.g., Terraform/Puppet)
  • Monitoring platforms?(ELK, Grafana or similar)

Desirable Requirements:

  • MySQL experience
  • Experience with API gateway products
  • Experience with message broker/event streaming platforms RabbitMQ/Kafka
  • Observability Platforms and Application Performance Management (preferably New Relic)?
  • Knowledge of Bash scripting
  • Working within an Agile environment using both Kanban and Scrum (preferably using Jira)

Responsibilities:

  • Monitoring the performance and reliability of the company’s global online platforms
  • Monitoring the availability management, latency, efficiency, and change management
  • Monitoring the emergency response and capacity planning
  • Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests
  • Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues
  • Recording data and managing issues with a view to participation in reviews and blameless post-mortems
  • Exploring and delivering on opportunities to implement automation and scripting of services, environments and toolsets
  • Collaborating closely with the technology teams, stakeholders and wider teams to achieve their ambitious goals
  • Explaining complex technical details to non-technical stakeholders
  • Gaining exposure to their technical teams, working closely with software development, QA, Support and IT operations

Essential Requirements:

  • Experience with AWS supporting a production environment
  • Indepth, hands-on experience with Linux??
  • Strong experience with Docker
  • Nginx?experience
  • Experience building and maintaining CI/CD pipelines (preferably Jenkins)
  • Experience of infrastructure automation tooling (e.g., Terraform/Puppet)
  • Monitoring platforms?(ELK, Grafana or similar)

Desirable Requirements:

  • MySQL experience
  • Experience with API gateway products
  • Experience with message broker/event streaming platforms RabbitMQ/Kafka
  • Observability Platforms and Application Performance Management (preferably New Relic)?
  • Knowledge of Bash scripting
  • Working within an Agile environment using both Kanban and Scrum (preferably using Jira)

Desired Skills:

  • AWS supporting a production
  • Linux??
  • Docker
  • Nginx?experience
  • CI/CD pipelines
  • infrastructure automation tooling

Learn more/Apply for this position