Data Engineer Job Responsibilities:

Core Responsibilities

  • Develops and maintains scalable data pipelines
  • Builds out new API integrations to support continuing increases in data volume and complexity
  • Collaborates with analytics and business teams to improve understanding and documentation of source data models
  • Implements processes and systems to monitor data quality and pipeline performance
  • Writes unit/integration tests, contributes to engineering wiki, and documents work
  • Performs data analysis required to troubleshoot data-related issues and assist in the resolution of data issues
  • Performs data investigations following leads, leaks and gremlins
  • Defines company data assets (data models) and jobs to populate data models
  • Manages code repositories and provides a shining example of best practices (and figures out how to sort things out when we’re falling short)
  • Is a pleasure to work with

Ancillary Responsibilities

  • Support and contribute to the data modelling process
  • Designs and evaluates open source and vendor tools for data lineage.
  • Manage Linux Virtual machines and improve performance and package management
  • Self-learning and research on the contextual public health topics we’re working in
  • Is patient and can train up junior staff or senior stakeholders on difficult technical concepts

We work remotely and want to set you up to be as productive as you can be, but also have a clear separation between work time and not work time. Our approach and hours are flexible to accommodate most personal commitments. We wish you to start 1st of May.

Data Engineer Requirements:

  • Outstanding teamwork and attitude
  • Experience with or knowledge of Agile Software Development methodologies
  • Excellent problem solving and troubleshooting skills
  • Process-oriented with great documentation skills
  • Professional written communication skills in English with a keen sense of customer service
  • Fluent in English (but 1st language not required), other languages a bonus
  • A desire to support and learn more about data analytics and machine learning is a huge advantage
  • A passion for public health
  • Can legally work in South Africa

Ideal Experience:

  • A degree in Computer Science or engineering is a bonus or relevant diplomas with additional experience are also welcome to apply
  • 2+ years of any sort of data experience, with demonstrated projects is an absolute must
  • Familiarity with data exploration, engineering and ETL libraries and workflows are highly valued
  • Python development experience
  • Pandas or Data Science experience a bonus
  • SQL experience (No-SQL experience is a plus)
  • Git version control experience
  • If you don’t have the above experience but are convinced you have the talent and interest in our mission, please still apply
  • We strongly encourage folks with non-traditional backgrounds to apply

This application will require:

  • Your CV,
  • Cover letter describing your interest in data science, and include
  • A short paragraph which data processing tool you think is undervalued and should be considered more often

If you are looking for a challenge and to grow your experience on hard problems, we’re excited to chat. If you are looking for an easy job doing the same thing daily this is probably not for you. Get after it.

Desired Skills:

  • data engineering
  • Software Engineering
  • Software Development
  • Big data
  • Python
  • data science
  • Data engineering
  • Big Data Analytics
  • Data wrangling
  • Data Modeling
  • ELT
  • Big Data Development

Desired Work Experience:

  • 2 to 5 years

Desired Qualification Level:

  • Diploma

About The Employer:

Palindrome Data uses contemporary machine learning to revolutionise traditional public health approaches. We currently have a focus on HIV and Maternal and Child Health programmes, and our work thus far has been in South Africa and the surrounding region.

We’re growing the team and need a sharp Data Engineer to bolster our preparation, management and understanding of large, difficult datasets. We’re still a start-up and expect this position will gain a huge amount of experience in both data management and data science as the demand for our services grows. [URL Removed]

Learn more/Apply for this position