Data Engineer - Python at Palindrome Data

Data Engineer Job Responsibilities:

Core Responsibilities

Develops and maintains scalable data pipelines
Builds out new API integrations to support continuing increases in data volume and complexity
Collaborates with analytics and business teams to improve understanding and documentation of source data models
Implements processes and systems to monitor data quality and pipeline performance
Writes unit/integration tests, contributes to engineering wiki, and documents work
Performs data analysis required to troubleshoot data-related issues and assist in the resolution of data issues
Performs data investigations following leads, leaks and gremlins
Defines company data assets (data models) and jobs to populate data models
Manages code repositories and provides a shining example of best practices (and figures out how to sort things out when we’re falling short)
Is a pleasure to work with

Ancillary Responsibilities

Support and contribute to the data modelling process
Designs and evaluates open source and vendor tools for data lineage.
Manage Linux Virtual machines and improve performance and package management
Self-learning and research on the contextual public health topics we’re working in
Is patient and can train up junior staff or senior stakeholders on difficult technical concepts

We work remotely and want to set you up to be as productive as you can be, but also have a clear separation between work time and not work time. Our approach and hours are flexible to accommodate most personal commitments. We wish you to start 1st of May.

Data Engineer Requirements:

Outstanding teamwork and attitude
Experience with or knowledge of Agile Software Development methodologies
Excellent problem solving and troubleshooting skills
Process-oriented with great documentation skills
Professional written communication skills in English with a keen sense of customer service
Fluent in English (but 1st language not required), other languages a bonus
A desire to support and learn more about data analytics and machine learning is a huge advantage
A passion for public health
Can legally work in South Africa

Ideal Experience:

A degree in Computer Science or engineering is a bonus or relevant diplomas with additional experience are also welcome to apply
2+ years of any sort of data experience, with demonstrated projects is an absolute must
Familiarity with data exploration, engineering and ETL libraries and workflows are highly valued
Python development experience
Pandas or Data Science experience a bonus
SQL experience (No-SQL experience is a plus)
Git version control experience
If you don’t have the above experience but are convinced you have the talent and interest in our mission, please still apply
We strongly encourage folks with non-traditional backgrounds to apply

This application will require:

Your CV,
Cover letter describing your interest in data science, and include
A short paragraph which data processing tool you think is undervalued and should be considered more often

If you are looking for a challenge and to grow your experience on hard problems, we’re excited to chat. If you are looking for an easy job doing the same thing daily this is probably not for you. Get after it.

Desired Skills:

data engineering
Software Engineering
Software Development
Big data
Python
data science
Data engineering
Big Data Analytics
Data wrangling
Data Modeling
ELT
Big Data Development

Desired Work Experience:

2 to 5 years

Desired Qualification Level:

Diploma

About The Employer:

Palindrome Data uses contemporary machine learning to revolutionise traditional public health approaches. We currently have a focus on HIV and Maternal and Child Health programmes, and our work thus far has been in South Africa and the surrounding region.

We’re growing the team and need a sharp Data Engineer to bolster our preparation, management and understanding of large, difficult datasets. We’re still a start-up and expect this position will gain a huge amount of experience in both data management and data science as the demand for our services grows. [URL Removed]

Learn more/Apply for this position

Data Engineer – Python at Palindrome Data