Data Engineer

Here's what we'd like you to do:

  • Excellent understanding of database and data warehouse technologies, including standard row RDBMS, columnar RDBMS and some form of NoSQL (e.g. key-value, document, graph stores)

  • Strong experience developing ETL scripts in Apache Airflow or similar open-source ETL platform

  • Strong experience of Python 3 with experience of writing large systems

  • Experience using Google BigQuery or similar technology, including schema design

  • Experience using MySQL, including schema design

  • Some experience mentoring or training junior team members

This is what you'll have done already:

 

  • Excellent understanding of database and data warehouse technologies, including standard row RDBMS, columnar RDBMS and some form of NoSQL (e.g. key-value, document, graph stores)

  • Strong experience developing ETL scripts in Apache Airflow or similar open-source ETL platform

  • Strong experience of Python 3 with experience of writing large systems

  • Experience using Google BigQuery or similar technology, including schema design

  • Experience using MySQL, including schema design

  • Some experience mentoring or training junior team members

  • Good understanding of Google Cloud Platform or AWS including setting up compute and storage instances from scratch

  • Good understanding of Linux-based platforms

  • Experience of at least one commercial BI platform

  • Experience writing code which consume 3rd party APIs at scale

  • Some experience working directly with data on statistical problems

  • Ideally some familiarity with machine learning techniques such as logistic regression or random forests

  • Computer Science or Maths degree