Senior Data Engineer
Background
Re-Hub, a DLG company, helps global luxury and premium brands in China to measure their digital competence and generate growth opportunities using AI and data.
We have been growing our China business rapidly over the last 12 months, working with leading global brands such as Gucci, Saint-Laurent and Coach. We developed an Insights-as-a-Service data intelligence platform that uses data from a variety of channels in the Chinese digital ecosystem (social, e-commerce, retail) to deliver actionable insights that help functional teams build competitive advantages.
Role
We are looking for a Senior Data Engineer to make a significant contribution to our core SaaS data platform. The main focus is ownership of key data processing pipelines, including their maintenance, enhancement and optimization. We also expect substantial involvement in new development initiatives that require database architecture, design and development.
The role reports directly to our Chief Product Officer, based in Shanghai.
Responsibilities
- Design and development of data processing, data cleansing, data transformation and QA scripts using Python and SQL
- Operation, maintenance and enhancement of existing mission-critical data pipelines and ETLs, including:
  - Monitoring and regular maintenance of data pipelines, ensuring data reliability and quality standards
  - Enhancements to meet evolving business requirements
  - Optimization of ETLs for performance
  - Migration of legacy data pipelines to a more scalable architecture
- Typical data engineering responsibilities, including:
  - SQL database design, maintenance and optimization
  - Fetching datasets from external data sources (REST APIs, JSON or CSV files)
  - Performing data migrations and batch data updates
- Exploratory projects, including:
  - Analysis of various datasets to support business requirements
  - Developing proofs of concept for new data pipelines
- Working with other data team members to complete common deliverables and provide technical expertise
Skills & Experience
Mandatory:
- Graduate in Computer Science, Information Systems, Statistics, Mathematics, or related fields
- At least 5 years of relevant experience in data engineering, database or ETL development
- Excellent skills in Python (pandas, numpy)
- Excellent skills in SQL and database design/development
- Experience with ETL development and maintenance
- Experience with Apache Airflow
Preferred:
- Experience with PostgreSQL
- Experience with AliCloud, AWS or similar cloud platforms
- Experience with dbt is a plus
- Experience with Big Data processing technologies (BigQuery, Databricks, Spark, Redis, Kafka) is a plus
General:
- Ability to work independently and with team members from different backgrounds
- Proficient in English (written) and Chinese
- Passionate about new technologies and innovation
What we offer
- Work with leading global brands in the highly disruptive Chinese digital ecosystem
- Join a highly passionate and energetic team in a fast-growing company
- Fun, vibrant, multi-cultural team
- Competitive package, performance-based bonus