Wildberries

Data Engineer

6.0/10

Wildberries

Not specified
Remote
mid
7 days ago
analyticsdevtechAirflowHadoopSparkYarnHdfsGreenplumTrino

AI Summary

The vacancy is detailed in responsibilities and tech stack but lacks compensation clarity and broader company context.

Check Match โ€” Just drop your CV

See your fit for Data Engineer in seconds.

Description

Responsibilities

  • โ€ขSupport pipelines on Greenplum;
  • โ€ขMaintain and optimize existing ETL/ELT processes: monitoring, diagnosing degradations, partitioning, working with the catalog;
  • โ€ขIntegrate new sources;
  • โ€ขConnect new product teams and external sources: technical research, design integration schemes, data contracts.
  • โ€ขInteract with source owners on technical requirements;
  • โ€ขParticipate in migration: redesign layers for Iceberg (partitioning, schema evolution, snapshot management), understand MPP vs object storage trade-offs.

Requirements

Requirements

  • โ€ขExperience with Airflow as an orchestrator;
  • โ€ขExperience with Hadoop (Spark/Yarn/Hdfs);
  • โ€ขExperience with Greenplum or other MPP systems;
  • โ€ขWorked with Trino as a query engine.
Loading similar jobs...