Data Engineer at abra
Join abra R&D as a Data Engineer!
We're seeking a skilled Data Engineer to support cutting-edge AI initiatives by building and optimizing data infrastructure.
In this role, you'll process massive datasets, conduct in-depth analysis, and work alongside Data Scientists to create robust data solutions. Your expertise will be crucial in developing high-performance data pipelines across cloud and on-premises systems.
- 5+ years professional experience in Data Engineering – required
- 5 years working with OOP languages – required
- 5 years Python development experience – required
- Proven Spark expertise for big data processing – required
- 2+ years AWS experience (Athena, Glue, Step Functions, EMR, Redshift, RDS) – significant plus
- Strong background in designing, building and tuning large-scale data solutions
- Knowledge of performance optimization, plus experience with data partitioning and storage formats such as Parquet, Avro, HDF5, and Delta Lake
- Proficiency with Docker, Linux, CI/CD pipelines, and Kubernetes
- Experience using data orchestration tools such as Airflow or Kubeflow
- Bachelor's degree in Computer Science, Engineering, Math, or Statistics – required
- Understanding of ML principles and workflows
- Knowledge of GenAI technologies or prompt engineering – beneficial