Opportunity Description
Data Pipeline Development & Management: Design, implement, and maintain scalable and reliable data pipelines to ingest, transform and load structured, unstructured, and real‑time data feeds from diverse sources.
Manage data pipelines for analytics and operational use ensuring data integrity, timeliness, and accuracy across systems.
Implement data quality tools and validation frameworks within transformation pipelines.
Data Processing & OptimizationBuild efficient high‑performance systems by leveraging techniques such as data denormalization, partitioning, caching and parallel processing.
Develop stream‑processing applications using Apache Kafka and optimize performance for large‑scale datasets.
Enable data enrichment and correlation across primary, secondary and tertiary sources.
Cloud Infrastructure and Platform EngineeringDevelop and deploy data workflows on AWS or GCP using services such as S3, Redshi...