Data Engineer Job at VDart Inc, Remote

  • VDart Inc
  • Remote

Job Description

Title: Data Engineer

Location: Remote

Duration: 6 Months

Work Description:

We are in the process of migrating off CRMA Data Manager by rewriting queries and implementing the required data transformations in AWS. This platform modernization effort includes working through a backlog of datasets that must be migrated to AWS and transformed to meet current and future reporting needs.

Business Knowledge:

Limited business knowledge is needed.

Technical Skills:

Must-Have Technical Skills:

AWS Data Services (Hands-on)

  • S3: Data lake design, partitioning strategies, lifecycle management
  • IAM: Roles & policies, least-privilege access, cross-account access
  • Glue / EMR: Crawlers, Data Catalog, ETL job development
  • Athena: Querying data lakes with performance and cost optimization
  • Lake Formation: Basic governance and permission management

Compute & Processing

  • Apache Spark (PySpark): Batch processing, performance tuning, joins, partitioning
  • Python: Production-grade coding (packaging, testing, logging, type hints)
  • SQL: Advanced querying (window functions, query optimization, data modeling support)

Orchestration & Scheduling

  • Airflow / MWAA / AWS Step Functions: DAG design, retry mechanisms, SLA management, backfills

Data Warehousing & Modeling

  • Redshift / Snowflake (on AWS): Fundamentals and performance considerations
  • Dimensional Modeling: Star/Snowflake schema design

ETL/ELT Patterns:

  • CDC (Change Data Capture)
  • SCD (Slowly Changing Dimensions)
  • Idempotent data pipelines

Data Reliability & Observability

  • Data quality frameworks: Great Expectations / Deequ (or equivalent)
  • Data reconciliation & validation
  • Monitoring & observability: CloudWatch logs, metrics, alerts

DevOps & Delivery

  • Version Control: Git, branching strategies, code reviews
  • CI/CD: Data pipeline automation (e.g., GitLab CI/CD)
  • Infrastructure-as-Code: OpenTofu / CloudFormation for AWS resource deployment

Security & Compliance

  • Encryption: At rest & in transit (KMS)
  • Secrets management: AWS Secrets Manager / SSM
  • Networking fundamentals: VPC, private subnets, endpoints (data access control)

Role Expectations (Hands-on Experience Required):

  • Designed, developed, and maintained production-grade ETL pipelines using AWS Glue (PySpark)
  • Built scalable data ingestion pipelines from S3, databases, and streaming sources into S3 data lakes
  • Implemented complex transformations and joins in PySpark, optimizing performance (partitioning, broadcast joins, caching)
  • Developed incremental and idempotent pipelines, including handling CDC and SCD
  • Automated schema discovery using Glue Crawlers and Data Catalog
  • Tuned Glue Spark jobs for performance, concurrency, and cost efficiency
  • Integrated pipelines with orchestration tools like Airflow (MWAA) or Step Functions
  • Collaborated with data teams to load curated data into Redshift / Snowflake / Iceberg for analytics
  • Implemented data quality checks using built-in validations or tools like Great Expectations / Deequ
  • Applied AWS security best practices (IAM roles, KMS encryption, secure data access)
  • Contributed to CI/CD pipelines for Glue job deployment using Git and IaC tools
  • Monitored pipelines using CloudWatch, ensuring reliability and quick incident resolution
  • Worked closely with stakeholders to define data contracts, SLAs, and business expectations

Key Skills: Data Engineer, AWS Glue, IAM, ETL, Athena, PySpark

Job Tags

Full time
