Lead Data Engineer Job at WorkHQ, Los Angeles, CA

RXNRZ21nZW5neW5hUG51N0Z3LzhZMVFGUUE9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote work, Shift work,

Similar Jobs

Eligo Energy, LLC

Customer Service Tier 2 Agent Job at Eligo Energy, LLC

 ...to make an impact as part of a dynamic call center team. We offer competitive hourly compensation...  ...in a call center environment, you will work to support commercial and residential...  ...: The ability to work from home in a virtual work environment High speed internet... 

Evolent

Senior Analyst, Client Analytics Job at Evolent

 ...health plans and providers to achieve better outcomes for people with most complex and costly health conditions. Working across...  ...Stay for the culture. What You'll Be Doing: Senior Analyst, Healthcare Analytics Consultant The Client Analytics team provides a unique... 

ERSG Ltd

Wind Turbine Technician Job at ERSG Ltd

 ...Job Description This is a site-based Wind Technician opening for a wind farm site near Harlingen, Texas. This is a local position...  ...will be offered to non-local candidates. Although previous wind turbine maintenance experience is preferred, this opportunity is also available... 

Greysteel

Real Estate Agent / Associate (Capital Markets) Job at Greysteel

 ...private, middle market, and institutional real estate investors. Our collaborative...  ...market for each engagement, spanning all commercial property investment activities, from...  ...a commission-only role.Seniority level Seniority level Entry levelEmployment type Employment... 

Red Oak Technologies

DevOps Engineer Job at Red Oak Technologies

 ...onsite 3 days a week: Tuesday, Wednesday, and Thursday).Position Overview Our client, a technology giant, is looking for a Senior DevOps Engineer to join their growing team.Location Culver City, CAResponsibilities: 8+ years of experience in a senior DevOps or lead role...