Data & AI • REF: TH-DAI-002

Data Engineer - Health Systems Pipeline

Austin, TX • Hybrid • Full-time
Location: Austin, TX
Work Mode: Hybrid
Department: Data & AI
Employment Type: Full-time
Reference ID: TH-DAI-002
Date Posted: February 3, 2026

About This Role

Tile Health processes millions of clinical events daily from dozens of health system partners, each with unique data formats, delivery mechanisms, and quality characteristics. The Data Engineer for Health Systems Pipelines will own the ingestion layer, designing scalable, fault-tolerant pipelines that normalize heterogeneous health data into our unified clinical data model. This is a foundational infrastructure role with direct impact on every downstream analytics and AI capability.

What You'll Do

  • Design and implement data ingestion pipelines for HL7v2, FHIR, X12 claims, and custom flat-file feeds from health system partners
  • Build data quality validation layers that detect schema drift, missing fields, and out-of-range clinical values in real time
  • Optimize pipeline throughput and latency for high-volume ADT and lab result feeds using Apache Kafka and Apache Spark
  • Develop and maintain a clinical data model that normalizes disparate source schemas into a consistent analytical layer
  • Collaborate with the interoperability team to onboard new data sources, documenting mapping specifications and transformation logic
  • Implement monitoring and alerting systems for pipeline health, data freshness, and processing errors

What We're Looking For

  • 4+ years of data engineering experience with production-scale streaming and batch pipelines
  • Strong proficiency in Python or Scala with hands-on experience in Apache Spark, Kafka, or Apache Flink
  • Experience with cloud data platforms (AWS Glue, Redshift, Snowflake, or Databricks)
  • Familiarity with healthcare data formats including HL7v2, FHIR, C-CDA, and X12 837/835
  • Solid understanding of data modeling principles for analytical workloads (star schema, slowly changing dimensions)
  • Experience operating in a HIPAA-compliant environment with proper PHI handling controls

Nice to Have

  • Experience with dbt for data transformation and testing
  • Background in healthcare revenue cycle or clinical data warehousing
  • Familiarity with Terraform or Pulumi for infrastructure-as-code