FHIR Data Pipes

View source on GitHub

FHIR Data Pipes is a set of ETL pipelines for transforming FHIR data into an SQL-capable format for building analytics services. FHIR Data Pipes is designed with horizontal scalability in mind and offers multiple deployment options.

The features include:

  • Apache Beam ETL pipelines to transform data from a FHIR source into an SQL-on-FHIR schema data warehouse, based on Apache Parquet files that can be queried using SQL (see Parquet-on-FHIR schema)

  • Controller Module for managing the FHIR Data Pipes Pipelines, integrating "full", "incremental", and "merger" pipelines together. Using the controller module you can schedule periodic incremental updates or use the Web Control Panel to start the pipeline manually.

  • Implementation of the SQL-on-FHIR-v2 specification making it possible to apply ViewDefinition resources to generate flat views within the pipelines (see View layer)

Go to developer documentation