An efficient storage and compute engine for both on-prem and cloud-native data analytics.
-
Updated
Sep 8, 2025 - Java
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
A powerful open source data warehouse system
ElasticFlow(伊塔)是一个开源弹性流数据交换系统,支持在任意类型数据端之间通过简单配置就可以建立可计算的弹性流管道,并进行定时、定量、高并发、多类型的交换数据服务。系统可应用于数据交换、通用搜索引擎、数据发布服务、数据仓库等项目。
BioDWH2 is an easy-to-use, automated, graph-based data warehouse and mapping tool for bioinformatics and medical informatics.
LogUnify is a schema-centric service that provides structured application event logging and seamless integration with data warehouses such as BigQuery for easy storage and analysis of event data.
Platform Extension Framework (PXF) for Apache Cloudberry (Incubating)
Purpose-built data connectors for Google CDAP data pipelines
This warehouse is made for storing and studying insurance claims data for vehicles serviced at designated branches.
🏥 Public Health Data Warehouse using FHIR and Kibana
A powerful open source data warehouse system
The METRO DW prototype uses Mesh Join & Star Schema for sales, customer & inventory data analysis. Implemented in SQL & Java for fast, accurate, & consistent data retrieval. Offers valuable insights & can be queried with standard BI tools.
A complete end-to-end project for building a Data Warehouse using IMDb data with Talend for ETL and Power BI for insightful visualizations. Includes a star schema, optimized database, and interactive dashboards.
distributed system built in Java that will run on two Google Cloud Platform Linux virtual
This repository comprises the design, implementation, and analysis of a near real-time data warehouse prototype for an electronics business chain, utilising a multi-threaded Extract, Transform, Load (ETL) pipeline leveraging the efficient HYBRIDJOIN algorithm implemented with Java and MySQL on customer sales data.
A robust near real-time retail data warehouse system leveraging Java, MySQL, and MeshJoin for efficient ETL, star schema design, and actionable OLAP insights.
This is the code base for MedicMine, a data warehouse system based on InterMine and used in MTGD, the Medicago truncatula Genome Database.
Official Java client for SlicingDice, Data Warehouse and Analytics Database as a Service.
Pentaho Data Integration ( ETL ) a.k.a Kettle
Add a description, image, and links to the data-warehouse topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouse topic, visit your repo's landing page and select "manage topics."