Formalizing ETLT and ELTL Design Patterns and Proposing Enhanced Variants: A Systematic Framework for Modern Data Engineering
Chiara Rucco, Motaz Saad, Antonella Longo

TL;DR
This paper formalizes ETLT and ELTL data pipeline design patterns, introduces enhanced variants with governance and observability features, and provides a systematic framework for building scalable, trustworthy data engineering solutions.
Contribution
It formalizes ETLT and ELTL as reusable design patterns and proposes enhanced variants with explicit contracts, versioning, and monitoring for improved governance and quality.
Findings
Formalization of ETLT and ELTL as design patterns
Introduction of ETLT++ and ELTL++ with enhanced features
Framework enabling auditable, scalable data pipelines
Abstract
Traditional ETL and ELT design patterns struggle to meet modern requirements of scalability, governance, and real-time data processing. Hybrid approaches such as ETLT (Extract-Transform-Load-Transform) and ELTL (Extract-Load-Transform-Load) are already used in practice, but the literature lacks best practices and formal recognition of these approaches as design patterns. This paper formalizes ETLT and ELTL as reusable design patterns by codifying implicit best practices and introduces enhanced variants, ETLT++ and ELTL++, to address persistent gaps in governance, quality assurance, and observability. We define ETLT and ELTL patterns systematically within a design pattern framework, outlining their structure, trade-offs, and use cases. Building on this foundation, we extend them into ETLT++ and ELTL++ by embedding explicit contracts, versioning, semantic curation, and continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Database Systems and Queries · Data Quality and Management
