Dagster iconDagster

Dagster: Orchestrate development, production, and observation of data assets.

Open Source Alternative to:

Repository activity:

Stars11,501

Forks1,450

Open Issues2,243

Last commit1 day ago

License:

Apache-2.0

Languages:

Python
TypeScript
LookML
Dagster screenshot

Dagster is a cloud-native orchestration platform designed to manage the entire lifecycle of data assets. From development to production and observation, Dagster simplifies complex data workflows, enabling data engineers to efficiently build, monitor, and maintain their data pipelines. With a declarative programming model and best-in-class testability, Dagster integrates seamlessly with modern data stacks to provide a unified view of your data platform.

  • Software-Defined Assets: Manage your data assets with code, ensuring consistency and reproducibility.
  • First-Class Testing: Incorporate robust testing mechanisms to ensure the reliability and performance of your data pipelines.
  • Integrated Lineage and Observability: Track data lineage and monitor the health of your data assets with built-in observability tools.
  • Declarative Programming Model: Simplify the orchestration of complex workflows with a clear and concise programming model.
  • Run Timeline View: Monitor all your jobs in one place, providing a comprehensive overview of your data operations.
  • Run Details View: Pinpoint issues with surgical precision by zooming into individual runs.
  • Asset Context and Updates: View and update each asset's context, including materializations, lineage, owner, schema, schedule, and partitions, all in one place.
  • Consolidated Asset View: Access detailed information on each asset, such as freshness, status, schema, metadata, and dependencies, in a single, consolidated view.
  • Enterprise Features: Enjoy fully serverless or hybrid deployments, operational observability, data cataloging, and out-of-the-box CI/CD with Dagster+.

Dagster empowers data engineering teams by bringing software engineering best practices to data workflows. Its asset-oriented approach allows for scalability and flexibility, making it an essential tool for data platforms in innovative organizations worldwide.

Share: