Skip to content
/ ccflow Public

ccflow is a collection of tools for workflow configuration, orchestration, and dependency injection

License

Notifications You must be signed in to change notification settings

Point72/ccflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ccflow logo, 'ccflow' with letters in color

Build Status GitHub issues PyPI Version License

ccflow is a collection of tools for workflow configuration, orchestration, and dependency injection. It is intended to be flexible enough to handle diverse use cases, including data retrieval, validation, transformation, and loading (i.e. ETL workflows), model training, microservice configuration, and automated report generation.

The framework provides:

  • a way to manage hierarchical, strongly typed configurations and the relationships between them through composition
  • a way to associate user-defined functions with configurations, and in doing so, to define and name configurable workflow graphs
  • a way to manage dependency injection and inversion of control for objects in these graphs
  • flexibility in how to interact with configurations and workflows, including files/command line, native python/Jupyter notebook, Airflow/job scheduler, REST API, etc (in progress)

It heavily leverages pydantic, and users are expected to implement their own configuration and workflow building blocks by implementing pydantic models.

It also integrates closely with hydra for file-based configuration and command line interaction, but can also be used natively from Python without it.

This library was partially inspired by this blog post by Suneeta Mall (@suneeta-mall). We have taken these ideas a step further by introducing the concept of the ModelRegistry, which allows for the configs to be managed without hydra, and also allows us to implement dependency injection.

We aim to provide additional (and optional) tools for workflow orchestration on top of the configuration framework.

More information is available in our wiki

Installation

ccflow can be installed via pip or conda, the two primary package managers for the Python ecosystem.

To install ccflow via pip, run this command in your terminal:

pip install ccflow

To install ccflow via conda, run this command in your terminal:

conda install ccflow -c conda-forge

Community

  • Contribute to ccflow and help improve the project

License

This software is licensed under the Apache 2.0 license. See the LICENSE file for details.