<!-- markdownlint-disable MD033 MD041--> <p align="center"> <h1 style="font-size:80px; font-weight: 800;" align="center">L U N G - S A R G</h1> <p align="center">The Open Data Platform for Sustainable, Accessible Lung Radiogenomics</p> </p> <div align="center"> <img alt="GitHub" src="https://img.shields.io/github/license/open-radiogenomics/lung-sarg?style=flat-square"> <img alt="GitHub Workflow Status" src="https://img.shields.io/github/actions/workflow/status/open-radiogenomics/lung-sarg/ci.yml?style=flat-square"> <img alt="GitHub Repo stars" src="https://img.shields.io/github/stars/open-radiogenomics/lung-sarg?style=flat-square"> </div> <br>

Lung-SARG is a fully open-source, local-first platform that improves how communities collaborate on open data to diagnose lung cancer and to conduct epidemiological studies of local populations in low- and middle-income countries.

Tip: Datasets generated by this project are ready to explore and consume on Hugging Face.

Check them out!

💡 Principles

Overview

Lung SARG dataflow.

⚙️ Setup and execution

🐍 Pixi

You can install all dependencies into a reproducible software environment with pixi. To do so, install pixi, clone the repository, and run the following command from the repository root.

pixi install -a

To see all tasks available:

pixi task list

To start the Dagster UI and access it locally:

pixi run dev
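
Once the dev task is running, it can be handy to confirm that the UI is actually listening before opening a browser. The following is a minimal sketch, not part of the project: the helper name `ui_is_reachable` is hypothetical, and the port is an assumption (3000 is the default for `dagster dev`, but this README does not state which port `pixi run dev` uses).

```python
import socket


def ui_is_reachable(host: str = "127.0.0.1", port: int = 3000, timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port.

    Hypothetical helper. The default port 3000 is an assumption based on
    the `dagster dev` default; adjust it if the UI is served elsewhere.
    """
    try:
        # A successful TCP handshake is enough to know a server is listening.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    print("Dagster UI reachable:", ui_is_reachable())
```

This only checks that a TCP port is open; it does not verify that the process behind it is Dagster.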

🧬 Run on sample data

In the Dagster UI, click

Overview -> Jobs -> stage_idc_nsclc_radiogenomic_samples -> Materialize all

Materialize staging of samples

Observe what happens in the Overview, Runs, and Assets pages of the Dagster UI, and the content in the lung-sarg/data directory.
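
To get a quick overview of what the run wrote to disk, a short script can count the files in the data directory by extension. This is only a sketch under stated assumptions: the helper name `summarize_data_dir` is hypothetical, the script is assumed to run from the repository root so that `data` resolves to lung-sarg/data, and no particular file types are assumed for the output.

```python
from collections import Counter
from pathlib import Path


def summarize_data_dir(data_dir: str = "data") -> Counter:
    """Count files under data_dir (recursively), grouped by file extension.

    Hypothetical helper; the default path assumes the current working
    directory is the repository root.
    """
    root = Path(data_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"Data directory not found: {root}")

    counts: Counter = Counter()
    for path in root.rglob("*"):
        if path.is_file():
            # Files without a suffix are grouped under a readable label.
            counts[path.suffix or "(no extension)"] += 1
    return counts


if __name__ == "__main__":
    try:
        for suffix, n in sorted(summarize_data_dir().items()):
            print(f"{suffix}: {n}")
    except FileNotFoundError as err:
        print(err)
```

Running it before and after materializing the job shows which kinds of files the staging step added.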

🎯 Motivation

This project started after thinking about what an Open Data Protocol could look like!

👏 Acknowledgements