Tracker server library that previously powered LAION's distributed compute network for filtering Common Crawl with CLIP to produce the LAION-400M and LAION-5B datasets. The code has since been repurposed as a general-use multi-layer distributed compute tracker and job manager, with added support for a frontend web dashboard, user leaderboards, and up to 5 sequential stages of workers for each job.
- Client Repo: TheoCoombes/distcompute-client
- LAION-5B Paper: https://arxiv.org/abs/2210.08402
- LAION-5B Implementation (Client): TheoCoombes/crawlingathome
- LAION-5B Implementation (Server): TheoCoombes/crawlingathome-server
```shell
git clone https://github.com/TheoCoombes/Distributed-Compute-Tracker.git
cd Distributed-Compute-Tracker
pip install -r requirements.txt
```
- Redis Guide - follow steps 1-2.
- You may need to configure your Redis connection URL in `config.py` if you have changed any port bindings.
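For reference, a Redis connection URL in `config.py` typically takes the form below. This is a hypothetical excerpt — the variable name, port, and database index are assumptions; check `config.py` for the actual setting.

```python
# Hypothetical config.py excerpt — adjust the host/port to match any
# changed Redis port bindings; db 0 is the Redis default.
REDIS_CONN_URL = "redis://localhost:6379/0"
```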
- PostgreSQL Guide - follow steps 1-4, noting down the name you give your database.
- Install the required Python library for the database you are using (see link above).
- Configure your SQL connection in `config.py`, adding your database name to the `SQL_CONN_URL`.
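As a sketch, with a PostgreSQL database named `tracker_db`, the `SQL_CONN_URL` might look like the following. The credentials, host, and driver prefix are placeholders — the exact prefix depends on which Python SQL library you installed.

```python
# Hypothetical example value for config.py — replace the user, password,
# host, and database name with your own; the "postgresql://" prefix is an
# assumption based on the driver you installed.
SQL_CONN_URL = "postgresql://tracker_user:secret@localhost:5432/tracker_db"
```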
- Create a JSON file containing either a list of strings or a list of dicts, each holding job data (e.g. URLs / filenames to process), and run `init.py --json <file>` to set up the database.
- Alternatively, you can create a brace expansion for your initial job data, e.g. `init.py --brace "./data/file_{00..99}.tar"`.
- For more info, run `init.py --help`.
- WARNING: running `init.py` will reset your database, so ensure you make a backup of any previous data before running the script!
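For illustration, a small script like the one below could generate a valid jobs file in the list-of-dicts form described above. The `url` key and shard names are purely examples — any JSON-serialisable job data works, and a plain list of strings is also accepted.

```python
import json

# Build a hypothetical list of 100 jobs, one dict per job, each holding
# the data a stage-A worker needs (here, an example shard URL).
jobs = [
    {"url": f"https://example.com/shards/shard_{i:04d}.tar"}
    for i in range(100)
]

with open("jobs.json", "w") as f:
    json.dump(jobs, f)

# Then initialise the database with: init.py --json jobs.json
```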
- Open `config.py` and change `PROJECT_NAME` to a more suitable name for your project.
- Edit `STAGE_<N>` to add the names of each stage of your workflow. If the next stage is set to `None`, the job is marked as complete. Otherwise, workers operating at the next stage will receive the output of the current stage as an input.
- If you would like a linear `input -[worker]-> output` workflow, only enable `STAGE_A`.
- The default setting is the workflow previously used for the production of the LAION-5B dataset: CPU workers at stage A would download and store images + alt text from Common Crawl in tar files; GPU workers at stage B would then receive these tar files as input and filter the images using CLIP to create the final dataset (see paper).
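The stage settings described above might be sketched as follows. `PROJECT_NAME` and the `STAGE_<N>` names come from the steps above; the example values mirror the default LAION-5B-style workflow and are assumptions, not the repo's literal defaults.

```python
# Hypothetical config.py excerpt for a two-stage workflow:
PROJECT_NAME = "my-dataset"

STAGE_A = "CPU"   # stage A: download + store images/alt text in tar files
STAGE_B = "GPU"   # stage B: filter stage-A tar files with CLIP
STAGE_C = None    # None -> stage B is the final stage; jobs complete here
```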
- You can use either `gunicorn` or `uvicorn`. Previously, the LAION-5B production server used `uvicorn` with 12 worker processes, e.g.:
```shell
uvicorn main:app --host 0.0.0.0 --port 80 --workers 12
```
As stated in step 5 of the installation, you need to run the server directly using an ASGI server library of your choice:
```shell
uvicorn main:app --host 0.0.0.0 --port 80 --workers 12
```
- Runs the server through Uvicorn, using 12 worker processes.
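If you prefer `gunicorn`, ASGI apps are typically served through Uvicorn's worker class rather than gunicorn's default sync workers. A hedged equivalent of the command above (assuming the same `main:app` entry point) would be:

```shell
# Hypothetical gunicorn equivalent: -k selects uvicorn's ASGI worker
# class (requires the uvicorn package alongside gunicorn), -w sets the
# worker count, and -b binds the listen address.
gunicorn main:app -k uvicorn.workers.UvicornWorker -w 12 -b 0.0.0.0:80
```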