
PCFNet

arXiv

This repository is the official implementation of PCFNet in the paper "Mining for Protoclusters at $z\sim4$ from Photometric Datasets with Deep Learning" (Takeda et al. 2024).

Demonstration of PCFNet with mock data

Open In Colab

You can find a demonstration of PCFNet with mock data in the notebook example/mock_demonstration.ipynb.

Usage

Clone this repository and follow the instructions below.

cd your_workdir
git clone git@github.com:YoshihiroTakeda/PCFNet

1. Setup

We recommend using a virtual environment to run the code.

1.1 VSCode & Docker

The easiest way to set up the environment is to use VSCode with Docker.

  1. Install VSCode & Docker
  2. Install the Remote - Containers extension in VSCode
  3. Open the project folder (your_workdir/pcfnet) in VSCode
  4. Set up for the docker environment
    • To make the user ID and group ID consistent between the host and the container, execute the make_env4docker.sh script in your terminal. A .env file will then be created.
    bash make_env4docker.sh
  5. Click on the button in the bottom left corner of the VSCode window and select "Reopen in Container"
    • This automatically builds the Docker image and starts the container; it may take a while the first time.

1.2 Pip

You can also install PCFNet as a library in your environment with pip; required packages are installed automatically.

python -m pip install -e ".[all]"
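
After installation, a quick sanity check is to import the package from Python. A minimal sketch, assuming the import name matches the repository name (pcfnet):

import pcfnet  # the import name "pcfnet" is an assumption based on the repository name
print(pcfnet.__file__)  # confirms which installation is being picked up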

2. Data Preparation

  1. Prepare the data for training and prediction

    • Simulation data (PCcone): Please contact the authors of Araya-Araya et al. (2021). You may also need halo information from the Millennium Simulation to label protoclusters.
    • Observation data: Please obtain the observational data and random-points data from the HSC-SSP data release.
    • Depth information: Please prepare the $5\sigma$ depth information for the PCcone in YAML format (a minimal loading sketch follows this list).

    ※ If you want to use your own data, prepare it in the same format as the simulation and observation data above, and write conversion code (like preprocess.py) to produce the format PCFNet expects. Carefully check the consistency between the simulation and the observations.

  2. Place the data in the data directory. For example,

    • data/sim/ for simulation data
    • data/obs/ for observation data
    • data/depth/limit_mag5.yaml for depth information
  3. Modify the configuration file config/PCFNet_configure.yaml to match the data paths. You may also need additional modifications for your computational environment (batch size, number of iterations, etc.). If you have a WandB account, you can also set up the WandB integration (see 3.1).
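
As the loading sketch referenced in the depth-information item above, here is a minimal example of reading data/depth/limit_mag5.yaml. The flat band-to-depth structure and the band names are assumptions (illustrative HSC broad bands), not a specification of the file format; match them to what preprocess.py expects.

import yaml

# Illustrative only: read the 5-sigma depth file referenced above.
with open("data/depth/limit_mag5.yaml") as f:
    limit_mag5 = yaml.safe_load(f)

# e.g. a mapping like {"g": 26.5, "r": 26.0, "i": 25.8, "z": 25.2, "y": 24.5} (assumed)
for band, depth in limit_mag5.items():
    print(band, depth)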

3. Data Initialization, Training, and Prediction

Make sure that the configuration file config/PCFNet_configure.yaml is correctly set up, then run the following commands in the terminal.

python src/preprocess_obs.py --config config/PCFNet_configure.yaml
python src/preprocess.py --config config/PCFNet_configure.yaml
python src/train.py --config config/PCFNet_configure.yaml
python src/predict_obs.py --config config/PCFNet_configure.yaml

### or...
## modify src/run_all.sh as needed and run the following command
src/run_all.sh

3.1 [Optional] WandB integration

You can use WandB to monitor the training process. Turn on the wandb flag in the configuration file and log in to WandB.

wandb login

To plot the PR curve per epoch, you can use the custom chart wandb_custom/pr_curve.json. Select custom chart in the WandB app and register the JSON file as a new Vega-Lite chart. The table in the query should be set to HistoryTable, and Other Settings -> Show step selector should be turned on. You can then view the PR curve per epoch with the slider that appears after clicking the gear button.
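
If you write your own training loop, the following is a minimal sketch of logging per-epoch precision-recall data to WandB so that a HistoryTable-based custom chart like the one above can read it. The key name pr_curve, the column names, and the use of scikit-learn here are assumptions for illustration, not the repository's actual logging code.

import numpy as np
import wandb
from sklearn.metrics import precision_recall_curve

def log_pr_curve(y_true: np.ndarray, y_score: np.ndarray, epoch: int) -> None:
    # Compute the PR curve for this epoch and log it as a table in history,
    # together with the epoch number used by the step selector.
    precision, recall, _ = precision_recall_curve(y_true, y_score)
    table = wandb.Table(columns=["recall", "precision"],
                        data=[[float(r), float(p)] for r, p in zip(recall, precision)])
    wandb.log({"pr_curve": table, "epoch": epoch})

# Example usage inside a validation loop (after wandb.init(project="PCFNet")):
# log_pr_curve(val_labels, val_scores, epoch)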

3.2 [Optional] TensorBoard

You can also use TensorBoard to monitor the training process. The log files are saved in the directory specified in the configuration file.
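
If TensorBoard is installed in your environment, you can launch the dashboard by pointing it at that directory; <log_dir> below is only a placeholder for the path set in your configuration file.

tensorboard --logdir <log_dir>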

3.3 [Optional] GPU

If you have a GPU, you can use it by setting the --device option on the command line. The default is cuda:0.

python src/train.py --config config/PCFNet_configure.yaml --device cuda:0

※ When pretraining the MDN model, the GPU setting is controlled by the predevice option in the configuration file.

4. Results

You can quickly view the results in the Jupyter notebook example/check_result.ipynb.

References

  • Araya-Araya et al. 2021, MNRAS, 504, 5054.
  • Aihara et al. 2022, PASJ, 74, 247.

Citation

If you use this code for your research, please cite our paper.

@article{Takeda2024,
  title={Mining for Protoclusters at $z\sim4$ from photometric datasets with Deep Learning},
  author = {{Takeda}, Yoshihiro and {Kashikawa}, Nobunari and {Ito}, Kei and {Toshikawa}, Jun and {Momose}, Rieko and {Fujiwara}, Kent and {Liang}, Yongming and {Ishimoto}, Rikako and {Yoshioka}, Takehiro and {Arita}, Junya and {Kubo}, Mariko and {Uchiyama}, Hisakazu},
  year={2024},
  journal={ApJ},
  volume = {977},
  number = {1},
  pages = {81},
  url={http://doi.org/10.3847/1538-4357/ad8a67},
  adsurl = {https://ui.adsabs.harvard.edu/abs/2024arXiv241111956T}
}
