Add PTM dataset support (addresses #81)#156
Conversation
|
@etherealsunshine thanks for implementing this! could you push something to the branch again? we just updated the GitHub actions to trigger the CI on pull requests coming from external forks |
|
hey @ncfrey thanks for the update! Just a heads up - one of the test failures (the AttributeError: 'LobsterCRMPLM' object has no attribute 'concept_names') is due to a dependency on another PR that still needs to be merged. I believe i have fixed the other issues :) |
great! i merged #155 so you can rebase and run the checks |
885cd7a to
f42fe0e
Compare
|
Hi @ncfrey, I think the CI is failing due to a GitHub Actions permissions issue with OIDC tokens for forks. I think its from the changes to trigger CI on external forks. Could you help resolve this workflow permission issue? Thanks! |
|
should resolve the failing test on main as well as add support for the dataset now. |
#162 should resolve this! |
|
@etherealsunshine this looks good to merge! |
* added wrapper for concept names * Add unit test for concept_names property * added support for PTM Dataset * Add unit test for concept_names property * added tests and integration for ptm datasets * Fix pooch.retrieve() call - add known_hash=None * updated column names * fixed column names to include token * added code to download the PTM Dataset * Fix code formatting issues (ruff) * Move PTM test to correct datasets directory and fix column names * Clean up commit history * added fixes for code formatting * Remove duplicate concept_names code * Fix typo in concept_names property (quick fix for failing tests on main) * Update test to match concepts_name property * Format code with ruff
* added wrapper for concept names * Add unit test for concept_names property * added support for PTM Dataset * Add unit test for concept_names property * added tests and integration for ptm datasets * Fix pooch.retrieve() call - add known_hash=None * updated column names * fixed column names to include token * added code to download the PTM Dataset * Fix code formatting issues (ruff) * Move PTM test to correct datasets directory and fix column names * Clean up commit history * added fixes for code formatting * Remove duplicate concept_names code * Fix typo in concept_names property (quick fix for failing tests on main) * Update test to match concepts_name property * Format code with ruff
Description
Adds support for Post-Translational Modification (PTM) datasets from the PTM-mamba paper, addressing issue #81.
What's added
PTMDatasetclass that auto-downloads PTM data from ZenodoFeatures
Usage
Type of Change
Testing
Checklist