You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks @jjgao. NM_018263.4 has 1407aa while NM_018263.5 has 1435aa. Since DMP is using the older one, that's what we are sticking with in isoform_overrides_at_mskcc. Lemme know what led you to this, and we can find an alternative solution.
(Update 8/7/2017 - The info above on aa lengths is incorrect.)
I'm gonna close this request. It exposed a larger problem that versioning of RefSeq isoforms needs to be implemented across the CMO, before we do another sync up.
(Update 8/7/2017 - This comment is also incorrect. See below for details on what really went wrong.)
Per my work notes matching ENST IDs to Refseq IDs used by MSKCC's clinical bioinformatics, an ENST ID for NM_018263.4 couldn't be automatically extracted via NCBI's CCDS. So I had manually looked it up in Ensembl's release 75 archives (the latest for GRCh37 loci, viewable at feb2014.archive.ensembl.org), where the xref_refseq mapping was incorrect, as JJ pointed out in the first comment. I went back and reviewed the 8 other genes for which I had manually looked up matching ENST isoforms, and found similar mistakes in 3 other genes. Updated those too. Will merge shortly.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Looks like the mapping from Ensembl is incorrect. The RefSeq sequence has 1435 amino acids instead of 1407.