Vv ensembl dev susmi #615

Peter-J-Freeman · 2024-05-07T09:17:42Z

No description provided.

This test set is conceptually the same as the existing C set, but twice as long, and uses transcripts that have a pair of n and c versions with the same within-transcript sequence and the same from-transcript-start coordinates for the section in question. This allows us to test the offset c-based coordinates against the rawer 1-based, non-offset n data. The set also includes more n+offset type coordinates, as opposed to the mainly n-offset coordinates in the original set, along with more multi-base repeats, which adds to the test coverage.

Although the variant position is still messed with, we no longer convert the variant into a pseudo-g type and back when we get an intronic coordinate. This patch also cleans up the use of offset, without specifiers, in variable and function names, which is ambiguous. In the process we fix an n type to c type coordinate conversion bug and prepare for the addition of 3' UTR handling.
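
As a rough illustration of the n type to c type conversion mentioned above, here is a minimal sketch that follows the general HGVS numbering rules (no c.0, `-` for 5' UTR, `*` for 3' UTR). The function name `n_to_c` and the parameters `cds_start`, `cds_end` and `intron_offset` are hypothetical and are not taken from this patch; the explicit `intron_offset` name simply reflects the point that a bare `offset` is ambiguous.

```python
def n_to_c(n_pos: int, cds_start: int, cds_end: int, intron_offset: int = 0) -> str:
    """Convert a 1-based n. position to a c. position string.

    cds_start and cds_end are the 1-based n. positions of the first and
    last coding bases (hypothetical inputs, not the project's API).
    """
    if n_pos < cds_start:
        # 5' UTR: the base just before the CDS is c.-1 (there is no c.0).
        base = str(n_pos - cds_start)
    elif n_pos <= cds_end:
        # Coding region: the first coding base is c.1.
        base = str(n_pos - cds_start + 1)
    else:
        # 3' UTR: the base just after the CDS is c.*1.
        base = f"*{n_pos - cds_end}"
    # Intronic offsets are appended with an explicit sign, e.g. 10-3 or 10+5.
    if intron_offset:
        base += f"{intron_offset:+d}"
    return base
```

With a CDS spanning n.11 to n.40, `n_to_c(10, 11, 40)` gives `-1` and `n_to_c(41, 11, 40)` gives `*1`.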

intronic_or_utr would sometimes store the UTR status, but this was actually unused. Instead it was mostly used to store the transcript when an intron was detected, as part of the handling for the now fixed abuse of the reference variable. The regular expression fix catches bad characters in a repeat that were previously missed if one good repeat character was included.
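
The class of regex bug described here can be sketched as follows. This is an assumption about the general failure mode, not the pattern used in the patch: a check that only searches for one valid base accepts a repeat unit containing bad characters, whereas matching the whole string does not. The names `loose_check` and `strict_check` and the `[ACGTU]` alphabet are hypothetical.

```python
import re

# Hypothetical alphabet for a repeat unit; the real code may allow more symbols.
VALID_REPEAT_UNIT = re.compile(r"[ACGTU]+")


def loose_check(unit: str) -> bool:
    """Passes as soon as any single valid base is found anywhere in the unit."""
    return re.search(r"[ACGTU]", unit) is not None


def strict_check(unit: str) -> bool:
    """Passes only if every character in the unit is a valid base."""
    return VALID_REPEAT_UNIT.fullmatch(unit) is not None
```

For the unit `CAX`, `loose_check` returns `True` because `C` is valid, while `strict_check` returns `False` because of the `X`.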

Also tidy up some logic left over from previous c<->n mapping methods now that we centralise it into separate functions. We also slightly adjust function naming and input logic, from get_range_from_single to get_range_from_single_or_start, to match actual usage.

Peter J. Freeman and others added 30 commits July 7, 2022 09:34

Merge pull request #389 from openvar/update_to_vvta

9e0313e

Update to vvta

Merge pull request #390 from openvar/vv_ensembl_develop_pete

caec3fc

commits that tackle what I saw for issue #387. It seems the requested …

Merge pull request #391 from openvar/vv_ensembl_develop_pete

7b77b02

Remove bad return statement for issue https://github.com/openvar/vari…

Remove ensembl not supported msg

9e4525e

Add transcript set when searching options

2602240

Add alt_aln_method to all t_to_g methods

56ceeec

Add alt_aln_method when searching

af3eb12

Tidy up Mixin code

5ad1528

Merge pull request #394 from openvar/vv_ensembl_develop

76daf7b

Vv ensembl develop

Optimise and fix search through options

2df0d59

Switch key order for postgres in config test

8d9a28a

Clean up ensembl url code

b4c9eeb

Sort grch37 and grch38 ensembl urls

2cdc808

Fix bug in ensembl test variant 2

d5b8cf0

Add ensembl urls to ensembl input test

b4cb8b1

Sort incorrect values in ensembl tests 4 and 5

8444080

Add alt_aln_method when getting hgvs_stash_t

f4fb451

Update wrong genome build error

692a5ac

Add ensembl test for wrong genome build

f5bf878

Pass in alt_aln_method to g_to_t methods

65ffe0c

Add specific wrong build msg for each build

d4713ca

Tidy up wrong build warning code

4849921

Include genome build in new version msg

1f2d93a

Merge pull request #400 from openvar/vv_ensembl_dev_s_working

40c6b49

Vv ensembl dev s working

Delete MANUAL.md

334ecb8

Rename MANUAL_UPDATED.md to MANUAL.md

05cc65c

Tidy up config

1fa9306

Merge pull request #401 from openvar/vv_ensembl

d310056

Vv ensembl

Hide warn for irrelevant transcripts

f11fe6e

Hide more recent version and not part of genome build warnings for irrelevant transcripts

Change build to grch38 in ensembl tests

af1ffcc

John-F-Wagstaff and others added 30 commits September 15, 2024 21:24

Fix underlying 1>0 based issue in expanded repeats

1ad174c

The underlying seq fetching code is 0 based, but much of the overlying code in the expanded repeat handling acts as if it is 1 based. Fix this.
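
A minimal sketch of the 1-based to 0-based adjustment described in this commit, assuming the fetcher takes a 0-based, half-open interval (the usual Python slice convention). `fetch_seq` and `fetch_hgvs_range` are hypothetical stand-ins, not functions from the repository.

```python
def fetch_seq(sequence: str, start_0: int, end_0: int) -> str:
    """Stand-in for a 0-based, half-open sequence fetcher (hypothetical)."""
    return sequence[start_0:end_0]


def fetch_hgvs_range(sequence: str, start_1: int, end_1: int) -> str:
    """Fetch the bases covered by a 1-based, inclusive HGVS-style range."""
    if start_1 < 1 or end_1 < start_1:
        raise ValueError("expected 1 <= start <= end")
    # Only the start shifts by one: the inclusive 1-based end is numerically
    # equal to the exclusive 0-based end.
    return fetch_seq(sequence, start_1 - 1, end_1)
```

For example, `fetch_hgvs_range("ACGTCAGCAGCAGTT", 5, 13)` returns the three `CAG` units at positions 5 to 13. Passing the 1-based start straight through would drop the first `C` and shift the repeat frame.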

Add genomic expanded repeat tests

e017b63

Both a few basic tests and a reverse genome -> transcript set, which is equivalent to a genomic version of those C tests with successful genomic mapping.

Add RefSeqGenomic expanded repeat tests

20a8604

Add LRG type tests for expanded repeat syntax

6a7bca3

Remove now unneeded check_transcript_type function

7ec09c3

We used to need to check the transcript type, but we now handle all ref types within the expanded repeat code, so this feature is no longer used.

Fix outstanding bugs

2f0e466

Add tests for 3' utr and over 5' end handling

b07d610

Also adjust test for get_range_from_single_pos to match the new name of get_range_from_single_or_start_pos.

Remove now unused function for variant splitting

5ca5926

We do not use this function any more in the current usage pattern.

Clean up input function, reduce regex usage

9410c37

Also some use of startswith instead of re.match in ref type checking.
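
To illustrate the startswith approach, here is a minimal sketch of prefix-based reference type checking. The function name `reference_type`, the returned labels and the exact prefix groups are assumptions for illustration, not the code in this commit.

```python
def reference_type(accession: str) -> str:
    """Classify a reference accession by its prefix (illustrative only)."""
    # str.startswith accepts a tuple, so several prefixes can be tested
    # in one call without compiling a regular expression.
    if accession.startswith(("NM_", "NR_", "ENST")):
        return "transcript"
    if accession.startswith(("NC_", "NG_", "NT_", "NW_")):
        return "genomic"
    if accession.startswith("LRG_"):
        return "lrg"
    return "unknown"
```

A fixed prefix test like this expresses the intent more directly than `re.match(r"^NM_", accession)` and avoids regex escaping mistakes.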

Merge pull request #642 from openvar/expanded_repeat_syntax

9b0014d

Updates to the expanded repeat code for simple repeats only so far

Merge pull request #649 from openvar/issue_645

4f361e5

Fix outstanding bugs

Merge branch 'vv_ensembl_dev_susmi' of https://github.com/openvar/var…

206402e

…iantValidator into vv_ensembl_dev_susmi

update dockerfiles to latest vvta and sr

a0eb76a

code changes that refer to issue #651 variants with N in the referenc…

6340024

…e and introns in r. descriptions as referred to in #545

Fixes that overcome NR transcripts with LOC based gene symbols in gen…

4e2e82e

…es2transcripts functionality

Fixes the mapping of NC_000009.12:g.92474742delinsATCA back to NM_017…

dbfb7b1

…680.6:c.153G>T in issue #651

Add tweaks to genes2transcripts to handle gene symbols that are updat…

033d948

…ing and HGNC genes with no transcript info openvar/rest_variantValidator#186 and also handle the longer deletions in #651

Update the code to accept intronic variants in transcripts with alter…

45f5d07

…nate alignments in patches vs the primary assembly. Issue #657

code that deals with protein references with nucleotide variant types

2570119

Update position added for Ter=

cf2cfe1

Update vdb version and test issue 87

36d4237

Changes to the code base to correct some unhandled descriptions e.g. …

b3692f8

…3 prime UTRs in uncertain positions for the LOVD paper

Expand code to handle expanded repeat syntax in allele descriptions

cc05e3c

add in code to deal with common HGVS early stage typos like double co…

9a7e4b2

…lons
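
One way such a double colon typo could be tidied before parsing is sketched below. The function name `collapse_repeated_colons` is hypothetical and this is not necessarily how the commit handles it.

```python
import re


def collapse_repeated_colons(description: str) -> str:
    """Collapse runs of two or more colons into a single colon."""
    return re.sub(r":{2,}", ":", description)
```

For example, `collapse_repeated_colons("NM_000088.3::c.589G>T")` returns `NM_000088.3:c.589G>T`, and a description with a single colon is returned unchanged.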

Additional code changes to handle failed variants reported by the LOV…

95d6591

…D team

Final commit before merge with parse bypass code

64b5132
