Conversion bruker data #148

ypriverol · 2023-09-04T10:12:58Z

Added the option extension_convert, reflecting the discussion in [Pitch] Partial keep raw in OpenMS #145 and the quantms PR Feature/bruker data quantms#275.
Version increased to 0.23.
Minor changes to the tests.

jpfeuffer · 2023-09-04T14:51:54Z

Can you add a "--version" command so we don't have to hardcode the version in workflows every time and everywhere?
Would be awesome.

ypriverol · 2023-09-04T15:06:08Z

parse_sdrf.py --version gives the version of the tool.

jpfeuffer · 2023-09-05T09:52:14Z

How can you specify that both raw and d are converted to mzml?

ypriverol · 2023-09-05T09:57:51Z

You mean a conversion that involves more than one file type? Right now you can only convert one file type to another; these are allowed: d:mzML, raw:mzML, d:d, etc.

jpfeuffer · 2023-09-05T10:39:34Z

What if you have multiple file types in the initial design?

ypriverol · 2023-09-05T10:44:37Z

We don't allow that in quantms or even OpenMS. I guess it is a use case for the far future.

jpfeuffer · 2023-09-05T10:57:32Z

It is allowed in quantms.
I ran pipelines with raw and mzml before.

jpfeuffer · 2023-09-05T10:59:16Z

If you only allow one type of raw file, why a mapping at all?
Then you just need an option "out_type".

ypriverol · 2023-09-05T10:59:57Z

Ok, what will be the expected behaviour, something like:

raw:mzml
d:d

Which means: convert the raw to mzml and keep the d as d?

jpfeuffer · 2023-09-05T11:00:16Z

Yes exactly.

ypriverol · 2023-09-05T11:04:27Z

What should happen if the user gives contradictory changes, like d:d and d:raw ? Error?

ypriverol · 2023-09-05T11:11:05Z

I will implement the following behavior: --extension_convert 'raw:mzml, d:mzml'. If multiple conversion options are provided for the same file type (e.g. raw:mzml, raw:raw), the tool will throw an error.
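
For readers following the thread, the behavior described above could be sketched roughly as follows. This is not the code from the PR: the function name parse_extension_convert is hypothetical, and the choices to tolerate identical duplicates and to raise ValueError are assumptions made for illustration.

```python
from typing import Dict


def parse_extension_convert(spec: str) -> Dict[str, str]:
    """Parse a spec such as 'raw:mzml, d:mzml' into {'raw': 'mzml', 'd': 'mzml'}.

    Hypothetical helper; the real option handling in the tool may differ.
    """
    mapping: Dict[str, str] = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue  # assumption: ignore empty entries such as a trailing comma
        source, sep, target = item.partition(":")
        source, target = source.strip(), target.strip()
        if not sep or not source or not target:
            raise ValueError(f"Expected 'source:target', got {item!r}")
        # Contradictory entries for one source type (e.g. raw:mzml and raw:raw) are an error.
        if source in mapping and mapping[source] != target:
            raise ValueError(
                f"Conflicting conversions for {source!r}: {mapping[source]!r} and {target!r}"
            )
        mapping[source] = target
    return mapping
```

With this sketch, 'raw:mzml, d:d' yields one entry per input type, so a design mixing raw and d files can convert the former and keep the latter.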

ypriverol · 2023-09-05T11:47:33Z

@jpfeuffer the following commit allows multiple file conversions: 9c304a7

fabianegli

Good work. Most comments can be safely ignored, but the tests should all use asserts.

(and please excuse my post merge review...)

fabianegli · 2023-09-08T15:38:03Z

sdrf_pipelines/tests/test_sdrfchecker.py

@@ -12,17 +12,28 @@ def test_validate_srdf():
    runner = CliRunner()
    result = runner.invoke(cli, ["validate-sdrf", "--sdrf_file", "testdata/PXD000288.sdrf.tsv", "--check_ms"])

-    print(result.output)
-    assert "ERROR" not in result.output
+    print("validate sdrf " + result.output)


Any particular reason to not use an assert statement here instead of the print? Just reading this test, I don't know what would count as a test failure. Or success. There might be an

assert "ERROR" not in result.output

missing?

fabianegli · 2023-09-08T15:41:18Z

sdrf_pipelines/tests/test_unimod.py

+def test_search_mods_by_accession():
+    unimod = UnimodDatabase()
+    ptm = unimod.get_by_accession("UNIMOD:21")
+    print(ptm.get_name())
+
+
+def test_search_mods_by_keyword():
+    unimod = UnimodDatabase()
+    ptms = unimod.search_mods_by_keyword("Phospho")
+    for ptm in ptms:
+        print(ptm.to_str())


Again, tests should have an assert. Without one they still require human interpretation; the idea is to have the tests automated, with as little human interpretation/interaction as possible.
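
To illustrate the point, here is a minimal sketch of what assert-based versions of these two tests might look like. It does not use the real UnimodDatabase: StubUnimodDatabase and StubMod are hypothetical in-memory stand-ins so the example runs on its own, and the attribute name `name` is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class StubMod:
    """Hypothetical modification record used only in this sketch."""
    accession: str
    name: str


class StubUnimodDatabase:
    """Hypothetical stand-in for UnimodDatabase with two hard-coded entries."""

    def __init__(self) -> None:
        self._mods: Dict[str, StubMod] = {
            "UNIMOD:21": StubMod("UNIMOD:21", "Phospho"),
            "UNIMOD:35": StubMod("UNIMOD:35", "Oxidation"),
        }

    def get_by_accession(self, accession: str) -> StubMod:
        return self._mods[accession]

    def search_mods_by_keyword(self, keyword: str) -> List[StubMod]:
        return [m for m in self._mods.values() if keyword.lower() in m.name.lower()]


def test_search_mods_by_accession():
    unimod = StubUnimodDatabase()
    ptm = unimod.get_by_accession("UNIMOD:21")
    # The assert states what counts as success; no one has to read printed output.
    assert ptm.name == "Phospho"


def test_search_mods_by_keyword():
    unimod = StubUnimodDatabase()
    ptms = unimod.search_mods_by_keyword("Phospho")
    assert ptms, "expected at least one match"
    assert all("phospho" in ptm.name.lower() for ptm in ptms)
```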

fabianegli · 2023-09-08T16:30:43Z

sdrf_pipelines/sdrf_merge/add_data_analysis_param.py

@@ -37,7 +37,7 @@ def verify_content(pname, pvalue, ptype):
    #            exit("ERROR: " + pname + " needs to be a numeric value!!")
    elif ptype == "class":
        not_matching = [x for x in pvalue.split(",") if x not in p["value"]]
-        if not_matching != []:
+        if len(not_matching) != 0:


Sidenote: This could be just if not_matching:
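
A small sketch of why the shorter form is equivalent for lists: an empty list is falsy in Python, so all three spellings agree. The helper unmatched_values and the allowed-values list are hypothetical, standing in for the comprehension over p["value"] in the diff.

```python
def unmatched_values(pvalue, allowed):
    """Return the comma-separated entries of pvalue that are not in allowed."""
    return [x for x in pvalue.split(",") if x not in allowed]


for pvalue in ("a,b", "a,z"):
    not_matching = unmatched_values(pvalue, ["a", "b"])
    # For a list, these three conditions always have the same truth value.
    assert (not_matching != []) == (len(not_matching) != 0) == bool(not_matching)
```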

fabianegli · 2023-09-08T16:30:50Z

sdrf_pipelines/sdrf_merge/add_data_analysis_param.py

@@ -98,7 +98,7 @@ def add_ptms(mods, pname, mod_columns):
        modname = tmod[0]
        modpos = tmod[1]
        found = [x for x in unimod.modifications if modname == x.get_name()]
-        if found == []:
+        if len(found) == 0:


Sidenote: I would probably have gone with if not found.

fabianegli · 2023-09-08T17:10:45Z

sdrf_pipelines/openms/unimod.py

@@ -35,6 +41,15 @@ def get_name(self):
    def get_accession(self):
        return self._ontology_term.get_accession()

+    def get_delta_mono_mass(self):


Another option would be to make them properties with the "@property" decorator. That way the "get_" prefix could be left out and the attributes would still have some protection against overwriting.
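
A minimal sketch of that suggestion, assuming a simplified class: Modification here is a hypothetical stand-in, not the class in sdrf_pipelines/openms/unimod.py, and the mass value is only an illustrative number.

```python
class Modification:
    """Hypothetical class showing read-only attributes via @property."""

    def __init__(self, accession: str, delta_mono_mass: float) -> None:
        self._accession = accession
        self._delta_mono_mass = delta_mono_mass

    @property
    def accession(self) -> str:
        return self._accession

    @property
    def delta_mono_mass(self) -> float:
        return self._delta_mono_mass


mod = Modification("UNIMOD:21", 79.966331)
print(mod.accession, mod.delta_mono_mass)  # attribute-style access, no get_ prefix

try:
    mod.delta_mono_mass = 0.0  # no setter is defined, so assignment is rejected
except AttributeError:
    print("delta_mono_mass is read-only")
```

The underlying `_delta_mono_mass` can still be reassigned directly, so this is protection against accidental overwriting rather than strict immutability.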

ypriverol added 2 commits September 4, 2023 11:09

option --extension_convert added

164555a

increase version

099a598

ypriverol requested review from fabianegli and jpfeuffer September 4, 2023 10:13

ypriverol added 10 commits September 4, 2023 11:16

run black in openms.py

906bbf8

run isort in openms.py

3386cae

run isort/black in parse_sdrf.py

43efe9d

run isort/black in test_sdrfchecker.py

7729d9a

run isort recursive

9f780be

run isort recursive

5b83d0b

black fixed

a91995c

black fixed

975a2a7

remove keep raw option.

29947dd

remove keep raw option.

39cc725

ypriverol linked an issue Sep 4, 2023 that may be closed by this pull request

[Pitch] Partial keep raw in OpenMS #145

Closed

ypriverol added 6 commits September 4, 2023 14:52

fix test

37259e6

fix test

0f7d2cd

change test conversion

37fb48a

change test conversion

e3fd298

change test conversion

eda8e3f

added functions for unimod.py

47d1af6

ypriverol mentioned this pull request Sep 4, 2023

Why are these elements private? #141

Closed

ypriverol linked an issue Sep 4, 2023 that may be closed by this pull request

Why are these elements private? #141

Closed

ypriverol mentioned this pull request Sep 4, 2023

[SECURITY] Move from xml to defusedxml for XML parsing #119

Closed

ypriverol linked an issue Sep 4, 2023 that may be closed by this pull request

[SECURITY] Move from xml to defusedxml for XML parsing #119

Closed

changes in unimod parser

f5c91d6

black updated

6264521

ypriverol mentioned this pull request Sep 4, 2023

[BUG] openms/unimod.py #130

Closed

ypriverol linked an issue Sep 4, 2023 that may be closed by this pull request

[BUG] openms/unimod.py #130

Closed

ypriverol added 9 commits September 4, 2023 16:12

minor changes in test

921b6ea

minor changes in test

107efbd

minor changes in test

59f7924

minor changes in test

ad25b75

Merge branch 'master' of https://github.com/bigbio/sdrf-pipelines int…

0033bec

…o conversion_bruker_data

minor changes in test

2e04772

minor changes in test

515ed72

isort fixed

0782834

isort fixed

ddfce63

ypriverol requested a review from WangHong007 September 5, 2023 08:19

WangHong007 approved these changes Sep 5, 2023

ypriverol added 2 commits September 5, 2023 12:44

allow multiple options for --extension_convert 'raw:mzml, d:mzml'

2edb41c

allow multiple options for --extension_convert raw:mzml,d:mzml

9c304a7

ypriverol removed the request for review from fabianegli September 5, 2023 14:03

jpfeuffer approved these changes Sep 5, 2023

ypriverol merged commit 03d8e8b into bigbio:master Sep 5, 2023

fabianegli reviewed Sep 8, 2023
