Add MeSH data #507

suhana13 · 2021-09-17T14:58:54Z

No description provided.

spiekos · 2022-06-06T18:34:31Z

This is the documentation for the Medical Subject Headings (MeSH) import.

scripts/biomedical/mesh/README.md

scripts/biomedical/mesh/format_mesh.py

pradh · 2022-06-06T22:47:39Z

scripts/biomedical/mesh/format_mesh.py

+from xml.etree.ElementTree import parse
+
+
+def format_mesh_xml(mesh_xml):


I've not validated the logic of the Python code. I'm assuming @spiekos will do that :)

spiekos

The text values are not properly denoted by including them in quotes. Anytime there is a potential to be a comma in a text value please include the additional brackets of /"<my_text_value>/" around your value so that the commas in the value are not inappropriately parsed. The Descriptor output file contains a lot of duplicate info: please simplify and remove.

Updated mesh_record.tmcf and mesh_pubchem.tmcf to have correct mappings. There is no need to have descriptor_ID information in this file as we are only mapping to ChemicalCompounds to MeSHRecords.

spiekos

Tests missing for mesh_record.csv and mesh_pubchem.csv. Please add.

scripts/biomedical/mesh/mesh_test.py

scripts/biomedical/mesh/README.md

spiekos · 2022-08-24T19:59:32Z

scripts/biomedical/mesh/README.md

+
+In order to run the script [`format_mesh.py`](format_mesh.py), the user requires the `mesh.xml` file, which spits out four different 
+csv files, each relating to descriptor, concept, qualifier and term.
+In order to run the script [`format_mesh_record.py`](format_mesh_record.py), the user requires the `mesh_record.xml` file and the 


Please expand a little more about how you use the pubchem file to establish the mapping between the MeSHRecord and it's corresponding ChemicalCompound. A couple of sentences explicitly stating the goal and how it was accomplished is sufficient.

spiekos

This is close. Just. a few more comments about the tests that need to be resolved.

spiekos · 2022-09-20T04:27:56Z

scripts/biomedical/mesh/unit-tests/mesh_concept_test.csv

@@ -0,0 +1,18 @@
+DescriptorID,ConceptID,ConceptName,ScopeNote,DateCreated,DateRevised,DateEstablished,Concept_dcid,Descriptor_dcid
+D000001,M0000001,"""Calcimycin""","""An ionophorous, polyether antibiotic from Streptomyces chartreusensis. It binds and transports CALCIUM and other divalent cations across membranes and uncouples oxidative phosphorylation while inhibiting ATPase of rat liver mitochondria. The substance is used mostly as a biochemical tool to study the role of divalent cations in various biological systems.""",1974-11-19,2016-05-27,1984-01-01,bio/M0000001,bio/D000001


None of these test.csv files are formatted correctly - all of the quotes around the text values is incorrect. Please update your test files so they actually reflect the final output files. This needs to be done for all .csv test files.

spiekos · 2022-09-20T04:35:39Z

scripts/biomedical/mesh/mesh_record_test.py

+    def test_main(self):
+        """Test in the main function"""
+        # Read in the expected output files into pandas dataframes
+        df1_expected = pd.read_csv('unit-tests/mesh_record_test.csv')


Is there a reason why these test files are so big? Can you please limit it to ~10 entities for each? They currently are not viewable on GitHub.

spiekos

Updated mesh_pharma_concept.tmcf so that it's schema is mapping to the correct type of node. The node type that this data is referring to is a MeSH Supplementary Record.

Scripts and tests missing for the newly added 5 tmcf + csv pairs. Shell scripts and the README.md also need to be updated to reflect the addition of these data.

spiekos

Updated mesh_pharma_concept.tmcf so that it's schema is mapping to the correct type of node. The node type that this data is referring to is a MeSH Supplementary Record.

Scripts and tests missing for the newly added 5 tmcf + csv pairs. Shell scripts and the README.md also need to be updated to reflect the addition of these data.

spiekos · 2024-05-15T22:44:09Z

This is now taken care of as part of #1000

suhana13 added 4 commits September 13, 2021 11:04

feat: add format_mesh.py

1ccd2da

feat: add mesh tmcfs

6a2bf2c

style: run linter

fcadfbe

feat: add readme

413c141

blunderbuss-gcf bot assigned Spaceenter Sep 17, 2021

google-cla bot added the cla: yes label Sep 17, 2021

suhana13 and others added 6 commits September 20, 2021 09:01

feat: add comments and fix dcids

4e2dd22

feat: add property

7e02eb1

Update README.md

78ae772

Add info about tMCFs

6ed4421

Update README.md

f8aa970

Update README.md

1d06659

spiekos assigned pradh and shifucun and unassigned Spaceenter Jun 6, 2022

spiekos requested review from pradh and shifucun June 6, 2022 18:30

spiekos assigned spiekos and suhana13 and unassigned pradh and shifucun Jun 6, 2022

spiekos requested a review from chejennifer June 6, 2022 18:31

Merge branch 'master' into add_mesh_data

2ffc455

pradh reviewed Jun 6, 2022

View reviewed changes

spiekos added 5 commits June 6, 2022 16:11

Create download.sh

2a988e0

Update README.md

63a1a16

Update README.md

9d39b9d

Update output file names

ed2b5d8

Update README.md

593139e

Suhana Bedi and others added 4 commits August 3, 2022 17:37

add mapping py script

db2d138

update Readme

60b22cb

add property to MeSHRecord

7af7fde

Update mesh_pubchem.tmcf

3377334

spiekos reviewed Aug 23, 2022

View reviewed changes

update mesh py script

3a265cc

spiekos reviewed Aug 24, 2022

View reviewed changes

Suhana Bedi and others added 8 commits August 29, 2022 10:07

feat: add test data for mesh record and pubchem mapping

4a3d1a8

update test data for mesh

a34ce67

feat: add test file for mesh record

7de6c17

update readme

7f5996c

Update mesh_record.tmcf

d35b536

Update README.md

c799ce0

Update mesh_pubchem.tmcf

0753ac2

Update README.md

0a5c610

spiekos reviewed Sep 20, 2022

View reviewed changes

Suhana Bedi and others added 4 commits September 26, 2022 12:03

feat: add pharmacological class script

c1518df

feat:add mesh qualifier and pharma scripts

ba70fe2

feat: add tmcfs for qualifier and pharma class

a8b9c06

Update tmcf

33415e5

spiekos requested changes Nov 16, 2022

View reviewed changes

spiekos added 2 commits December 2, 2022 14:20

fix typo

49b8353

update format of tmcf

01c25af

spiekos reviewed Dec 2, 2022

View reviewed changes

Suhana Bedi and others added 2 commits August 14, 2023 14:48

feat: add illegal char check

092876a

Merge branch 'master' into add_mesh_data

3b30247

google-cla bot added cla: no and removed cla: yes labels Mar 5, 2024

spiekos closed this May 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MeSH data #507

Add MeSH data #507

suhana13 commented Sep 17, 2021

spiekos commented Jun 6, 2022

pradh Jun 6, 2022

spiekos left a comment

spiekos left a comment

spiekos Aug 24, 2022

suhana13 Aug 29, 2022

spiekos left a comment

spiekos Sep 20, 2022

spiekos Sep 20, 2022

spiekos left a comment

spiekos left a comment

spiekos commented May 15, 2024

		from xml.etree.ElementTree import parse


		def format_mesh_xml(mesh_xml):

		@@ -0,0 +1,18 @@
		DescriptorID,ConceptID,ConceptName,ScopeNote,DateCreated,DateRevised,DateEstablished,Concept_dcid,Descriptor_dcid
		D000001,M0000001,"""Calcimycin""","""An ionophorous, polyether antibiotic from Streptomyces chartreusensis. It binds and transports CALCIUM and other divalent cations across membranes and uncouples oxidative phosphorylation while inhibiting ATPase of rat liver mitochondria. The substance is used mostly as a biochemical tool to study the role of divalent cations in various biological systems.""",1974-11-19,2016-05-27,1984-01-01,bio/M0000001,bio/D000001

Add MeSH data #507

Add MeSH data #507

Conversation

suhana13 commented Sep 17, 2021

spiekos commented Jun 6, 2022

pradh Jun 6, 2022

Choose a reason for hiding this comment

spiekos left a comment

Choose a reason for hiding this comment

spiekos left a comment

Choose a reason for hiding this comment

spiekos Aug 24, 2022

Choose a reason for hiding this comment

suhana13 Aug 29, 2022

Choose a reason for hiding this comment

spiekos left a comment

Choose a reason for hiding this comment

spiekos Sep 20, 2022

Choose a reason for hiding this comment

spiekos Sep 20, 2022

Choose a reason for hiding this comment

spiekos left a comment

Choose a reason for hiding this comment

spiekos left a comment

Choose a reason for hiding this comment

spiekos commented May 15, 2024