-
Notifications
You must be signed in to change notification settings - Fork 19
Dictionary Plant Secondary Metabolites
Plant Secondary Metabolites DICTIONARY:
-
I collected 3330 plant secondary metabolites and removed duplicates.
TOTAL Secondary metabolites -3330
WIKIDATA ID -1417
NO WIKIDATA ID -1913 (I will be getting)
Please, find dictionary. Presently, dictionary contain terms with 1419 WIKIDATA IDs
-
I used wikipedia page (https://en.wikipedia.org/wiki/Secondary_metabolite#Plant_secondary_metabolites) to find out only names of plant secondary metabolites. I also used following SPARQL queries
SELECT ?instance_of ?instance_ofLabel WHERE {
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
?instance_of wdt:P31 wd:Q11173.
?instance_of wdt:P31 **wd:Q331818**.}
(replacing wd:Q331818 with wd:Q70702, wd:Q32528, wd:Q80218, wd:Q3234924 and wd:Q222174)
Most of the Wikidata IDs were searched manually.
-
To get names of plant secondary metabolites, I also searched papers (supplementary data) of genome wide association studies of Arabidopsis and Rice as well as Tomato (Introgression Lines) (Alseekh et al., 2015; Matsuda et al., 2014; Wu et al., 2018).
(https://academic.oup.com/plcell/article/27/3/485/6118131,https://www.pnas.org/content/110/50/20320,
https://onlinelibrary.wiley.com/doi/10.1111/tpj.12681,
https://doi.org/10.1016/j.molp.2017.08.012)
-
Here, we are focusing on only PLANT SECONDARY METABOLITES.
-
There are natural product databases available (Mangal et al., 2013; Tung 2014). I used database developed by Mangal et al., 2014. Some of the databases contain microbial natural products also and some require subscription and are not open access.
I got database table from Tung 2014 paper
No. | Database | Reference |
---|---|---|
1. | http://crdd.osdd.net/raghava/npact/ | Mangal et al., 2013 |
2. | http://alkamid.ugent.be/ | Boonen et al., 2012 |
3. | http://www.3dmet.dna.affrc.go.jp/ | Maeda et al., 2013 |
4. | http://kanaya.naist.jp/KNApSAcK_Family/ | Takahashi et al., 2011 |
5. | http://bioinformatics.charite.de/supernatural | Dunkel et al., 2006 |
6. | http://www.ars-grin.gov/duke/ | Duke and Bogenschutz |
7. | http://cwtung.kmu.edu.tw/tipdb | Lin et al., 2013 |
8. | http://nubbe.iq.unesp.br/nubbeDB.html | Valli et al., 2013 |
9. | http://www.megabionet.org/tcmid/ | Xue et al., 2013 |
10. | http://bif.uohyd.ac.in/nemedplant/ | Meetei et al., 2012 |
11. | http://lifecenter.sgst.cn/hit/ | Ye et al., 2011 |
12. | http://tcm.cmu.edu.tw/ | Chen et al., 2011 |
13. | http://www.hccbif.info/ | Daisy et al., 2011 |
14. | http://biolinfo.org/cmkb/ | Gaikwad et al., 2008 |
15. | http://tcm.cz3.nus.edu.sg/group/tcm-id/tcmid.asp | Ji et al., 2006 |
-
I have tested this dictionary on minicorpus of 10 papers. I used following commands.
getpapers -q "plant secondary metabolites" -o secompoundMAP -x -p -k 10 -f corpora1/log.txt
ami -p "secompoundMAP" section
ami -p "secompoundMAP" search --dictionary secondary.xml
It gave full data tables as