Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GO namespace is missing for some of the imported GO #30

Open
leungmanhin opened this issue May 15, 2020 · 1 comment
Open

GO namespace is missing for some of the imported GO #30

leungmanhin opened this issue May 15, 2020 · 1 comment

Comments

@leungmanhin
Copy link
Contributor

Some GOs are not in any of the three namespaces (Molecular Function, Cellular Component, Biological Process)

To print out the list of GOs with their namespace missing in Guile:

(use-modules (opencog) (opencog exec) (opencog bioscience))

;  change the below paths as needed
(primitive-load "GO_2020-04-01.scm")
(primitive-load "GO_annotation_gene-level_2020-04-01.scm")
(primitive-load "Go-Plus-GO_2020-05-04.scm")

(for-each
  (lambda (go)
    (if (null? (cog-outgoing-set
          (cog-execute!
            (Get
              (TypedVariable (Variable "$x") (Type "ConceptNode"))
              (Evaluation (Predicate "GO_namespace") (List go (Variable "$x")))))))
      (format #t "~a\n" (cog-name go))))
  (filter
    (lambda (c)
      (string-prefix? "GO:" (cog-name c)))
    (cog-get-atoms 'ConceptNode)))
@leungmanhin
Copy link
Contributor Author

leungmanhin commented May 15, 2020

I see that there are 44519 GOs in GO_2020-04-01.scm, and they all come with namespaces.

After loading GO_annotation_gene-level_2020-04-01.scm, 10 extra GOs are added:

GO:0005072
GO:0042623
GO:0030702
GO:0030617
GO:0030616
GO:0006343
GO:0070869
GO:0030618
GO:0004584
GO:0098740

After loading Go-Plus-GO_2020-05-04.scm, 30 more are added:

GO:0140453
GO:0062235
GO:0140458
GO:0140451
GO:0140449
GO:0062239
GO:0062237
GO:0106258
GO:0062240
GO:0140457
GO:0062236
GO:0140454
GO:0062241
GO:0140447
GO:0062242
GO:0140450
GO:0140455
GO:0062238
GO:0106257
GO:0062243
GO:0140460
GO:0062244
GO:0140459
GO:0062246
GO:0140456
GO:0106259
GO:0140448
GO:0062245
GO:0062247
GO:0140446

It's these extra ones that come from files other than the GO-<date>.scm has no namespace associated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant