-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update on the metadata episode, first draft of introduction & setup #52
Conversation
- metadata: Added the "movie" exercise - metadata: Added bio.tool figure and info - metadata: Added more definitions - metadata: Added explanation of controlled vocabulary vs ontology - metadata: General fixes, typos and timing - introduction: first draft - setup: create accounts on GitHub and bio.tools - guide: added only metadata points
_episodes/01-introduction.md
Outdated
objectives: | ||
- "First objective." | ||
- "What are the basics of Open Science in research software" | ||
- "What are the FAIR principles" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Change objectives from questions to statements.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed.
_episodes/01-introduction.md
Outdated
exercises: 0 | ||
questions: | ||
- "Key question" | ||
- "Why are best practices necessary in research software" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add "How Open Source can help with better quality of software?"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed.
_episodes/03-use-registry.md
Outdated
- System documentation and | ||
- User documentation | ||
|
||
_System documentation_ represents documents that describe the system itself and its parts. It includes requirements documents, design decisions, architecture descriptions, program source code, and help guides. On the other hand, _User documentation_ covers manuals that are mainly prepared for end-users of the product and system administrators. User documentation includes tutorials, user guides, troubleshooting manuals, installation, and reference manuals. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
for me this documentation section is very formal
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed. Hopefully, the new text captures the same content but in a more simple format.
_extras/guide.md
Outdated
|
||
- Notes from the metadata episode | ||
- [Local Installation of Zenodo](https://github.com/zenodo/zenodo/blob/master/INSTALL.rst) | ||
It may be interesting to have a local installation of zenodo to play around. The instructions using Docker are available on the link above. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
under the link
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed
_extras/guide.md
Outdated
It may be interesting to have a local installation of zenodo to play around. The instructions using Docker are available on the link above. | ||
|
||
- [Bio-Linux](http://environmentalomics.org/bio-linux-software-list/) | ||
It is a final OS containing tools that have been already published, connected metadata, etc |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what does "final OS" mean?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed.
setup.md
Outdated
@@ -3,4 +3,15 @@ layout: page | |||
title: Setup | |||
root: . | |||
--- | |||
FIXME | |||
|
|||
In order to be prepared for the lesson, you need to have accounts to the following (free) services: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
accounts on
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment addressed.
|
||
2. BioTools | ||
|
||
[bio.tools](https://bio.tools/) is a portal to bioinformatics resources worldwide, aimed to help bioinformaticians and scientists, find, understand, compare and select resources as well as use and connect them in workflows. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
they don't need bio.tools account only dev.bio.tools
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We are actually explicitly requesting for participants to create an account on the dev instance in the next sentence. However, I thought it made sense to include an intro to the actual platform. If that is redundant, we can delete it.
Thanks for the review @mkuzak ! I think I've addressed all the issues (with the possible exception of the bio.tools one, but we can revisit this for sure). |
Updated PR to reflect @tobyhodges comment in #44. |
objectives: | ||
- "First objective." | ||
- "Basics of Open Science in research software" | ||
- "Introduction to the FAIR principles" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FAIR principles are related to metadata, I suggest to add a key question about the role played by metadata in research software
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given that this introduction is for all four of the best practices, I think that it may be too specific to have a question just for metadata. Thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Totally agree, you are right, no need to add anything here
_episodes/01-introduction.md
Outdated
keypoints: | ||
- "First key point." | ||
- "Best practices in research software are tied to the FAIR principles" | ||
- "They are not tailored to software developers, but rather to a wider audience" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure whether "they" here refers to the best practices or the FAIR principles.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've addressed this with a new commit.
_episodes/01-introduction.md
Outdated
--- | ||
|
||
Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. One of the most recent publications ([Four simple recommendations to encourage best practices in research software](https://f1000research.com/articles/6-876/v1)) provided a simple, yet robust framework of simple recommendations that encourage the adoption of existing best practices in developing research software. These recommendations are designed around Open Science values, and provide practical suggestions that contribute to making research software and its source code more discoverable, reusable and transparent. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Scientific research relies on computer software" Do you mean it relies on regular software development practices?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rephrased to better reflect the original intention.
_episodes/01-introduction.md
Outdated
--- | ||
|
||
Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. One of the most recent publications ([Four simple recommendations to encourage best practices in research software](https://f1000research.com/articles/6-876/v1)) provided a simple, yet robust framework of simple recommendations that encourage the adoption of existing best practices in developing research software. These recommendations are designed around Open Science values, and provide practical suggestions that contribute to making research software and its source code more discoverable, reusable and transparent. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"One of the most recent publications" I would suggest to omit "most".
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've addressed this with a new commit.
_episodes/01-introduction.md
Outdated
|
||
Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. One of the most recent publications ([Four simple recommendations to encourage best practices in research software](https://f1000research.com/articles/6-876/v1)) provided a simple, yet robust framework of simple recommendations that encourage the adoption of existing best practices in developing research software. These recommendations are designed around Open Science values, and provide practical suggestions that contribute to making research software and its source code more discoverable, reusable and transparent. | ||
|
||
Based on these recommendations, this lesson focuses on providing both the underlying context as well as some practical exercises towards establishing their usefulness in the long term. The consequent episodes of this lesson are structured in the form of one episode per recommendation; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Colon rather than semi-colon at the end of this paragraph?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in the new commit.
_episodes/01-introduction.md
Outdated
|
||
"_When all researchers are aware of Open Science, and are trained, supported and guided at all career stages to practice Open Science, the potential is there to fundamentally change the way research is performed and disseminated, fostering a scientific ecosystem in which research gains increased visibility, is shared more efficiently, and is performed with enhanced research integrity._" [Open Science Skills Working Group Report (2017)](https://ec.europa.eu/research/openscience/pdf/os_skills_wgreport_final.pdf#view=fit&pagemode=none) | ||
|
||
Discussing best practices in developing research software, one is bound to touch on the subject of Open Science. Modern research relies on software, and building upon—or reproducing—that research requires access to the full source code behind that software ([ref](https://open-science-training-handbook.gitbook.io)). Sharing software used for research (whether computational in nature, or that relies on any software-based analysis/interpretation) is a necessary, though not sufficient, condition for reproducibility. In addition to reproducibility, sharing software openly allows developers to receive career credit for their efforts, either through direct citation or via published software articles. We are going to be discussing all these aspects in the following lesson. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
upon-or? Not sure if should be upon -or (long dash there)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in the new commit
_episodes/01-introduction.md
Outdated
|
||
Discussing best practices in developing research software, one is bound to touch on the subject of Open Science. Modern research relies on software, and building upon—or reproducing—that research requires access to the full source code behind that software ([ref](https://open-science-training-handbook.gitbook.io)). Sharing software used for research (whether computational in nature, or that relies on any software-based analysis/interpretation) is a necessary, though not sufficient, condition for reproducibility. In addition to reproducibility, sharing software openly allows developers to receive career credit for their efforts, either through direct citation or via published software articles. We are going to be discussing all these aspects in the following lesson. | ||
|
||
## FAIR principles |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest to make explicit the relation between the 4OSS and FAIR, otherwise is kind of disconnected
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in the new commit.
_episodes/03-use-registry.md
Outdated
@@ -78,25 +101,21 @@ A standard can be defined as "a structure agreed and adopted by a community" or | |||
> | |||
{: .callout} | |||
|
|||
TODO: difference between control vocabulary and ontology. | |||
|
|||
**Controlled vocabularies** provide a way to organize knowledge for subsequent retrieval. It is usually a carefully selected list of words and phrases, which are used to tag units of information (document or work) so that they may be more easily retrieved by a search. The fundamental difference between an **ontology** and a **controlled vocabulary** is the level of abstraction and relationships among concept. A formal ontology is a controlled vocabulary expressed in an ontology representation language. ([ref](https://semwebtec.wordpress.com/2010/11/23/contolled-vocabulary-vs-ontology/)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we are going to explicitlely mention ontologies, we might mention as well others, explaining how they go from soft/weak representations to those with stronger logic in it, and then move to ontologies and their importance.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left this unresolved as I think it may be too technical. Thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It would be indeed too technical. What about "The fundamental difference between and ontology and other controlled vocabularies, e.g., thesauri, is the [...]"?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That is a great point! Fixed in a new commit.
_episodes/03-use-registry.md
Outdated
|
||
- [Bio-Linux](http://environmentalomics.org/bio-linux-software-list/) | ||
It is a final OS containing tools that have been already published, connected metadata, etc | ||
By adding good enough metadata to our research software, we are directly supporting its findability, thus increasing the overall visibility of the software. This is tied to the **findable** aspect of the FAIR principles mentioned in the introductory episode of this lesson. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Metadate can also support accessibility if you include a license there, or interoperability if you include input/output data types or format. There might be some metadata supporting as well reusability.
That would be a nice exercise, asking attendees to map the metadata they have identified to FAIR principles... but maybe not enough time for it though...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in the new commit. And excellent idea for an optional challenge @ljgarcia ! :)
Thanks for the comments @ljgarcia ! I've addressed most of them in a new commit, with the exception of two. |
Thanks @tobyhodges , @ljgarcia and @mkuzak for the comments and the review. If you are happy with the current version, feel free to merge. |
I'll take the silence as acceptance, so I'll merge the PR by the end of day today. :) |
Metadata episode
Introduction episode
Setup page
Guide page