Skip to content

Commit

Permalink
bulleted list
Browse files Browse the repository at this point in the history
  • Loading branch information
slobentanzer committed Dec 9, 2023
1 parent e40a833 commit e40e01c
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions content/25.sup.note.5.md
Original file line number Diff line number Diff line change
Expand Up @@ -255,12 +255,12 @@ Biomedical data collections are growing to enormous sizes, which makes the handl
Additionally, these large corpuses then need to be put to good use in downstream analyses, including collaborations between groups or even institutions.
The growth of arbitrarily organised large-scale collections of knowledge poses major challenges to the maintainers of these databases:

Maintaining data ingestion pipelines for dozens of upstream data sources is not feasible in a research context and detracts development time from other tasks.
- Maintaining data ingestion pipelines for dozens of upstream data sources is not feasible in a research context and detracts development time from other tasks.

Using a custom (non-standardised) data model, the effort to integrate new upstream data sources grows with the total number of pre-existing data sources.
- Using a custom (non-standardised) data model, the effort to integrate new upstream data sources grows with the total number of pre-existing data sources.
Each new data source has to be cross-referenced with all existing data sources and inconsistencies arise because the same piece of information may be represented with different levels of abstraction.

The custom data model also complicates collaboration with external researchers.
- The custom data model also complicates collaboration with external researchers.
Integrating data from different contexts requires the collaborators to adapt to the internal data model.

BioCypher can handle all three challenges.
Expand Down

0 comments on commit e40e01c

Please sign in to comment.