@ContentMine

The ContentMine

The ContentMine is extracting 100 million facts from the academic literature

Popular repositories

  1. quickscrape Public

    A scraping command line tool for the modern web

    JavaScript 259 42

  2. getpapers Public

    Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 197 37

  3. journal-scrapers Public

    Journal scraper definitions for the ContentMine framework

    Ruby 66 33

  4. workshop-resources Public

    This repository contains material to help you set up a ContentMine workshop. It also includes tutorials for learning the ContentMine tools on your own.

    37 13

  5. norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 36 21

  6. scraperJSON Public

    The scraperJSON standard for defining web scrapers as JSON objects

    33 2

Repositories

Showing 10 of 101 repositories
  • norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 36 Apache-2.0 21 34 12 Updated Jan 22, 2024
  • getpapers Public

    Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 197 MIT 37 70 6 Updated Jul 15, 2020
  • contentmine-gui Public

    GUI for executing ContentMine commands - browser SPA for running locally on user's machine.

    JavaScript 1 0 3 0 Updated Jun 21, 2020
  • CMForestPlots Public

    Things for managing the ContentMine forest plot functionality in norma

    Python 0 Apache-2.0 1 0 0 Updated Nov 17, 2019
  • sciencesource-wikibase-docker Public Forked from wmde/wikibase-docker

    šŸ³ Docker images and compose file for Wikibase and the query service

    Shell 2 95 0 0 Updated Oct 24, 2019
  • vms Public

    ContentMine virtual machines

    3 CC0-1.0 6 10 1 Updated Oct 23, 2019
  • cephis Public

    Document processing including support libraries and PDFBox2

    1 0 0 0 Updated Aug 31, 2019
  • stataforestplots Public

    documents and tests relating to ForestPlots in Stata format

    0 Apache-2.0 0 2 0 Updated Jul 9, 2019
  • junk Public

    analysis of documents containing forest plots in Stata format

    0 Apache-2.0 0 0 0 Updated Jun 20, 2019
  • ScienceSourceReview
    Go 1 Apache-2.0 1 0 0 Updated May 18, 2019