Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate missing Bodleian V3 data #125

Open
hepplerj opened this issue Jul 10, 2024 · 2 comments
Open

Investigate missing Bodleian V3 data #125

hepplerj opened this issue Jul 10, 2024 · 2 comments
Assignees
Labels
bug Something isn't working type: database Database-related issues type: datascribe

Comments

@hepplerj
Copy link
Member

hepplerj commented Jul 10, 2024

The BOM DataScribe database is missing about half of the data / transcription form for Bodleian V3

  • DataScribe backups for the data? Data dropped out of here somehow, the transcription form half of this seems to have been deleted.
      1. Can we reacquire the data without re-transcribing it? Are there backups?
      1. What went wrong and how do we prevent it?
  • Rough date of problem identified around 2023-04-19 -- the last export that showed the issue
  • This is the only dataset that seems affected
@hepplerj hepplerj self-assigned this Jul 10, 2024
@hepplerj hepplerj added bug Something isn't working type: database Database-related issues labels Jul 10, 2024
@qtrinh2
Copy link
Contributor

qtrinh2 commented Jul 12, 2024

Root cause of the missing data likely related to VM migration for bom.chnm.org around Feb/Mar 2023: https://github.com/chnm/systems/issues/204

On 2023-03-14, BOM team reported application errors on bom.chnm.org. The errors seemed to be related to null values of the dataset in question. Application errors were resolved after updating PHP parameters. It's likely the data was missing during this time but had gone unnoticed until now. If bom.chnm.org was in active use during the migration, an unsynchronized mysql database could be the reason why the data was incomplete.

A backup of chnmdev from 2023-03-12 included the bom.chnm.org site with the dataset seemingly intact. That version of the website has been deployed to https://20230312.bom.chnm.org to start exporting the dataset.

@jmotis jmotis self-assigned this Sep 18, 2024
@jmotis
Copy link
Contributor

jmotis commented Sep 18, 2024

think we have recovered most of the data, Jessica to confirm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working type: database Database-related issues type: datascribe
Projects
None yet
Development

No branches or pull requests

3 participants