-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bad status #30
Comments
tar_writer.rb is trying to access a deleted schema. Code will be developed to prevent this situation. |
This is a new bug, according to Ira, that happens when a schema gets deleted and the next package needs that schema and tries to read it. It happens with GETs. The solution is to retain the schemas directory and only delete/recreate it when xmlresolution is restarted. Jen sees the logs reflecting the problem on both GETs and POSTs. Ira will look for POSTs in the log. |
Carol suggests doing an xmlresolution code review in the near future. |
I'm adding further analysis into this issue to this ticket due to the number that occurred today alone. ONLY the following schema are what XMLresolution is attempting to GET/POST when this error is thrown:
Of this list, I still need to identify what these schema are:
The following schema have been identified as follows:
Analysis:Key: 01490ebdea13c1bc82a17e4783daeeaa: SUCCESS: TEST: 1fadeaf88d4b93ab263f7c59917c26bc 2b2f6040cfc603d5873d7fa0bf976274 42519c72a741cc30e256b99369f1d735 5e0bd6f94ec78a3a88fca2275ab05f9e 712fc5a7750e69f904f61086a997713c d90774b02fa694f3b358b4ed828295be Conclusion:It would appear that XMLresolution is somehow obtaining an outdated copy of schema or is in some way corrupting those that are failing. However, since the identical FAILURE md5sum is seen multiple times throughout the xmlresolution.log files, I would say that XMLresolution is not getting the most recent schema, intermittently. We need to look at our cache and squid to determine how old schema are being pulled down. |
The FAILURE checksums appear to be the ones computed on the schema name, and the SUCCESS checksums are the ones computed on the contents. |
Yes, so perhaps as we speculated Thursday evening, there may be two versions of the code running that is causing this issue to crop up and not that the schema are corrupted or incorrect. I think the next step will be to identify the exact code that is performing this method and correct it to the hash value of the filename and content combined, as described in #14. |
Actually, I'm not quite sure this is true. Let's use simpledc20021212.xsd as an example. Here is the original information from above about this schema: d90774b02fa694f3b358b4ed828295be The md5sum of the content is: The md5sum of the schema name string is: Even if we include the newline, the md5sum of the schema name string is: We're still left with not knowing how d90774b02fa694f3b358b4ed828295be was derived. As it turns out, this md5sum is the entire string of the location of the schema: Perhaps this was known but I find it interesting that the entire location of the schema is being used. |
It appears I've misunderstood what should be happening. In production, the md5sum of the schema should be calculated on the URL string and is stored in the manifest.xml file within the xmlres-* directory of the AIP. Apologies for the confusion. |
Per this morning's meeting: it appears that this problem was caused by code implemented relating to #17, where schemas are deleted when no collection references the schema. |
When looking at the XMLresolution log we can see the number of times these errors have occurred:
$ grep " 500" xmlresolution.log
Thrown Errors:
Number of Cases: 112
Action(s): GET, POST
I'm currently working on more statistical information to include in this ticket that may help troubleshoot the issue.
Thank you,
Jen
The text was updated successfully, but these errors were encountered: