-
Notifications
You must be signed in to change notification settings - Fork 169
skimdb replication timeout since Nov 24, 2014 #216
Comments
Our operational logs say we rolled out a new SSL cert to the skimdb on November 24th but other than that we have made very few operational changes to skimdb since it was set up. You're right that the HEAD request is timing out but I'm not sure that it would ever have worked in our current configuration. We can dig into it more. To eliminate one possibility, can you try relaxing your SSL requirements by messing with |
$ curl -i -k -X HEAD https://skimdb.npmjs.com/registry/
HTTP/1.1 200 OK
Server: CouchDB/1.5.0 (Erlang OTP/R16B03)
Date: Wed, 17 Dec 2014 00:07:40 GMT
Content-Type: text/plain; charset=utf-8
Content-Length: 259
Cache-Control: must-revalidate
# This never returns Since the HEAD request never completes, our CouchDB 1.6.1 which uses HEAD requests for the replication times out and cannot continue. |
it's off
this is set to 10
It took us this long to create this issue because we don't have monitoring for replication failing so we didn't notice it wasn't working until yesterday, but I agree more devs should probably notice a problem by now. It would be helpful to see a graph of skimdb replicator count on http://status.npmjs.org/ to see if there was a drop on Nov 24, 2014. 😄 |
According to #215 this is a problem with older versions of Erlang being unable to process SHA256 SSL certs. This seems a very plausible explanation since that was the change we made -- we replaced our expiring cert with a new SHA256 cert -- and it also explains why only a few people were affected, since most people are running newer versions of erlang. Traffic on that box has been growing steadily and there was no sudden drop-off on the 24th. I'm going to close this as I'm pretty sure it's a dupe of #215. Please put further updates over there. |
Hi, we've been replicating
skimdb.npmjs.com
since it was created, but on or around Nov 24th, 2014 it stopped working with a timeout error. We only just noticed this today, whoops, time to add more monitoring.The following
HEAD
request times out, maybe that's part of the problem?From couchdb:
The text was updated successfully, but these errors were encountered: