
Exception dumping recent planets #25

Open
tomhughes opened this issue Mar 26, 2022 · 10 comments

Comments

@tomhughes
Contributor

Planet has failed two weeks in a row now, and this week I caught the output: it is throwing an exception while dumping the relations to the PBF output:

Writing relations...
EXCEPTION: writer_thread(2): pbf_writer.cpp(189): Throw in function void pbf_writer::pimpl::write_blob(const google::protobuf::MessageLite&, const string&)
Dynamic exception type: boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<std::runtime_error> >
std::exception::what: Unable to write block of type OSMData, uncompressed size 33630260 because it is larger than the maximum allowed 33554432.

. Trying to continue...
EXCEPTION: pbf_writer.cpp(189): Throw in function void pbf_writer::pimpl::write_blob(const google::protobuf::MessageLite&, const string&)
Dynamic exception type: boost::wrapexcept<std::runtime_error>
std::exception::what: Unable to write block of type OSMData, uncompressed size 33723897 because it is larger than the maximum allowed 33554432.

Not sure if this is an issue in protozero (maybe @joto can help? we could rebuild against a newer version?) or whether planet-dump-ng is feeding it things that are too large.

@tomhughes
Contributor Author

Oh scratch that - that's the pbf_writer.cpp in this repository, not the protozero one ;-)

@zerebubuth
Owner

zerebubuth commented Mar 26, 2022 via email

@tomhughes
Contributor Author

I've applied that change locally for now so hopefully next week will go better...

@tomhughes
Contributor Author

Looks like that worked and this week's dump is in the process of being published now.

@tomhughes
Contributor Author

Of course bumping to 1.2.4 undid that local hack and it failed again this week :-(

@tomhughes
Contributor Author

It failed again this week even with the modified limit...

@zerebubuth
Owner

Urgh. Crap. Sorry about that. I've pushed v1.2.5, which should have the lower limit plus also a reduced recheck time. I'm hoping that helps, and I'll try to repro locally again.

@tomhughes
Contributor Author

Thanks. I've deployed that now.

zerebubuth added a commit that referenced this issue Oct 5, 2022
Planet dump would fail intermittently complaining of block overflow (#25). The root cause of this was failing to encode [relation 6677259](https://www.openstreetmap.org/relation/6677259) in the history PBF.

PBF files are sequences of individual blocks, and every block must be smaller than 16MiB. To satisfy this, the PBF encoder checks the size of the encoded pblock on a regular basis, every `N` elements, on the assumption that the average element size multiplied by `N` stays below 16MiB. Unfortunately, that wasn't the case for relation 6677259: versions in its history have more than 25,000 members and take up about 100kiB each, so only 164 such versions are needed to overflow a block, and relation 6677259 has around 440.

This change implements an _approximate_ size counter for relations, which should help to make sure that the pgroup is flushed when it gets too large, even if it hasn't reached `N` elements yet. This hasn't seemed necessary yet for ways (they are limited to a fixed number of nodes each) or nodes, but extending to handle those - if necessary - should be straightforward.
@zerebubuth
Owner

Thanks for your patience! I think I found what was causing the issue: basically relation 6677259 is very large (some versions >25k members) and has relatively many versions (about 440), so this was enough to overflow the pblock between checks of the current size.

I think this might also explain the intermittent failure, as it's possible that the recheck might have happened in the "right" place and split the history into two blocks, or in the "wrong" place and tried to collect it all into one block.

I've implemented an approximate size counter for the relation pgroups, which acts as an additional trigger for a re-check of the pblock size. On my local machine, this allowed dumping the 2022-09-12 planet to completion.

I've pushed a new version: v1.2.6. Hopefully this works 🤞

@tomhughes
Contributor Author

Thanks. I've deployed that now.
