Druid - Spark interoperation is problematic due to Netty dependency mismatch #4390
Comments
I think we should patch Spark to make it work with Netty 4.1, and just call out, in the Spark batch indexer for the Druid version which pulls in 4.1 everywhere, that it only works with patched Spark.

Also @b-slim, because it's related to Hadoop.
It seems this will also break anyone using https://github.com/SparklineData/spark-druid-olap.
Not sure about https://github.com/SparklineData/spark-druid-olap, but for https://github.com/metamx/druid-spark-batch publishing
Looking quickly at https://github.com/SparklineData/spark-druid-olap, it does not seem to be impacted by this anyway (I don't see a dependency on druid-processing).

Reiterating @nishantmonu51's question: what is forcing us to upgrade Netty at this point?
@himanshug probably nothing right now (not sure about druid-sql), but conceptually I'm against this approach. IMO huge projects should drive each other to update to newer versions of libraries like Netty, not stop each other from updating because others are not updating. In the comments here: https://issues.apache.org/jira/browse/SPARK-19552, Spark committer Sean Owen said that they are not that strongly against updating Netty in Spark anymore, so IMO the solution is to update Spark to Netty 4.1, not to downgrade Druid to 4.0.
@leventov yes, keeping dependency versions up to date is good in general. But in the current case, given that there is no definite need, I would let Spark get updated first and then update Druid, rather than trying to find workarounds for now.
@himanshug we're looking to upgrade the http-client: metamx/http-client#29
For the record, Spark 2.3 updated its Netty dependency to 4.1.
This issue has been marked as stale due to 280 days of inactivity. It will be closed in 2 weeks if no further activity occurs. If this issue is still relevant, please simply write any comment. Even if closed, you can still revive the issue at any time or discuss it on the [email protected] list. Thank you for your contributions.

This issue has been closed due to lack of activity. If you think that is incorrect, or the issue requires additional review, you can revive the issue at any time.
A Spark task (https://github.com/metamx/druid-spark-batch) which uses Druid fails with `AbstractMethodError`. It doesn't seem that Spark will upgrade to Netty 4.1 soon, so the proposed solution is to isolate Druid's usage of Netty somehow, e.g. via shading.
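The shading approach mentioned above could look roughly like the following Maven Shade Plugin fragment. This is only a sketch: the plugin version and the relocated package name (`org.apache.druid.shaded.io.netty`) are hypothetical, not something decided in this issue.

```xml
<!-- Sketch only: relocate Druid's Netty 4.1 classes into a private package
     so they cannot clash with the Netty 4.0 that Spark puts on the classpath. -->
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-shade-plugin</artifactId>
  <version>3.1.0</version>
  <executions>
    <execution>
      <phase>package</phase>
      <goals><goal>shade</goal></goals>
      <configuration>
        <relocations>
          <relocation>
            <pattern>io.netty</pattern>
            <shadedPattern>org.apache.druid.shaded.io.netty</shadedPattern>
          </relocation>
        </relocations>
      </configuration>
    </execution>
  </executions>
</plugin>
```

With a relocation like this, Druid's bytecode is rewritten to reference the shaded package, so Spark's unshaded Netty 4.0 and Druid's shaded Netty 4.1 can coexist in one JVM.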
Modules which currently depend on Netty 4.1 in Druid:
- druid-sql
- druid-rocketmq
- druid-avro-extensions
- druid-services (via druid-sql)
- druid-histogram (via druid-sql)
- druid-indexing-service (via hadoop-client)
- druid-indexing-hadoop
But pretty much everything in Druid is going to depend on Netty 4.1 via http-client; see metamx/http-client#29.
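To tell which Netty line actually won on a given classpath, a small probe can be useful. The sketch below assumes that `io.netty.handler.codec.http.HttpUtil` exists only in Netty 4.1 (its methods lived on `HttpHeaders` in 4.0); the class and method names in the probe itself are otherwise plain JDK.

```java
// Sketch: report which Netty line is resolvable on the current classpath.
// Assumption: io.netty.handler.codec.http.HttpUtil was introduced in Netty 4.1,
// so a successful lookup implies 4.1+, and a failed lookup implies 4.0 or no Netty.
public class NettyProbe {
    static String nettyLine() {
        try {
            Class.forName("io.netty.handler.codec.http.HttpUtil");
            return "4.1+";
        } catch (ClassNotFoundException e) {
            return "4.0 or absent";
        }
    }

    public static void main(String[] args) {
        System.out.println("Netty on classpath: " + nettyLine());
    }
}
```

Running this from inside a Spark executor versus a Druid task would show which side's Netty jar shadows the other, which is exactly the mismatch behind the `AbstractMethodError` above.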
@gianm @drcrallen @himanshug any thoughts?