Simple transactional outbox #4

gtoselli · 2024-01-09T15:23:52Z

No description provided.

lucagiove · 2024-01-10T17:03:53Z

If multiple repositories use the outbox pattern, we should probably add a key to separate them; otherwise one repository might run the wrong callback.
Imagine:

Repo1 -> callback1 (not identified by the job payload)
Repo2 -> callback2
executeAllScheduledJobs on Repo1 can find jobs scheduled by Repo2 and execute callback1 on them instead of callback2.
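
The collision described above, and the effect of tagging each job with a key, can be sketched with an in-memory outbox. Everything here (`ScheduledJob`, `InMemoryOutbox`, the `key` field) is hypothetical and only illustrates the idea; it is not the library's API.

```typescript
// Hypothetical in-memory model; names are illustrative, not ddd-toolkit's API.
type ScheduledJob = { key: string; payload: unknown };

class InMemoryOutbox {
  // Shared storage standing in for the single outbox collection.
  static jobs: ScheduledJob[] = [];

  constructor(
    private readonly key: string,
    private readonly callback: (payload: unknown) => void,
  ) {}

  schedule(payload: unknown): void {
    InMemoryOutbox.jobs.push({ key: this.key, payload });
  }

  // Only picks up jobs tagged with this outbox's key, so Repo1 never
  // runs callback1 on jobs scheduled by Repo2.
  executeAllScheduledJobs(): void {
    const mine = InMemoryOutbox.jobs.filter((job) => job.key === this.key);
    InMemoryOutbox.jobs = InMemoryOutbox.jobs.filter((job) => job.key !== this.key);
    mine.forEach((job) => this.callback(job.payload));
  }
}

const handledBy1: unknown[] = [];
const handledBy2: unknown[] = [];
const repo1Outbox = new InMemoryOutbox("repo1", (p) => handledBy1.push(p));
const repo2Outbox = new InMemoryOutbox("repo2", (p) => handledBy2.push(p));

repo1Outbox.schedule("a");
repo2Outbox.schedule("b");
repo1Outbox.executeAllScheduledJobs(); // Repo2's job stays untouched
```

Without the `key` filter, the last call would have handed job "b" to callback1 as well.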

gtoselli · 2024-01-10T17:10:23Z

Right now the scheduledBy field in the outbox collection is the hostname.

We could concatenate the hostname with something like contextName.

Or maybe we can add a new field and use both scheduledBy and contextName.

lucagiove · 2024-01-11T08:24:11Z

I would use a dedicated property in the document; it could be the aggregateName, which should be unique.
If we want to keep the outbox more generic, contextName might work.
The issue with being too generic is that the aggregate method can't be called saveAndPublish but would have to be something like saveAndRunJobs, which isn't nice.
Maybe we can provide a dedicated outbox interface, with messaging-oriented names, that maps onto a generic outbox. The same goes for the repo: it might expose different interfaces for different usages that map to the same methods.

Let's discuss the lib's API interfaces further here: #9

gtoselli · 2024-01-11T16:37:59Z

Today I wrote another Outbox class here on the issue branch. I introduced the aggregateName to differentiate multiple instances on the same host.

For this milestone (0.1.0) the outbox API stays close to the "messaging" concept. The outbox interface is:

scheduleEvent
publishAllScheduledEvents
publishAllEventsScheduledByMe
startOutboxWatching

The OutboxEvent is

eventPayload: any
eventRoutingKey: string
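
As a rough TypeScript rendering of the lists above: only the method and field names come from this comment, while the parameter lists and return types are assumptions made for the sketch.

```typescript
// Field names are from the comment above.
interface OutboxEvent {
  eventPayload: any;
  eventRoutingKey: string;
}

// Method names are from the comment above; the signatures are assumed.
interface Outbox {
  scheduleEvent(event: OutboxEvent): Promise<void>;
  publishAllScheduledEvents(): Promise<void>;
  publishAllEventsScheduledByMe(): Promise<void>;
  startOutboxWatching(): Promise<void>;
}

// Example value; the routing key and payload are made up.
const exampleEvent: OutboxEvent = {
  eventPayload: { orderId: "42" },
  eventRoutingKey: "order.created",
};
```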

Maybe in the future we will discuss a more generic interface.

lucagiove · 2024-01-12T08:39:53Z

I was thinking that os.hostname is probably fine for k8s, but not for other environments where you might have multiple processes on the same machine, or physical hardware where the hostname has not been customized.
What would be the drawback of using a uuid?

gtoselli · 2024-01-12T09:30:00Z

Yes, I had thought of that.
The scheduledBy field should identify whatever goes up or down as a unit (a pod in k8s, a process on a local machine, for example).

If two apps use the ddd-toolkit lib on the same machine, I think the PID should be used.

In this implementation, the hostname is the default value of the hostname argument of the Outbox class constructor.
We should find a better name!

If you use ddd-toolkit in the cloud, you will use the default value (os.hostname), but if you want you can pass something else (like process.pid). Will it be easy to explain this clearly in the documentation?
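
A minimal sketch of that default, assuming a constructor that takes the aggregateName first and the `hostname` argument second. The class name and argument order are assumptions; the real constructor on the branch may differ.

```typescript
import * as os from "node:os";

// Hypothetical constructor shape, for illustration only.
class HostScopedOutbox {
  constructor(
    readonly aggregateName: string,
    // Defaults to the machine (or pod) hostname when nothing is passed.
    readonly hostname: string = os.hostname(),
  ) {}
}

// Cloud / k8s: rely on the default.
const cloudOutbox = new HostScopedOutbox("order");

// Several apps on one machine: pass the PID instead.
const localOutbox = new HostScopedOutbox("order", String(process.pid));
```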

lucagiove · 2024-01-12T22:30:31Z

But if you go up and down, the PID changes. On a k8s restart the hostname is probably kept, but all events are sent at startup or shutdown anyway, so restoring a previously used value isn't really useful.

If we use a value that is certainly unique, this mechanism can be transparent to the library user, and the value stays under the library's control.
What are the drawbacks?
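
For comparison, the uuid variant could look roughly like this. The class name is hypothetical, and generating one value per outbox instance (rather than per process) is just one possible reading of the proposal.

```typescript
import { randomUUID } from "node:crypto";

// Hypothetical class; the identifier is generated inside the library,
// so the user never has to supply or configure it.
class UuidScopedOutbox {
  readonly scheduledBy: string = randomUUID();
}

const first = new UuidScopedOutbox();
const second = new UuidScopedOutbox();
// Each instance gets its own value, and a restart yields a new one.
```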

gtoselli · 2024-01-15T09:15:54Z

So you would put a uuid generated for each outbox instance in the scheduledBy field? Because then aggregateName becomes useless.

Let's decide whether it makes sense to group the outbox instances in the same process (host in the case of k8s).

lucagiove · 2024-01-15T14:00:15Z

The purpose of aggregateName is to map data to its code callback. The uuid could be the same per node, but yes, if each instance is in charge, then one random uuid would serve both purposes.

gtoselli · 2024-01-16T09:03:08Z

Is the uuid the same for every instance of the Outbox class in the same app?

In any case, the aggregateName is needed for the data-fn mapping.
If, however, you want the uuid to be global (for the whole app), it cannot be transparent: it must be passed from the app to the library (and managed as a global provider in the Nest DI).

gtoselli · 2024-02-06T14:12:03Z

I might have a good idea.
To avoid complicating our lives with a leader instance, and without introducing a single point of failure, a simple solution could be this:

each instance of the app has the CDC active
when a change is received, each instance tries to update the document (simultaneously or almost simultaneously) by setting the status to processing
mongo guarantees that the updateOne is atomic and that only one of the competing operations can modify the document
the event is published only by the instance that succeeds (checked via modifiedCount) in updating the status

What do you think @lucagiove?

// cdc async iterator
for await (const change of this.activeChangeStream) {
	console.log('change!');
	await this.publishEvent((change as any).fullDocument);
}

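private async publishEvent(outBoxModel: OutBoxModel) {
	// Claim the event. The status condition in the filter makes the claim explicit:
	// only one competing instance gets modifiedCount === 1, and an already
	// published document is never claimed again.
	const { modifiedCount } = await this.collection.updateOne(
		{ _id: outBoxModel._id, status: { $nin: ['processing', 'published'] } },
		{ $set: { status: 'processing' } },
	);
	if (modifiedCount !== 1) {
		this.logger.log(`Already being processed by another outbox`);
		return;
	}

	try {
		await this.publishEventFn(outBoxModel.event);
		await this.collection.updateOne(
			{ _id: outBoxModel._id },
			{
				$set: {
					status: 'published',
					publishedAt: new Date(),
				},
			},
		);
	} catch (e) {
		// Note: the document is left in 'processing', so this method will not pick it up again.
		this.logger.warn(`Failed publishEventFn with ${JSON.stringify(outBoxModel.event)}`);
	}
}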
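
To see the claim step in isolation, here is a small synchronous simulation with an in-memory stand-in for the collection. `FakeCollection`, `tryPublish`, and the initial `scheduled` status are assumptions for the sketch; it models the compare-and-set behaviour only, not MongoDB or change streams.

```typescript
// 'scheduled' as the initial status is an assumption of this sketch.
type Status = "scheduled" | "processing" | "published";
type Doc = { _id: number; status: Status; event: string };

// Hypothetical in-memory stand-in for the outbox collection.
class FakeCollection {
  constructor(private readonly docs: Doc[]) {}

  // Conditional update: modifies the document only if it still has the
  // expected status, and reports how many documents were changed.
  updateOne(id: number, expected: Status, next: Status): { modifiedCount: number } {
    const doc = this.docs.find((d) => d._id === id && d.status === expected);
    if (!doc) return { modifiedCount: 0 };
    doc.status = next;
    return { modifiedCount: 1 };
  }
}

const published: string[] = [];

function tryPublish(collection: FakeCollection, instance: string, received: Doc): boolean {
  const { modifiedCount } = collection.updateOne(received._id, "scheduled", "processing");
  if (modifiedCount !== 1) return false; // another instance already claimed it
  published.push(`${instance}:${received.event}`);
  collection.updateOne(received._id, "processing", "published");
  return true;
}

const stored: Doc = { _id: 1, status: "scheduled", event: "order.created" };
const collection = new FakeCollection([stored]);

// Three app instances react to the same change notification.
const results = ["app-1", "app-2", "app-3"].map((name) =>
  tryPublish(collection, name, { ...stored }),
);
```

Only the first caller wins the claim; the other two see `modifiedCount` of 0 and skip publishing.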

lucagiove · 2024-03-30T17:49:17Z

I would remove this from the first beta and keep repo, event bus and command bus

gtoselli added this to ddd-toolkit Jan 9, 2024

gtoselli converted this from a draft issue Jan 9, 2024

lucagiove moved this from Todo to In Progress in ddd-toolkit Jan 10, 2024

lucagiove added this to the 0.1.0 milestone Jan 10, 2024

gtoselli mentioned this issue Jan 12, 2024

0.1.0 Repo API #9

Closed

gtoselli self-assigned this Jan 12, 2024

lucagiove removed this from the 0.1.0 milestone Mar 30, 2024

gtoselli linked a pull request Mar 30, 2024 that will close this issue

Transactional outbox #32

Merged

lucagiove closed this as completed in #32 Apr 3, 2024

github-project-automation bot moved this from In Progress to Done in ddd-toolkit Apr 3, 2024
