Skip to content

Commit

Permalink
[Webcrawler] Avoid null bytes in tags (#8993)
Browse files Browse the repository at this point in the history
Description
---
Fixes issue from [thread](https://dust4ai.slack.com/archives/C05F84CFP0E/p1732793680391139)

(in addition to PR #8869)

Risks
---
na

Deploy
---
connectors
  • Loading branch information
philipperolet authored Nov 28, 2024
1 parent 29b3891 commit bae9884
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -327,7 +327,7 @@ export async function crawlWebsiteByConnectorId(connectorId: ModelId) {
}),
documentUrl: request.url,
timestampMs: new Date().getTime(),
tags: [`title:${pageTitle}`],
tags: [`title:${stripNullBytes(pageTitle)}`],
parents: getParentsForPage(request.url, false),
upsertContext: {
sync_type: "batch",
Expand Down

0 comments on commit bae9884

Please sign in to comment.