docs: explanation about sitemaps in headless WordPress #116

theodesp · 2025-03-26T14:47:55Z

closes #50

colinmurphy

LGTM @theodesp 🚀 🚀 🚀

moonmeister

I like the overview and the introduction of the problem. I like how each solution has clear pros and cons. The three solutions need a little work. 1 is new to me, I haven't heard of it before, have you seen others do this method? I'm open to it but seems a little strange. The three methods documented in the RFC #17 that we need to comer here are:

Proxying the sitemap from the front end to the back end. - https://github.com/wpdecoupled/site/tree/main/src/routes/%5Bfile%3Dsitemap%5D
Generating a sitemap from GraphQL content (you cover this with 2/3...but static VS SSR is not a "separate" method...just a different rendering strategy. - https://github.com/wpengine/faustjs.org/blob/main/src/pages/wp-sitemap.xml/index.jsx
Fetching existing sitemaps and and parsing them into a new sitemap with any added framework routes (the current faust method)

ahuseyn · 2025-03-31T10:24:26Z

@theodesp @moonmeister, the problem with the 2. approach (WPGraphQL) is that WPGraphQL returns maximum 100 node per page. If you set first: 1000 it will only return the first 100. Details: https://www.wpgraphql.com/docs/known-limitations#pagination-limits

You can increase that limit but it may cause problems for the WP instances low on resources: https://www.wpgraphql.com/filters/graphql_connection_max_query_amount

colinmurphy

LGTM 🚀 🚀 🚀

whoami-pwd

Looks good!

moonmeister

looking really close. just had a couple clarifying questions needed on some of the code examples

moonmeister · 2025-03-31T20:07:30Z

docs/explanation/sitemaps.md

+    const transformedContent = sitemapContent.replace(
+      new RegExp(process.env.WORDPRESS_URL, 'g'),
+      process.env.FRONTEND_URL
+    );
+
+    return new Response(transformedContent, {
+      headers: {
+        'Content-Type': 'application/xml',
+      },
+    });


If the front-end url is correctly set in WP this isn't necessary and causes performance issues. I'd recommend we make a note that the correct URL needs to be set in WP for this method and do a direct proxy as shown in https://github.com/wpdecoupled/site/blob/main/src/routes/%5Bfile%3Dsitemap%5D/%2Bserver.ts

@moonmeister I'm resposting this question regarding the nature of the proxy:

Regarding proxying the sitemap.xml from WordPress to the Headless site:

If the sitemap.xml is served from WordPress, the links inside it will use the WordPress site URL, which can cause CORS issues when accessed from the headless frontend.

If we ensure the sitemap links use the correct headless frontend hostname, the next issue is how Next.js maps <sitemapindex> links to file system routes.
For example, these sitemap links:

http://localhost:3000/wp-sitemap-posts-post-1.xml http://localhost:3000/wp-sitemap-posts-page-1.xml http://localhost:3000/wp-sitemap-taxonomies-category-1.xml http://localhost:3000/wp-sitemap-users-1.xml

Each of these would need a corresponding file or API route in Next.js.
I saw that Faust.js solves this by transforming sitemap links into a query format:
sitemap.xml?sitemap=<url.pathname>
I tried to do some Next.js Regex Path Matching to match in the config but its very unreliable they don't seem to work
I’m interested in hearing your thoughts on what we mean by proxying.

So a proxy fetches content from another endpoint based on a request. So a VPN is a proxy of sorts. https://en.wikipedia.org/wiki/Proxy_server

moonmeister · 2025-03-31T20:12:16Z

docs/explanation/sitemaps.md

+// pages/[...sitemap].js
+export async function GET(request, { params }) {
+  const { sitemap } = params;
+  const sitemapFile = Array.isArray(sitemap) ? sitemap.join('/') : sitemap;
+
+  const wpSitemapUrl = `${process.env.WORDPRESS_URL}/wp-sitemap${sitemapFile ? `-${sitemapFile}` : ''}.xml`;


I don't really understand this code. A catch-all route wouldn't work with most other routing situations. Aditionally, any unknown URL that should 404 would just render a sitemap.

See comment above regarding some questions about the logic.

docs: explanation about sitemaps in headless WordPress

831930a

theodesp requested a review from a team as a code owner March 26, 2025 14:47

colinmurphy previously approved these changes Mar 26, 2025

View reviewed changes

colinmurphy requested a review from moonmeister March 27, 2025 10:10

moonmeister requested changes Mar 27, 2025

View reviewed changes

chore: update sitemaps documentation based on feedback

9eea41e

theodesp dismissed colinmurphy’s stale review via 9eea41e March 31, 2025 10:06

theodesp requested review from moonmeister and colinmurphy March 31, 2025 10:07

colinmurphy requested a review from ahuseyn March 31, 2025 10:25

chore: pr review update

2f32abd

colinmurphy previously approved these changes Mar 31, 2025

View reviewed changes

whoami-pwd self-requested a review March 31, 2025 15:07

whoami-pwd previously approved these changes Mar 31, 2025

View reviewed changes

ahuseyn previously approved these changes Mar 31, 2025

View reviewed changes

moonmeister requested changes Mar 31, 2025

View reviewed changes

chore: update proxy example

19073cd

theodesp dismissed stale reviews from ahuseyn, whoami-pwd, and colinmurphy via 19073cd April 7, 2025 15:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: explanation about sitemaps in headless WordPress #116

docs: explanation about sitemaps in headless WordPress #116

theodesp commented Mar 26, 2025 •

edited by moonmeister

Loading

colinmurphy left a comment

moonmeister left a comment •

edited

Loading

ahuseyn commented Mar 31, 2025

colinmurphy left a comment

whoami-pwd left a comment

moonmeister left a comment

moonmeister Mar 31, 2025

theodesp Apr 3, 2025

moonmeister Apr 3, 2025

moonmeister Mar 31, 2025

theodesp Apr 3, 2025

docs: explanation about sitemaps in headless WordPress #116

Are you sure you want to change the base?

docs: explanation about sitemaps in headless WordPress #116

Conversation

theodesp commented Mar 26, 2025 • edited by moonmeister Loading

colinmurphy left a comment

Choose a reason for hiding this comment

moonmeister left a comment • edited Loading

Choose a reason for hiding this comment

ahuseyn commented Mar 31, 2025

colinmurphy left a comment

Choose a reason for hiding this comment

whoami-pwd left a comment

Choose a reason for hiding this comment

moonmeister left a comment

Choose a reason for hiding this comment

moonmeister Mar 31, 2025

Choose a reason for hiding this comment

theodesp Apr 3, 2025

Choose a reason for hiding this comment

moonmeister Apr 3, 2025

Choose a reason for hiding this comment

moonmeister Mar 31, 2025

Choose a reason for hiding this comment

theodesp Apr 3, 2025

Choose a reason for hiding this comment

theodesp commented Mar 26, 2025 •

edited by moonmeister

Loading

moonmeister left a comment •

edited

Loading