Add post about Web Retrieval #242

ZanSara · 2023-11-10T15:01:02Z

The thumbnail is from Wikipedia, copyright free. Let me know if and how I should provide attribution.

vercel · 2023-11-10T15:01:08Z

The latest updates on your projects. Learn more about Vercel for Git.

| Name | Status | Updated (UTC) |
| --- | --- | --- |
| haystack-home | ✅ Ready | Nov 22, 2023 10:38am |

ZanSara · 2023-11-10T15:02:53Z

content/blog/the-world-of-web-rag/index.md

+
+# Searching the Web
+
+As we've seen [earlier](#), a Haystack RAG Pipeline is made of three components: a Retriever, a PromptBuilder, and a Generator, and looks like this:


This was linking to a post that we haven't published here yet. LMK what to point it to.


ZanSara · 2023-11-10T15:03:11Z

content/blog/the-world-of-web-rag/index.md

+
+# Processing the page
+
+In a [previous post](#), we've seen how Haystack can convert web pages into clean Documents ready to be stored in a Document Store. We will reuse many of the components we have discussed there, so if you missed it, make sure to check it out.


This was linking to a post that we haven't published here yet. LMK what to point it to.


TuanaCelik

I've added some comments, @bilgeyucel might have more. Also, do we have to change the headers to h2? Not sure.

TuanaCelik · 2023-11-10T15:15:36Z

content/blog/the-world-of-web-rag/index.md

+tags: ["Haystack 2.0"]
+---	
+
+In an earlier post of the Haystack 2.0 series, we've seen how to build RAG and indexing pipelines. An application that uses these two pipelines is practical if you have an extensive, private collection of documents and need to perform RAG on such data only. However, in many cases, you may want to get data from the Internet: from news outlets, documentation pages, and so on.


I would rephrase because we will probably post this before the others: "An application that uses these two pipelines is practical if you have an extensive, private collection of documents and need to perform RAG on such data only. One pipeline to index documents to your own, private document store (dubbed an indexing pipeline), and another to query it with a language model (the querying pipeline). However,..."

TuanaCelik · 2023-11-10T15:16:14Z

content/blog/the-world-of-web-rag/index.md

+
+# Searching the Web
+
+As we've seen [earlier](#), a Haystack RAG Pipeline is made of three components: a Retriever, a PromptBuilder, and a Generator, and looks like this:


Just remove the first bit imo. Start with 'A Haystack RAG pipeline...'

TuanaCelik · 2023-11-10T15:17:05Z

content/blog/the-world-of-web-rag/index.md

+
+# Processing the page
+
+In a [previous post](#), we've seen how Haystack can convert web pages into clean Documents ready to be stored in a Document Store. We will reuse many of the components we have discussed there, so if you missed it, make sure to check it out.


Again, same as previous comment, we will probably publish this first so we can remove this line?

remove links to my own blog

1999fee


ZanSara marked this pull request as ready for review November 10, 2023 15:03

ZanSara assigned bilgeyucel Nov 10, 2023

TuanaCelik suggested changes Nov 10, 2023


TuanaCelik requested a review from bilgeyucel November 10, 2023 15:18

Update index.md

ac29914

vercel bot deployed to Preview November 22, 2023 10:38 View deployment
