Skip to content

Commit

Permalink
clean up notebook
Browse files Browse the repository at this point in the history
Signed-off-by: Panos Vagenas <[email protected]>
  • Loading branch information
vagenas committed Nov 8, 2024
1 parent bf44e89 commit 2055810
Showing 1 changed file with 9 additions and 52 deletions.
61 changes: 9 additions & 52 deletions docs/docs/integrations/document_loaders/docling.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -57,17 +57,9 @@
},
{
"cell_type": "code",
"execution_count": 1,
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Note: you may need to restart the kernel to use updated packages.\n"
]
}
],
"outputs": [],
"source": [
"%pip install -qU docling langchain-community"
]
Expand All @@ -80,7 +72,7 @@
"\n",
"Now we can instantiate our loader and load documents.\n",
"\n",
"By default, `DoclingLoader` loads each input document as a LangChain `Document` with Markdown content."
"By default, `DoclingLoader` loads each input document as a LangChain `Document` with Markdown content (more options can be found in the \"Deep Dive\" section further below)."
]
},
{
Expand Down Expand Up @@ -108,13 +100,6 @@
"execution_count": 3,
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"Using CPU. Note: This module is much faster with a GPU.\n"
]
},
{
"name": "stdout",
"output_type": "stream",
Expand Down Expand Up @@ -201,17 +186,9 @@
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Note: you may need to restart the kernel to use updated packages.\n"
]
}
],
"outputs": [],
"source": [
"%pip install -qU langchain-text-splitters"
]
Expand Down Expand Up @@ -256,13 +233,6 @@
"execution_count": 8,
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"Using CPU. Note: This module is much faster with a GPU.\n"
]
},
{
"name": "stdout",
"output_type": "stream",
Expand Down Expand Up @@ -299,17 +269,9 @@
},
{
"cell_type": "code",
"execution_count": 9,
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Note: you may need to restart the kernel to use updated packages.\n"
]
}
],
"outputs": [],
"source": [
"%pip install -qU langchain langchain-huggingface langchain-milvus"
]
Expand Down Expand Up @@ -397,7 +359,7 @@
"Which are the main AI models in Docling?\n",
"\n",
"Answer:\n",
"\"The main AI models in Docling are a layout analysis model, which is an accurate object-detector for page elements, and TableFormer, a state-of-the-art table structure recognition model.\"\n",
"\"The main AI models in Docling are DocLayNet and TableFormer. DocLayNet is a layout analysis model that is an accurate object-detector for page elements, and TableFormer is a state-of-the-art table str[...]\"\n",
"\n",
"Source 1:\n",
" text: \"As part of Docling, we initially release two highly capable AI models to the open-source community, which have been developed and published recently by our team. The first model is a layout analysis m[...]\"\n",
Expand Down Expand Up @@ -451,7 +413,7 @@
"Which are the main AI models in Docling?\n",
"\n",
"Answer:\n",
"\"The main AI models in Docling are a layout analysis model, which is an accurate object-detector for page elements, and TableFormer, a state-of-the-art table structure recognition model. These models a[...]\"\n",
"\"The main AI models in Docling are a layout analysis model, an accurate object-detector for page elements, and TableFormer, a state-of-the-art table structure recognition model. These models are develo[...]\"\n",
"\n",
"Source 1:\n",
" text: \"As part of Docling, we initially release two highly capable AI models to the open-source community, which have been developed and published recently by our team. The first model is a layout analysis m[...]\"\n",
Expand Down Expand Up @@ -493,11 +455,6 @@
"\n",
"For detailed documentation of all `DoclingLoader` features and configurations head to the [API reference](https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.docling.DoclingLoader.html)."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": []
}
],
"metadata": {
Expand Down

0 comments on commit 2055810

Please sign in to comment.