Skip to content

Commit 5bda86c

Browse files
committed
confluence
Signed-off-by: Costa Shulyupin <costa.shul@redhat.com>
1 parent 8fde6f7 commit 5bda86c

File tree

1 file changed

+40
-0
lines changed

1 file changed

+40
-0
lines changed

docs/confluence-doc-source.md

+40
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
# Confluence document source
2+
3+
Fetching information from Confluence is an essential
4+
feature for fine-tuning models on internal documentation.
5+
6+
## Interfaces
7+
8+
qna.yaml file, `document` section:
9+
10+
- Confluence Host: The base URL of the Confluence instance.
11+
- Space: The Confluence space key where the documents reside.
12+
- Page titles: The titles of the Confluence pages to fetch.
13+
- Version: The version of the Confluence page (optional).
14+
15+
The qna.yaml file can define single host and multiple spaces and pages,
16+
each with an optional version.
17+
18+
Confluence credentials in config.yaml:
19+
- Username
20+
- [Token](https://support.atlassian.com/atlassian-account/docs/manage-api-tokens-for-your-atlassian-account/)
21+
22+
## Changes across modules
23+
24+
- [Configuration module](https://github.com/instructlab/instructlab/blob/main/src/instructlab/config.py)
25+
defines the structure and validation rules for
26+
the config.yaml file.
27+
- [Schema module](https://github.com/instructlab/schema) defines the structure and validation rules for
28+
the qna.yaml file.
29+
- [ilab utilities module](https://github.com/instructlab/instructlab/blob/main/src/instructlab/utils.py)
30+
fetches documents
31+
- [unit test](https://github.com/instructlab/instructlab/tree/main/tests)
32+
33+
## Additional External Packages
34+
35+
The implementation relies on the following external packages:
36+
37+
- [atlassian-python-api](https://atlassian-python-api.readthedocs.io/)
38+
A Python library to interact with Atlassian products, including Confluence.
39+
- [markdownify](https://pypi.org/project/markdownify/)
40+
A library to convert HTML content to Markdown for processing Confluence page content.

0 commit comments

Comments
 (0)