machinetranslate · liashahnazaryan · Mar 28, 2023 · Mar 29, 2023 · Mar 29, 2023 · Mar 29, 2023
diff --git a/customisation/alignment.md b/customisation/alignment.md
@@ -1,6 +1,28 @@
 ---
 parent: Customisation
-layout: coming_soon
 title: Alignment
-description:
+description: Linking corresponding sentences in the input and output languages
 ---
+
+**Alignment** is the process of identifying and linking the corresponding sentences in the input and output languages.
+
+Alignment can be used to create [parallel data](/customisation/parallel-data.md).
+The aligned parallel corpora are then used to train machine translation models.
+The goal is to improve machine translation accuracy through pattern and regularity recognition in data.
+
+## Approaches
+
+- In manual alignment, human translators align corresponding [segmented sentences](/concepts/sentence-splitting.md) in the input and output languages.
+- Rule-based approaches use linguistic rules and patterns, such as word order, syntactic properties, punctuation, and sentence boundaries.
+- The statistical models rely on statistical algorithms that find and analyse relationship patterns in comparable corpora.
+The statistical relationships are based on the likelihood of observing alignments in a training corpus.
+- With neural approaches, alignment is learned automatically through [neural networks](/approaches/neural-machine-translation.md#neural-networks).
+The neural models can be based on various encoder-decoder architectures, such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), or [transformer](/approaches/transformers.md) models.
+
+## Challenges
+
+- Aligning sentences with varying lengths, punctuation, and complex structures can be challenging for alignment algorithms.
+- Many words and phrases can have multiple meanings or form idiomatic expressions.
+Semantic ambiguity can trigger inaccurate sentence alignments. 
+- Typological similarities of languages can result in sentence pairs that share highly similar linguistic properties but have different meanings and translations.
+Similarity-based interference can lead to incorrect alignments.
diff --git a/customisation/parallel-data.md b/customisation/parallel-data.md
@@ -21,8 +21,7 @@ Parallel data sets can be created manually, automatically, or created synthetica
 - Human [post-editing](../workflows/post-editing.md)
 - [Crawling](crawling.md)
 - [Alignment](alignment.md)
-
-Parallel data can be created by crawling and aligned monolingual test, and by [back-translation](back-translation.md) or [back-copying](back-translation.md).
+- [Back-translation](back-translation.md) or [back-copying](back-translation.md)
 
 ### Goals