Skip to content

Commit

Permalink
📦 0.4.0
Browse files Browse the repository at this point in the history
  • Loading branch information
bertsky committed Feb 11, 2024
1 parent 065a57f commit 80c92b5
Show file tree
Hide file tree
Showing 2 changed files with 22 additions and 4 deletions.
17 changes: 17 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,23 @@ Versioned according to [Semantic Versioning](http://semver.org/).

## Unreleased

## [0.4.0] - 2024-02-11

Fixed:

* binarize: OCR-D conformity (PAGE output, AlternativeImage input/output)
* docstrings

Added:

* recognize: param `overwrite_text`
* segment: param `overwrite_segments`
* segment: param `level-of-operation` (now supports `table` and `region`, too)

Changed:

* segment: existing segmentation will be masked away (unless `overwrite_segments`)

## [0.3.1] - 2023-08-17

Fixed:
Expand Down
9 changes: 5 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -83,13 +83,14 @@ Available [OCR-D processors](https://ocr-d.de/en/spec/cli) are:

- [ocrd-kraken-binarize](ocrd_kraken/binarize.py) (nlbin – not recommended)
- adds `AlternativeImage` files (per page, region or line) to the output fileGrp
- [ocrd-kraken-segment](ocrd_kraken/segment.py) (all-in-one segmentation – recommended for handwriting and simply layouted prints)
- adds `TextRegion`s, `TableRegion`s, `ImageRegion`s, `MathsRegion`s, `NoiseRegion`s, `ReadingOrder` and `AlternativeImage` to `Page` (depending on model training)
- adds `TextLine`s to `TextRegion`s, including their `Baseline`
- [ocrd-kraken-segment](ocrd_kraken/segment.py) (all-in-one segmentation – recommended for handwriting and simply layouted prints, or as pure line segmentation)
- adds `TextRegion`s to `Page` (if `level-of-operation=page`) or `TableRegion`s (if `table`)
- adds `TextLine`s (with `Baseline`) to `TextRegion`s (for all `level-of-operation`)
- masks existing segments during detection (unless `overwrite_segments`)
- [ocrd-kraken-recognize](ocrd_kraken/recognize.py) (benefits from annotated `Baseline`s, falls back to center-normalized bboxes)
- adds `Word`s to `TextLine`s
- adds `Glyph`s to `Word`s
- adds `TextEquiv`
- adds `TextEquiv` (removing existing `TextEquiv` if `overwrite_text`)

## Testing

Expand Down

0 comments on commit 80c92b5

Please sign in to comment.