Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The OCR is trainable in order to support even special fonts (embossed, dotted, etc.), rotated texts, good performance on more variable backgrounds, and other things which general pretrained pre-trained OCR models can't usually offer.

...

For training, you draw rectangles in the image and can rotate them using the dot connected to the top side of the rectangle. When a rectangle is selected, you can write a the text it contains to the textfield text field on the right.

...

After you’ve filled in the texts, you need to click the Split button to split the annotations into individual characters (also spaces if your text contains them) - this splits the text into equally-sized rectangles.

...

If your text is in the monospace font (each character has the same width), you probably won't need to do anything with the character annotations, but for other fonts, you should adjust the character rectangles to fit each character properly.

...

In the training settings, we set the number of training epochs for detection and classification. If your model doesn't find the text position well, increase the number of detection epochs. If the position is right, but the individual characters are misclassified as different ones, increase the number of classification epochs.

...