1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
Description: I have some text, which is single word on tiff file, designed to train eng_custom.traineddata. Currently I use syntax below which seem sane and does not produce any error before last step.

Important: I don't want to change current approach as my goal to train each of 1000 tiff files with same parameters, since I prepared corresponding tessRead and boxes for each tiff.

#Make lstmf file

tesseract test_sample.tiff test_sample \
--tessdata-dir /home/j/img2/tess_files \
--psm 7 --oem 1 -l eng_custom \
/home/j/tesseract/tessdata/configs/lstm.train

echo "test_sample.lstmf" single_lstmf_file.txt

#Train LSTM model

lstmtraining \
--model_output tess_training.lstm \
--continue_from /home/j/img2/tess_files/eng.lstm \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--train_listfile single_lstmf_file.txt \
--max_iterations 1

Stop training and finalize model

lstmtraining --stop_training \
--continue_from tess_training.lstm_checkpoint \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--model_output /home/j/img2/tess_files/eng_final.lstm

Update traineddata with new LSTM model

mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get problem during final step:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How to amend above commands so I can combine eng_final.lstm with eng_custom.traineddata

Environment:

/home/j/img2/tess_files/

eng.traineddata eng_custom.traineddata eng.lstm eng_final.lstm

/home/j/img2/base_model/

eng_custom.bigram-dawg eng_custom.normproto
eng_custom.word-dawg eng_custom.freq-dawg
eng_custom.number-dawg eng.lstm eng_custom.inttemp
eng_custom.pffmtable eng.lstm-number-dawg eng_custom.lstm
eng_custom.punc-dawg eng.lstm-punc-dawg eng_custom.lstm-number-dawg eng_custom.shapetable
eng.lstm-recoder eng_custom.lstm-punc-dawg eng_custom.traineddata
eng.lstm-unicharset eng_custom.lstm-recoder
eng_custom.unicharambigs eng.lstm-word-dawg eng_custom.lstm-unicharset eng_custom.unicharset eng.version eng_custom.lstm-word-dawg eng_custom.version

Any guidance would be greatly appreciated.

Thanks!

Jacob
Project ID:
3426853
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
MS Access Database Design 21 Apr 2025 15:30 GMT
MySQL Server Recovery & Website Restoration 21 Apr 2025 15:29 GMT
Elementor Set-Up 21 Apr 2025 15:27 GMT
High Quality image from my design 21 Apr 2025 15:27 GMT
Google ad support 21 Apr 2025 15:27 GMT
telesales for real estate in London 21 Apr 2025 15:27 GMT
Question Author: Employee Training 21 Apr 2025 15:26 GMT
Amazon Product and Seller Reviewer 21 Apr 2025 14:27 GMT
SEO both on page and off page 21 Apr 2025 14:19 GMT
Personal Assistant 21 Apr 2025 13:50 GMT
Seamstress Needed – 50-Piece Sewing Project (Upcycled Trousers) 21 Apr 2025 13:46 GMT
Lead Search 21 Apr 2025 13:32 GMT
ASO + ASA for new app 21 Apr 2025 13:32 GMT
Logo for a new project 21 Apr 2025 13:15 GMT
Pull RSS Feed - send to ChatGPT to rewrite - post on blog 21 Apr 2025 13:12 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2024
1001 Freelance Projects