1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
Description: I have some text, which is single word on tiff file, designed to train eng_custom.traineddata. Currently I use syntax below which seem sane and does not produce any error before last step.

Important: I don't want to change current approach as my goal to train each of 1000 tiff files with same parameters, since I prepared corresponding tessRead and boxes for each tiff.

#Make lstmf file

tesseract test_sample.tiff test_sample \
--tessdata-dir /home/j/img2/tess_files \
--psm 7 --oem 1 -l eng_custom \
/home/j/tesseract/tessdata/configs/lstm.train

echo "test_sample.lstmf" single_lstmf_file.txt

#Train LSTM model

lstmtraining \
--model_output tess_training.lstm \
--continue_from /home/j/img2/tess_files/eng.lstm \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--train_listfile single_lstmf_file.txt \
--max_iterations 1

Stop training and finalize model

lstmtraining --stop_training \
--continue_from tess_training.lstm_checkpoint \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--model_output /home/j/img2/tess_files/eng_final.lstm

Update traineddata with new LSTM model

mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get problem during final step:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How to amend above commands so I can combine eng_final.lstm with eng_custom.traineddata

Environment:

/home/j/img2/tess_files/

eng.traineddata eng_custom.traineddata eng.lstm eng_final.lstm

/home/j/img2/base_model/

eng_custom.bigram-dawg eng_custom.normproto
eng_custom.word-dawg eng_custom.freq-dawg
eng_custom.number-dawg eng.lstm eng_custom.inttemp
eng_custom.pffmtable eng.lstm-number-dawg eng_custom.lstm
eng_custom.punc-dawg eng.lstm-punc-dawg eng_custom.lstm-number-dawg eng_custom.shapetable
eng.lstm-recoder eng_custom.lstm-punc-dawg eng_custom.traineddata
eng.lstm-unicharset eng_custom.lstm-recoder
eng_custom.unicharambigs eng.lstm-word-dawg eng_custom.lstm-unicharset eng_custom.unicharset eng.version eng_custom.lstm-word-dawg eng_custom.version

Any guidance would be greatly appreciated.

Thanks!

Jacob
Project ID:
3426853
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Intérprete Español en Tanzania
Category: Language Tutoring, Spanish Translator, Spanish Tutoring, Swahili Translator, Translation
Budget: €12 - €18 EUR
30 Mar 2026 16:02 GMT
Carolinag002544
Category: Excel, Internet Research, PDF, Web Search
Budget: £20 - £250 GBP
30 Mar 2026 16:02 GMT
Recreate Leaflet in InDesign
Category: Adobe Illustrator, Adobe InDesign, Photoshop, Print Design
Budget: £10 - £50 GBP
30 Mar 2026 16:01 GMT
Wordpress website developer
Category: Elementor, HTML, PHP, Web Design, Web Development, WordPress, WordPress Design
Budget: $30 - $250 NZD
30 Mar 2026 16:01 GMT
Qualtrics Mobile-Friendly Matrix Revamp
Category: CSS, HTML, JavaScript, Qualtrics Survey Platform, User Experience Research
Budget: $10 - $30 USD
30 Mar 2026 16:01 GMT
Basement Mother-in-Law Suite Completion
Category: AutoCAD, Building Architecture, Building Engineering, Construction, Electrical Engineering, Home Design, Interior Design, Plumbing
Budget: $250 - $750 USD
30 Mar 2026 15:59 GMT
Mizo Lecture Transcription Needed
Category: Audio Editing, Audio Production, Audio Services, Content Writing, Data Entry, English Spelling, Natural Language, Transcription, Translation, Voice Talent
Budget: ₹100 - ₹400 INR
30 Mar 2026 15:58 GMT
Colaboración Agencia Sudamérica SEO Avanzado + Wordpress + Desarrollo web (Proyectos demostrables) -- 3
Category: Bootstrap, Graphic Design, PHP, Prestashop, Security, SEO, Web Design, Web Development, WordPress
Budget: €2 - €6 EUR
30 Mar 2026 15:58 GMT
Renovation Leads via Google/Facebook Ads
Category: Advertising, Digital Marketing, Facebook Ads, Google Ads, Google Adwords, Internet Marketing, Lead Generation, Search Engine Marketing (SEM)
Budget: $30 - $250 SGD
30 Mar 2026 15:57 GMT
Poetry Collection Cover Design -- 2
Category: Art Consulting, Book Cover Design, Creative Design, Graphic Design, Illustration, Pattern Design, Photoshop
Budget: £250 - £750 GBP
30 Mar 2026 15:56 GMT
Stop Image Cropping, Disable Contact Form
Category: CSS, Documentation, Frontend Development, JavaScript, PHP, Web Design, Web Development, WordPress
Budget: $250 - $750 USD
30 Mar 2026 15:55 GMT
Modern Floor Plan Visualization & Optimization
Category: 2D Drafting, 3D Rendering, 3D Visualization, AutoCAD, Building Architecture, Interior Design, Revit, SketchUp, Smart Lighting, Smarty PHP
Budget: $30 - $250 USD
30 Mar 2026 15:54 GMT
UK SuDS Drainage Plans Drafting
Category: 2D Drafting, AutoCAD, Building Architecture, Building Regulations, CAD / CAM, Civil Engineering, Environmental Engineering, Geotechnical Engineering
Budget: £250 - £750 GBP
30 Mar 2026 15:54 GMT
Develop Crypto Copy Trading Software for Binance
Category: C, Programming, C++, PHP, Python, Software Architecture
Budget: ₹50000 - ₹100000 INR
30 Mar 2026 15:52 GMT
24-Hour Glide Dashboard Build
Category: API Integration, App Development, Cloud Computing, Data Management, Data Visualization, Google Sheets, Mobile App Development, Software Development
Budget: $30 - $250 USD
30 Mar 2026 15:51 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2025
1001 Freelance Projects