1001 Freelance Projects
Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
I have some text, a single word per TIFF file, intended for training eng_custom.traineddata. I currently use the commands below, which seem sane and do not produce any errors before the last step.

Important: I don't want to change the current approach, as my goal is to train on each of the 1000 TIFF files with the same parameters, and I have already prepared the corresponding tessRead and box files for each TIFF.

#Make lstmf file

tesseract test_sample.tiff test_sample \
--tessdata-dir /home/j/img2/tess_files \
--psm 7 --oem 1 -l eng_custom \
/home/j/tesseract/tessdata/configs/lstm.train

echo "test_sample.lstmf" > single_lstmf_file.txt
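
Since the same steps are meant to be repeated for about 1000 TIFF files, the list file passed to --train_listfile could also be generated instead of written by hand. The following is a minimal sketch, assuming all .lstmf files sit in one directory; the function name build_listfile and the example paths are hypothetical and not part of Tesseract.

```shell
#!/usr/bin/env bash
# Sketch: write one .lstmf path per line, the format read by --train_listfile.
# build_listfile is a hypothetical helper name, not a Tesseract tool.
build_listfile() {
  local lstmf_dir="$1" listfile="$2"
  # Only regular files ending in .lstmf, sorted so repeated runs give the same order.
  find "$lstmf_dir" -maxdepth 1 -type f -name '*.lstmf' | sort > "$listfile"
}

# Example call (placeholder paths):
# build_listfile /home/j/img2/lstmf all_lstmf_files.txt
```

The resulting file can be given to lstmtraining through --train_listfile in place of single_lstmf_file.txt.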

#Train LSTM model

lstmtraining \
--model_output tess_training.lstm \
--continue_from /home/j/img2/tess_files/eng.lstm \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--train_listfile single_lstmf_file.txt \
--max_iterations 1

#Stop training and finalize model

lstmtraining --stop_training \
--continue_from tess_training.lstm_checkpoint \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--model_output /home/j/img2/tess_files/eng_final.lstm

#Update traineddata with new LSTM model

mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get an error at the final step:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How should I amend the commands above so that I can combine eng_final.lstm with eng_custom.traineddata?
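
One possible cause, offered as an assumption rather than a confirmed diagnosis: when lstmtraining runs with --stop_training, the file named by --model_output is written as a complete traineddata file, not as a bare LSTM component. Copying that file over a .lstm component and recombining may therefore produce a malformed traineddata. The sketch below writes the finalized model straight to a .traineddata name and then lists its components with combine_tessdata -d. The helper names run and finalize_model, the DRY_RUN switch, and the output file name are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: with DRY_RUN=1 each command is printed instead of executed.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# finalize_model is a hypothetical helper name.
# $1 = training checkpoint, $2 = starting traineddata, $3 = output traineddata
finalize_model() {
  local checkpoint="$1" base="$2" out="$3"
  # Assumption: --stop_training saves --model_output as a full traineddata file.
  run lstmtraining --stop_training \
    --continue_from "$checkpoint" \
    --traineddata "$base" \
    --model_output "$out"
  # List the components of the result to check that it is well formed.
  run combine_tessdata -d "$out"
}

# Example call (output name is a placeholder):
# finalize_model tess_training.lstm_checkpoint \
#   /home/j/img2/tess_files/eng_custom.traineddata \
#   /home/j/img2/tess_files/eng_final.traineddata
```

If this assumption holds, the unpack, copy, and recombine steps would not be needed, and the new file could be selected with -l eng_final.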

Environment:

/home/j/img2/tess_files/

eng.traineddata eng_custom.traineddata eng.lstm eng_final.lstm

/home/j/img2/base_model/

eng.lstm eng.lstm-number-dawg eng.lstm-punc-dawg eng.lstm-recoder
eng.lstm-unicharset eng.lstm-word-dawg eng.version
eng_custom.bigram-dawg eng_custom.freq-dawg eng_custom.inttemp
eng_custom.lstm eng_custom.lstm-number-dawg eng_custom.lstm-punc-dawg
eng_custom.lstm-recoder eng_custom.lstm-unicharset eng_custom.lstm-word-dawg
eng_custom.normproto eng_custom.number-dawg eng_custom.pffmtable
eng_custom.punc-dawg eng_custom.shapetable eng_custom.traineddata
eng_custom.unicharambigs eng_custom.unicharset eng_custom.version
eng_custom.word-dawg

Any guidance would be greatly appreciated.

Thanks!

Jacob
Project ID:
3426853