Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
I have some text, a single word on each TIFF file, prepared to train eng_custom.traineddata. Currently I use the commands below, which seem sane and do not produce any error before the last step.

Important: I don't want to change the current approach, as my goal is to train on each of 1000 TIFF files with the same parameters, and I have prepared the corresponding tessRead and box files for each TIFF.

# Make lstmf file
tesseract test_sample.tiff test_sample \
  --tessdata-dir /home/j/img2/tess_files \
  --psm 7 --oem 1 -l eng_custom \
  /home/j/tesseract/tessdata/configs/lstm.train
echo "test_sample.lstmf" > single_lstmf_file.txt

# Train LSTM model
lstmtraining \
  --model_output tess_training.lstm \
  --continue_from /home/j/img2/tess_files/eng.lstm \
  --traineddata /home/j/img2/tess_files/eng_custom.traineddata \
  --train_listfile single_lstmf_file.txt \
  --max_iterations 1

# Stop training and finalize model
lstmtraining --stop_training \
  --continue_from tess_training.lstm_checkpoint \
  --traineddata /home/j/img2/tess_files/eng_custom.traineddata \
  --model_output /home/j/img2/tess_files/eng_final.lstm

# Update traineddata with new LSTM model
mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get a problem when I run the final model:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How should I amend the above commands so that I can combine eng_final.lstm with eng_custom.traineddata?

Environment:

/home/j/img2/tess_files/
eng.traineddata  eng_custom.traineddata  eng.lstm  eng_final.lstm

/home/j/img2/base_model/
eng_custom.bigram-dawg       eng_custom.normproto      eng_custom.word-dawg
eng_custom.freq-dawg         eng_custom.number-dawg    eng.lstm
eng_custom.inttemp           eng_custom.pffmtable      eng.lstm-number-dawg
eng_custom.lstm              eng_custom.punc-dawg      eng.lstm-punc-dawg
eng_custom.lstm-number-dawg  eng_custom.shapetable     eng.lstm-recoder
eng_custom.lstm-punc-dawg    eng_custom.traineddata    eng.lstm-unicharset
eng_custom.lstm-recoder      eng_custom.unicharambigs  eng.lstm-word-dawg
eng_custom.lstm-unicharset   eng_custom.unicharset     eng.version
eng_custom.lstm-word-dawg    eng_custom.version

Any guidance would be greatly appreciated.

Thanks!
Jacob
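
One thing that may be worth checking before the combine step: the base_model listing above contains both `eng.*` and `eng_custom.*` files, and the new model is copied to `eng.lstm` while the combine command is given the prefix `eng_custom`. The sketch below assumes that `combine_tessdata <dir>/<prefix>` only picks up files named `<prefix>.<suffix>`, so files with any other prefix would be left out of the rebuilt traineddata. The helper name `find_stray_components` is hypothetical and is not part of Tesseract; it only inspects file names and does not run any Tesseract tool.

```shell
#!/bin/sh
# Hypothetical helper (not a Tesseract command): print the names of files in
# a component directory that do NOT start with the expected "<prefix>." stem.
# Assumption: combine_tessdata <dir>/<prefix> reads only <prefix>.<suffix> files,
# so anything printed here would not end up in the combined traineddata.
find_stray_components() {
    dir=$1
    prefix=$2
    for f in "$dir"/*; do
        [ -e "$f" ] || continue            # empty directory: the glob matched nothing
        name=$(basename "$f")
        case "$name" in
            "$prefix".*) ;;                # expected prefix, nothing to report
            *) printf '%s\n' "$name" ;;    # different prefix, report it
        esac
    done
}
```

With the directory from the post, `find_stray_components /home/j/img2/base_model eng_custom` would be expected to report `eng.lstm` among the other `eng.*` files, which suggests the copied model never replaces `eng_custom.lstm`. Whether that alone explains the assert is not something this check can confirm.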
Project ID:
3426853