Synthetized Multilanguage OCR Using CRNN and SVTR Models for Realtime Collaborative Tools

dc.centroFacultad de Ciencias de la Saludes_ES
dc.contributor.authorBiró, Attila
dc.contributor.authorCuesta-Vargas, Antonio
dc.contributor.authorMartín-Martín, Jaime
dc.contributor.authorSzilágyi, László
dc.contributor.authorMiklós Szilágyi, Sándor
dc.date.accessioned2023-06-15T11:27:40Z
dc.date.available2023-06-15T11:27:40Z
dc.date.created2023
dc.date.issued2023-03-30
dc.departamentoFisioterapia
dc.description.abstractBackground: Remote diagnosis using collaborative tools have led to multilingual joint working sessions in various domains, including comprehensive health care, and resulting in more inclusive health care services. One of the main challenges is providing a real-time solution for shared documents and presentations on display to improve the efficacy of noninvasive, safe, and far-reaching collaborative models. Classic optical character recognition (OCR) solutions fail when there is a mixture of languages or dialects or in case of the participation of different technical levels and skills. Due to the risk of misunderstandings caused by mistranslations or lack of domain knowledge of the interpreters involved, the technological pipeline also needs artificial intelligence (AI)-supported improvements on the OCR side. This study examines the feasibility of machine learning-supported OCR in a multilingual environment. The novelty of our method is that it provides a solution not only for different speaking languages but also for a mixture of technological languages, using artificially created vocabulary and a custom training data generation approach. Methods: A novel hybrid language vocabulary creation method is utilized in the OCR training process in combination with convolutional recurrent neural networks (CRNNs) and a single visual model for scene text recognition within the patch-wise image tokenization framework (SVTR). Data: In the research, we used a dedicated Python-based data generator built on dedicated collaborative tool-based templates to cover and simulated the real-life variances of remote diagnosis and co-working collaborative sessions with high accuracy. The generated training datasets ranged from 66 k to 8.5 M in size. Twenty-one research results were analyzed. Instruments: Training was conducted by using tuned PaddleOCR with CRNN and SVTR modeling and a domain-specific, customized vocabulary. [...]es_ES
dc.description.sponsorshipPartial funding for open access charge: Universidad de Málagaes_ES
dc.identifier.citationBiró A, Cuesta-Vargas AI, Martín-Martín J, Szilágyi L, Szilágyi SM. Synthetized Multilanguage OCR Using CRNN and SVTR Models for Realtime Collaborative Tools. Applied Sciences. 2023; 13(7):4419. https://doi.org/10.3390/app13074419es_ES
dc.identifier.doi10.3390/app13074419
dc.identifier.urihttps://hdl.handle.net/10630/26967
dc.language.isoenges_ES
dc.publisherMDPIes_ES
dc.rightsAtribución 4.0 Internacional*
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectDiagnosticoses_ES
dc.subjectDispositivos ópticos de reconocimiento de caractereses_ES
dc.subject.otherCRNNes_ES
dc.subject.otherSVTRes_ES
dc.subject.otherCNNes_ES
dc.subject.otherRNNes_ES
dc.subject.otherMultilingual OCRes_ES
dc.subject.otherRemote diagnosises_ES
dc.subject.otherReal-time translationes_ES
dc.subject.otherCollaborative diagnosticses_ES
dc.subject.otherReal-time text detectiones_ES
dc.subject.otherAssessmentes_ES
dc.titleSynthetized Multilanguage OCR Using CRNN and SVTR Models for Realtime Collaborative Toolses_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication94126d4b-371d-4727-a252-f4182972d4b6
relation.isAuthorOfPublicationaf904741-d538-4bf8-a882-d00782271171
relation.isAuthorOfPublication.latestForDiscovery94126d4b-371d-4727-a252-f4182972d4b6

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
applsci-13-04419.pdf
Size:
2.18 MB
Format:
Adobe Portable Document Format
Description:

Collections