JavaScript is disabled for your browser. Some features of this site may not work without it.

    Listar

    Todo RIUMAComunidades & ColeccionesPor fecha de publicaciónAutoresTítulosMateriasTipo de publicaciónCentrosDepartamentos/InstitutosEditoresEsta colecciónPor fecha de publicaciónAutoresTítulosMateriasTipo de publicaciónCentrosDepartamentos/InstitutosEditores

    Mi cuenta

    AccederRegistro

    Estadísticas

    Ver Estadísticas de uso

    DE INTERÉS

    Datos de investigaciónReglamento de ciencia abierta de la UMAPolítica de RIUMAPolitica de datos de investigación en RIUMAOpen Policy Finder (antes Sherpa-Romeo)Dulcinea
    Preguntas frecuentesManual de usoContacto/Sugerencias
    Ver ítem 
    •   RIUMA Principal
    • Investigación
    • Artículos
    • Ver ítem
    •   RIUMA Principal
    • Investigación
    • Artículos
    • Ver ítem

    Visual Object Detection with DETR to Support Video-Diagnosis Using Conference Tools

    • Autor
      Biró, Attila; Tünde Janosi-Rancz, Katalin; Szilágyi, László; Cuesta-Vargas, AntonioAutoridad Universidad de Málaga; Martín-Martín, JaimeAutoridad Universidad de Málaga; Miklós Szilágyi, Sándor
    • Fecha
      2022-06-12
    • Editorial/Editor
      MDPI
    • Palabras clave
      Diagnóstico por imagen
    • Resumen
      This text discusses the need for real-time multilingual sentence detection during online video presentations, particularly in the healthcare sector for remote diagnosis. The use of visual (textual) object detection and preprocessing is essential for subsequent analysis. The researchers propose using the DEtection TRansformer (DETR) model to achieve accurate and real-time detection of textual objects. The development of real-time videoconference translation supported by artificial intelligence has become especially important during the COVID-19 pandemic. The challenge lies in the variety of languages spoken by specialists, which requires human translators or AI-based technological channels. The accuracy of visual localization of textual elements depends on the complexity, quality, and variety of the training datasets. The researchers compare the performance of the DETR model with other real-time object detectors like YOLO4 and Detectron2, and introduce AI-based innovations through collaborative solutions combined with OCR. The researchers conducted evaluations using training datasets and achieved higher-than-expected accuracy in terms of visual text detection range, with an average accuracy of 0.4 to 0.65.
    • URI
      https://hdl.handle.net/10630/32668
    • DOI
      https://dx.doi.org/10.3390/app12125977
    • Compartir
      RefworksMendeley
    Mostrar el registro completo del ítem
    Ficheros
    applsci-12-05977-v2-3.pdf (3.538Mb)
    Colecciones
    • Artículos

    Estadísticas

    REPOSITORIO INSTITUCIONAL UNIVERSIDAD DE MÁLAGA
    REPOSITORIO INSTITUCIONAL UNIVERSIDAD DE MÁLAGA
     

     

    REPOSITORIO INSTITUCIONAL UNIVERSIDAD DE MÁLAGA
    REPOSITORIO INSTITUCIONAL UNIVERSIDAD DE MÁLAGA