Extract and Analyze the Performance Data from Research Papers using Generative AI

Show simple item record

dc.contributor.author Abeysekara, W.A.D.S.S.
dc.contributor.author Yasotha, R.
dc.date.accessioned 2026-03-26T03:27:56Z
dc.date.available 2026-03-26T03:27:56Z
dc.date.issued 2026
dc.identifier.uri http://drr.vau.ac.lk/handle/123456789/2030
dc.description.abstract This study explores automated extraction and analysis of tabular data from research papers to streamline researchers’ workflows. It integrates Generative AI and Optical Character Recognition within an end-to-end pipeline applied to over 1,200 open-access Artificial Intelligence and Machine Learning papers. PDF files were first converted into high-resolution images, after which tables were detected using a fine-tuned YOLOv8 model. Text from the detected tables was extracted using Tesseract OCR, and performance-related data was filtered and analyzed using Retrieval-Augmented Generation methods. The analysis identified top-performing models, such as BERT and CODEX, and widely used datasets including SQuAD and GSM8K, enabling automated meta-analysis. The results demonstrate the scalability and effectiveness of combining computer vision and NLP for high-quality data extraction.Models such as Llama3-8B and deepseek-r1:8b 0528-quwen3-q8_0 provided domain-specific insights. The study also suggests improved table detection without relying on keyword searches by leveraging advanced AI, NLP, and ensemble learning techniques. en_US
dc.language.iso en en_US
dc.publisher Korea Database Strategy Society (KDSS) en_US
dc.subject Table detection en_US
dc.subject YOLOv8 en_US
dc.subject Tesseract en_US
dc.subject Research papers en_US
dc.subject Retrieval-augmented generation en_US
dc.subject Optical character recognition en_US
dc.title Extract and Analyze the Performance Data from Research Papers using Generative AI en_US
dc.type Conference full paper en_US
dc.identifier.proceedings 32nd International Conference on IT Applications and Management en_US


Files in this item

This item appears in the following Collection(s)

  • IITAMS - 2026 [39]
    International Conference on IT Applications and Management

Show simple item record

Search


Browse

My Account