| dc.contributor.author | Abeysekara, W.A.D.S.S. | |
| dc.contributor.author | Yasotha, R. | |
| dc.date.accessioned | 2026-03-26T03:27:56Z | |
| dc.date.available | 2026-03-26T03:27:56Z | |
| dc.date.issued | 2026 | |
| dc.identifier.uri | http://drr.vau.ac.lk/handle/123456789/2030 | |
| dc.description.abstract | This study explores automated extraction and analysis of tabular data from research papers to streamline researchers’ workflows. It integrates Generative AI and Optical Character Recognition within an end-to-end pipeline applied to over 1,200 open-access Artificial Intelligence and Machine Learning papers. PDF files were first converted into high-resolution images, after which tables were detected using a fine-tuned YOLOv8 model. Text from the detected tables was extracted using Tesseract OCR, and performance-related data was filtered and analyzed using Retrieval-Augmented Generation methods. The analysis identified top-performing models, such as BERT and CODEX, and widely used datasets including SQuAD and GSM8K, enabling automated meta-analysis. The results demonstrate the scalability and effectiveness of combining computer vision and NLP for high-quality data extraction.Models such as Llama3-8B and deepseek-r1:8b 0528-quwen3-q8_0 provided domain-specific insights. The study also suggests improved table detection without relying on keyword searches by leveraging advanced AI, NLP, and ensemble learning techniques. | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | Korea Database Strategy Society (KDSS) | en_US |
| dc.subject | Table detection | en_US |
| dc.subject | YOLOv8 | en_US |
| dc.subject | Tesseract | en_US |
| dc.subject | Research papers | en_US |
| dc.subject | Retrieval-augmented generation | en_US |
| dc.subject | Optical character recognition | en_US |
| dc.title | Extract and Analyze the Performance Data from Research Papers using Generative AI | en_US |
| dc.type | Conference full paper | en_US |
| dc.identifier.proceedings | 32nd International Conference on IT Applications and Management | en_US |