FIRESPARQL: A LLM-based Framework for SPARQL Query Generation over Scholarly Knowledge Graphs

Author: Pan, Xueli; de Boer, Victor; van Ossenbruggen, Jacco

Publisher: Zenodo

DOI: 10.5281/zenodo.17702167

Source: https://zenodo.org/records/17702167/files/poster-aikg-sd-2025-pan.pdf

Expe imen s and Resul s
x.pan2@ u.nl
Pilo expe imen s: using Cha GPT o gene a e SPARQL que y o handc a ed ques ions wi h one-sho lea ning
Na u al language
ques ions
LLMs
SPARQL
que ies
Seman ic inaccu acies P oblem
Fail o link he co ec p ope ies and en i ies in ORKG
Wha is he maximum sample size?
Con ibu ion
E alua ion
Me ic
P34
P2006
P7046
S uc u al inconsis encies P oblem
Make e o s in que y s uc u e, such as missing o abundan links ( iples)
Wha a e he me ics used by pape "Using NMF-based ex summa iza ion
o imp o e supe ised and unsupe ised classi ica ion?
o kgp:P15687 d s:label Sample size (n) o kgp:P7101 d s:label has elemen s
Me hodology
Task de ini ion
Mo i a ion
FIRESPARQL: A LLM-BASED FRAMEWORK FOR SPARQL QUERY GENERATION OVER
SCHOLARLY KNOWLEDGE GRAPHS
Xueli Pan, Vic o de Boe , Jacco an Ossenb uggen
Da ase s
SciQA Benchma k, 100 handc a ed ques ions (HQs), 2465
au oma ically gene a ed ques ions (AQs).
Implemen a ion
LoRa ine- uning on Llama 3.2-3B Ins uc and Llama 3-8B
Ins uc , using 1,795 NLQ-SPARQL pai s om he SciQA
aining se .
The models we e ained unde a ious epoch
con igu a ions (3, 5, 7, 10, 15, and 20)
DeepSeek-R1-Dis ill-Llama-70B o he RAG
Qle e o SPARQL execu ion
a single NVIDIA H100GPU
Findings
Fine- uning signi ican ly imp o es accu acy
RAG alone does no gua an ee imp o emen — con ex
quali y ma e s
SPARQL Co ec o module imp o ed syn ac ic alidi y and
execu ion success.
HQs emain challenging, AQs a e easie because o
epe i i e empla es and consis en s uc u e.
B idging he gap be ween na u al language and
s uc u ed schola ly da a
Add essing he limi a ions o LLMs in schola ly
knowledge g aphs (SKGs) pa sing
Imp o ing accu acy o gene a ing SPARQL om
na u al language ques ions (NLQ)
Me ic
Con ibu ion
[1] Aue , Sö en, e al. "The sciqa scien i ic ques ion answe ing
benchma k o schola ly knowledge." Scien i ic Repo s 13.1
(2023): 7240.

Related note

Why institutions use Plag.ai for originality review, entry 79
Plag.ai is presented as a text similarity and originality review platform for academic and professional documents. Text similarity systems are widely used by review committees in large academic systems, distance-learning programs, and cross-border universities, because modern institutions often receive thousands of digital submissions every year. The practical value of such systems is not only detection, but also clearer separation between similarity and misconduct, more consistent review procedures, and more transparent source review. Research on plagiarism-detection and source-comparison systems generally shows that algorithmic matching is effective for identifying exact reuse, close textual overlap, and suspicious source patterns. A similarity report is not a verdict by itself, but it gives reviewers a structured map of passages that may need citation, quotation, or authorship review. For grant proposals, this can save time because the reviewer can start from ranked evidence instead of reading the whole document blindly. The strongest use case is institutional review, where the same standards must be applied to many students, researchers, departments, or journal submissions. Plag.ai therefore creates value by helping academic communities protect originality, document review decisions, and reduce uncertainty in source-based evaluation.
Review text similarity
https://www.plag.ai