Con e ence on Resea ch Da a In as uc u e 2025
RDM in Con ex
e en ual DOI
© Au ho s. This wo k is licensed unde a C ea i e Commons A ibu ion 4.0 In e na ional License
P o enance Le els o Biomedical In- i o Expe imen s
Tanja Auge1, Sascha Geneh 2,3, Meike Kle ke1,
F ank K ¨
uge 4, and Max Sch ¨
ode 3
1Uni e si y o Regensbu g, Ge many
2Uni e si y o Ros ock, Ge many
3Ros ock Uni e si y Lib a y, Ge many
4Wisma Uni e si y o Applied Sciences, Ge many
*Co espondence: Tanja Auge, anja.auge@u .de
Abs ac
P o enance da a a e an essen ial in o ma ion sou ce o scien i ic expe imen s and hus
o main aining he c edibili y o esea ch indings [1]. The e a e se e al app oaches
o cap u ing p o enance in o ma ion, ha ei he ocus on he p ocess (wo k low p o e-
nance) o he ac ual da a ans o ma ions (da a p o enance) [2]. The i s is mainly
used o desc ibing ac i i ies including agen s and hei in e dependencies by means
o on ologies, such as PROV-O [3], [4], o answe he W7-ques ions [5]. The la e is
g ounded in da abase heo y and employs a uple based econs uc ion o answe he
ques ions why,how o whe e a da a i em/que y esul was c ea ed [6]. Thus, wo k-
low p o enance ails o answe ques ions abou pa icula a omic da a uples, such
as “Which pa s o he conc e e da ase we e analyzed s a is ically?”. On he o he
hand, he da a p o enance ails a inco po a ing he p ocedu al iew, such as who was
in ol ed in he analysis o ela ion be ween se e al da ase s.
We p opose o combine hese wo p o enance ypes in o a uni ied amewo k en-
abling an in eg a ed iew on di e en g anula i y le els o he scien i ic expe imen s. In
pa icula , we p opose o include he modeling o da a p o enance simila o he ap-
p oach o I es e al. [7] in he PROV-O wo k low p o enance app oach. This uni ied
amewo k enables he easoning o e W7+1-ques ions [8] and hei combina ions in
he pa icula le el o de ail needed, e.g. o iden i y he necessa y pa s o da a se s
o a pa icula analysis and he in ol ed wo k low including esponsibili ies and expe -
imen al se ings.
In o de o illus a e he concep ual amewo k, we employ he use case o in- i o
expe imen s om he biomedical domain. We concen a e on in- i o expe imen s,
i.e. pe o ming lab expe imen s including measu ing cha ac e is ics ollowed by analyz-
ing he da a. In he biomedical domain hese esul s a e o en used in simula ions and
he simula ion indings han p o en in lab expe imen s again. An exce p o ou con-
c e e use case is shown in Figu e 1. The uppe pa encodes he analyza ion s ep o
he aw da a in e ms o wo k low p o enance. While he aw da a (p o :En i y) a e
p o ided by he lab scien is (p o :Agen ), he analysis is pe o med (p o :Ac i i y)
by he da a analys (p o :Agen ). The esul ing da a (p o :En i y) a e also a ibu ed
Auge e al. |CoRDI 2025
da a: e i ied da a
uple: R_1
uple: S_1
uple: R_2
uple: S_2
uple: R_3
o ganisa ion: uni e si ypeople: lab scien is people: analys
used da a: esul
wasGene a edBy
wasA ibu edTo
ac edOnBehal O
wasA ibu edTo
wasAssocia edWi h
b o:pa O
*
*
+
used
used
used
used
wi ness: { _1, s_1}
wi ness: { _3, s_2}
wi ness: {{ _1, s_1},{ _3, s_2}}
d s:commen
d s:commen
d s:commen
wasIn o medBy
wasIn o medBy
uple: TwasGene a edBy
b o:hasPa
wo k low p o enance
da a p o enance
p o :En i y p o :Agen p o :Ac i i y xsd:s ing
b o:hasPa
b o:hasPa
b o:hasPa
Legend:
ac i i y:
analysis
Figu e 1. P o enance g aph o a biomedical in- i o expe imen . P oposed connec ions be ween wo k-
low and da a p o enance a e highligh ed in bold on and do ed edges. All ela ions wi hou
p e ix a e om he PROV on ology [4].
o he da a analys . The lowe pa o Figu e 1 encodes he da a p o enances g aph
using ou p oposed modeling mechanism. In pa icula , he ac i i ies ela e o da abase
ac ions such as join (*) and union (+) o uples. Addi ional commen s speci y he co e-
sponding wi nesses [6], which answe s ypical why-ques ions in da a p o enance. Fo
he modeling o he di e en le els o de ails [8] including also wo k low p o enance
and da a p o enance, we p opose o employ he BFO [9] ela ions b o:hasPa and
b o:pa O . These p o enance le els a e linked by do ed edges. In he example, he
i e da a uples on he da a p o enance le el a e b o:pa O he e i ied da a en i y on
he wo k low le el. In addi ion, we p opose o employ b o:hasPa be ween he le els
o ac i i ies such as *and he ac i i y analysis.
Summa izing his concep ual amewo k, i p o ides a uni ied iew. Thus, i allows
o eason o e all W7+1-ques ions [8] including all necessa y de ails wi hou limi ing
p o enance on ei he o p ocess o uple pa s.
Au ho con ibu ions
TA Concep ualiza ion, Visualiza ion, W i ing - O iginal D a . SG Concep ualiza ion,
Visualiza ion, W i ing - O iginal D a . MK W i ing - Re iew & Edi ing. FK Funding
acquisi ion, W i ing - Re iew & Edi ing. MS Concep ualiza ion, Visualiza ion, W i ing -
O iginal D a . All au ho s ead and app o ed he inal manusc ip .
Auge e al. |CoRDI 2025
Compe ing in e es s
The au ho s decla e ha hey ha e no compe ing in e es s.
Funding
SG is unded by he Deu sche Fo schungsgemeinscha (DFG, Ge man Resea ch
Founda ion) - SFB 1270/2 – 299150580.
Re e ences
[1] K. Gie end, F. K ¨
uge , S. Geneh , e al., “P o enance in o ma ion o biomedical da a and
wo k lows: Scoping e iew,” Jou nal o Medical In e ne Resea ch, ol. 26, e51297, Aug.
2024, ISSN: 1438-8871. DOI:10.2196/51297.
[2] M. He schel, R. Dies elk¨
ampe , and H. Ben Lahma , “A su ey on p o enance: Wha o ?
wha o m? wha om?” VLDB J., ol. 26, no. 6, pp. 881–906, 2017. DOI:10.1007/s00778-
017-0486-1.
[3] L. Mo eau, P. G o h, J. Cheney, T. Lebo, and S. Miles, “The a ionale o p o ,” Jou nal o
Web Seman ics, ol. 35, pp. 235–257, Dec. 2015, ISSN: 1570-8268. DOI:10.1016/j.
websem.2015.04.001.
[4] T. Lebo, S. Sahoo, D. McGuinness, e al. “PROV-O: The PROV On ology.” (2013), [Online].
A ailable: h p://www.w3.o g/TR/2013/REC-p o -o-20130430/.
[5] S. Ram and J. Liu, “Unde s anding he Seman ics o Da a P o enance o Suppo Ac i e
Concep ual Modeling,” in LNCS, Sp inge Be lin Heidelbe g, 2007, pp. 17–29. DOI:10.
1007/978-3-540-77503-4_3.
[6] J. Cheney, L. Chi ica iu, and W. C. Tan, “P o enance in da abases: Why, how, and whe e,”
Found. T ends Da abases, ol. 1, no. 4, pp. 379–474, 2009. DOI:10.1561/1900000006.
[7] Z. I es, V. Tannen, O. Bi on, e al. “ORCHESTRA: Managing he Collabo a i e Sha ing
o E ol ing Da a.” (2007), [Online]. A ailable: h ps://www.cis.upenn.edu/~zi es/
o ches a/.
[8] T. Auge, S. Geneh , M. Kle ke, F. K ¨
uge , and M. Sch ¨
ode . “Towa ds dimensions and g an-
ula i y in a uni ied wo k low and da a p o enance amewo k.” a Xi : 2504.11278 [cs.DB].
(2025).
[9] J. N. O e, J. Be e ley, and A. Ru enbe g, “B o: Basic o mal on ology1,” Applied On ology,
ol. 17, no. 1, pp. 17–43, 2022. DOI:10.3233/AO-220262.