
DiSSCo Prepare Milestone report MS5.1 - Functional technical implementation of DiSSCo Knowledgebase and documentation of most relevant building blocks

Author: von Mering, Sabine; Pim Reis, Julia; Glöckler, Falko; Dillen, Mathias; Cubey, Robert; Güntsch, Anton; Petersen, Mareike
Publisher: Zenodo
DOI: 10.34960/35ec-wr55
Source: https://zenodo.org/records/17658329/files/DPP_MS5-1_Knowledgebase.pdf
DiSSCo related output
This template collects the required metadata to reference the official Deliverables and Milestones of
DiSSCo-related projects. More information on the mandatory and conditionally mandatory fields can be
found in the supporting document 'Metadata for DiSSCo Knowledge base' that is shared among work
package leads, and in Teamwork > Files. A short explanatory text is given for all metadata fields, thus
allowing easy entry of the required information. If there are any questions, please contact us at
[email protected].
Title
DiSSCo Prepare Milestone report MS 5.1 "Functional technical implementation of DiSSCo Knowledgebase
and documentation of most relevant building blocks"
Author(s)
Sabine von Mering
Julia Pim Reis
Falko Glöckler
Mathias Dillen
Robert Cubey
Anton Güntsch
Mareike Petersen
Identifier of the author(s)
https://orcid.org/0000-0003-2982-7792 (SvM)
https://orcid.org/0000-0003-4383-0357 (JPR)
https://orcid.org/0000-0002-7127-2738 (FG)
https://orcid.org/0000-0002-3973-1252 (MD)
https://orcid.org/0000-0001-7902-3843 (RC)
https://orcid.org/0000-0002-4325-4030 (AG)
https://orcid.org/0000-0001-8666-1931 (MP)
Affiliation
Museum für Naturkunde - Leibniz Institute for
Evolution and Biodiversity Science
Contributors
Wouter Addink https://orcid.org/0000-0002-3090-1761
Alex Hardisty https://orcid.org/0000-0002-0767-4310
David Fichtmüller https://orcid.org/0000-0002-0829-5849
Kessy Abarenkov https://orcid.org/0000-0001-5526-4845
Matt Woodburn https://orcid.org/0000-0001-6496-1423
Elspeth Haston https://orcid.org/0000-0001-9144-2848
Publisher
DiSSCo Prepare
Identifier of the publisher
Resource ID
https://doi.org/10.34960/sk y-bq35
Publication year
2021
Related identifiers
Is it the first time you submit this outcome?
Yes
Creation date
15/07/2021
Version
1
Citation
von Mering S., Pim Reis J., Glöckler F., Dillen M., Cubey R., Güntsch A. & Petersen M. (2021): DiSSCo
Prepare Milestone report MS 5.1 "Functional technical implementation of DiSSCo Knowledgebase and
documentation of most relevant building blocks". https://doi.org/10.34960/sk y-bq35
Abstract
The DiSSCo Prepare Milestone report MS5.1 “Functional technical implementation of DiSSCo
Knowledgebase and documentation of most relevant building blocks” describes the approach taken in
developing the Knowledgebase (KB) as a central hub for research outputs and technical documentation
related to DiSSCo. Information types to be covered in the KB and potential software components are
described. The DSpace system was chosen as a central document repository and is available in a beta
version at https://know.dissco.eu/. Feedback from project partners will be prioritized to decide on the next
steps in further development of a knowledge hub for information on DiSSCo-related topics.
Content keywords
scientific
Project reference
DiSSCo Prepare (GA-871043)
WP number
WP5
Project output
Milestone report
Deliverable/milestone number
MS5.1
Dissemination level
Public
Rights
License
Attribution 4.0 International (CC BY 4.0)
Resource type
Text
Format
PDF
Funding Programme
H2020-INFRADEV-2019-2
Contact email
[email protected]
DiSSCo Prepare WP5 – Milestone report
MS5.1 Functional technical implementation of DiSSCo
Knowledgebase and documentation of most relevant
building blocks
Work package lead: Mareike Petersen (MfN)
Authors: Sabine von Mering (MfN), Julia Pim Reis (MfN), Falko Glöckler
(MfN), Mathias Dillen (MeiseBG), Robert Cubey (RBGE), Anton Güntsch
(BGBM), Mareike Petersen (MfN)
Contributors: Wouter Addink (Naturalis), Alex Hardisty (U Cardiff), David
Fichtmüller (BGBM), Kessy Abarenkov (U Tartu), Matt Woodburn (NHM),
Elspeth Haston (RBGE)
Abstract
The DiSSCo Prepare Milestone report MS5.1 “Functional technical implementation of DiSSCo
Knowledgebase and documentation of most relevant building blocks” describes the approach taken in
developing the Knowledgebase (KB) as a central hub for research outputs and technical documentation
related to DiSSCo. Information types to be covered in the KB and potential software components are
described. The DSpace system was chosen as a central document repository and is available in a beta
version at https://know.dissco.eu/. Feedback from project partners will be prioritized to decide on the
next steps in further development of a knowledge hub for information on DiSSCo-related topics.
Keywords
DiSSCo, DSpace, FAIR data, information transfer, knowledge base, knowledge hub, repository,
system architecture
H2020-INFRADEV-2018-2020 / H2020-INFRADEV-2019-2
Index
Abstract
Keywords
Index
01 INTRODUCTION
Approach
Information types to be covered in the Knowledgebase
Landscape analysis to select a system as document repository
DiSSCo Prepare Round table on “Organisation of knowledge and documentation for stakeholders”
Content of the DiSSCo Knowledgebase
DSpace as a document repository
Other information types to be integrated
Software code
User stories & use cases
Training materials
Pilot implementation of DiSSCo Knowledgebase using DSpace as document repository
Outlook & next steps
Appendix 1: Minutes of the DiSSCo Prepare Round table on “Organisation of knowledge and
documentation for stakeholders” on July 6th, 2021.

01 INTRODUCTION
As an initiative formed by public research institutions, the Distributed System of Scientific Collections
(DiSSCo) is committed to Open Science. Open Science not only makes scientific work more transparent
and accessible, but also enables a whole new set of collaborative and IT-based scientific methods.
Therefore, the outputs of our common research projects should be openly available as much as
possible and research data easily Findable, Accessible, Interoperable and Reusable (FAIR principles).
DiSSCo Prepare (DPP), the preparatory phase of DiSSCo, is building on profound technical knowledge
from various sources and initiatives. Efficient knowledge and technology transfer to partners building
the DiSSCo technical backbone will be facilitated by a central and freely accessible DiSSCo
Knowledgebase, designed and implemented within the Work Package “Common Resources and
Standards” and specifically task 5.1 “DiSSCo Knowledgebase for technical development”. As a hub for
knowledge management relevant within the DiSSCo context, the DiSSCo Knowledgebase (KB) will not
only store all research outputs from DiSSCo-linked projects and other resources in one place but also
further building blocks relevant for users of the DiSSCo Research Infrastructure (RI). Such building
blocks include web services, PID (persistent identifier) systems, controlled vocabularies, ontologies and
data standards for bio- and geo-collections objects, collection descriptions, digital assets standards as
well as domain-specific software for quality assurance and monitoring.
Approach
Information types to be covered in the Knowledgebase
In close collaboration and exchange with other DPP project partners, the task group collected the
extent of information types expected to be stored in the knowledgebase. To get a more complete
picture, this was also discussed together with project overarching bodies such as the DiSSCo Technical
Team. As a last preparatory step, a survey was sent to all task and work package leads of DPP to
evaluate which information types partners are planning to make available via the knowledgebase. The
feedback was included in the discussions and next planning steps. The latest overview of desired
information types is given in Figure 1.
Fig. 1: Information Types in the DiSSCo Knowledgebase. This expected cluster of information categories (blue
dots) was based on DPP project outcomes and relevant external resources.
As the term Knowledgebase traditionally was used in a context of providing machines with a database
of facts for reasoning processes, the partners agreed that we would use the term with a main focus on
human readability in the DiSSCo Knowledgebase in the first place. The importance of machine
readability varies amongst different information types. However, the metadata will be
machine-readable in a consistent manner.
Landscape analysis to select a system as document repository
A comprehensive landscape analysis with short presentations of each system took place during two
task group meetings. The following candidate systems were introduced by different task members:
DSpace + Vivo, Alfresco, Fedora, OSF, Liferay, Invenio, Dataverse, and E-Prints. For the decision
process, requirements for the knowledgebase were collected and prioritised.
Criteria of top priority for the decision of an appropriate component of the knowledgebase to serve
the information type “Public Documents and External Resources” were:
● Capability of storing documents and free text for referencing deliverables, publications and
Questions and Answers / FAQs
● Extensibility & customization (plugins or extensions)
● Comprehensive public technical documentation and user documentation
● Comprehensive REST API
● Mechanisms for stable versioning of content
● Search index (including the capability of indexing of customizable metadata)
● Hierarchical structuring of pages and other entities
● Capability of structuring the content by categories, tags or labels
● File upload, storage and download
● User-friendly search functionality
● Regular security updates
● View and download functionality for common document and image file formats
● Option to run an instance in a cloud environment (rather than a Software as a Service
approach)
● Sustainability of the software product (e.g. organisation in place to support and maintain)
Based on the requirements, the most promising systems were DSpace, CKAN, and Alfresco. All three
products meet the requirements for the respective information type “Public documents and external
resources” in the knowledgebase according to the prioritized criteria. So, the following additional
aspects with respect to the implementation and maintenance have been included in the decision
process: latest releases, size of user community, regular support and good software maintenance
allowing the correction of possible bugs, and regular security updates. Thus, the team chose DSpace,
an open source repository software package of rich and powerful features that focus on long-term
storage, access and preservation of digital content. It is available as free software under an
open-source license in a public GitHub repository and has a huge user community and a very active group of
developers. It offers customizable interfaces, a full-text-search where the provided metadata or
content is indexed to be searchable and accessible with the use of a REST API enabling the data to be
FAIR. A reliable search functionality allows the end-users to find the content without delay even for
huge amounts of data which is essential regarding scalability with an increasing amount of linked
information. A list of more convincing key features of DSpace can be accessed at the official website.
The approach and decision process was also presented to a wider audience in a blog post published in
December 2020 in the DiSSCo Tech blog.
Working session at the DiSSCo Prepare All Hands Meeting
A first beta version (see Fig. 2) was made available and introduced during the First virtual All Hands
Meeting of DiSSCo Prepare (January 18-22, 2021). This event brought together the project partners, with
the objective to present and discuss what will become Europe’s leading natural science collections
Research Infrastructure, the DiSSCo RI. In a dedicated section, the DiSSCo Knowledgebase and its
functionalities were presented to the audience in order to discuss possibilities to structure the content,
and to present a number of features such as the full text search functionality, possibility of different
types of file upload, metadata customization, etc.
The participants could test the first version by browsing the software and testing features and tools,
allowing feedback and requirements from DiSSCo partners.
Fig. 2: Screenshot of the first beta version of DiSSCo Knowledgebase homepage.
Some feedback on the first beta version was collected directly during the meeting. A dedicated GitHub
repository (see Fig. 3) was set up to collect all issues and feedback related to the Knowledgebase.
Fig. 3: Screenshot of GitHub repository of the DiSSCo KB (https://github.com/DiSSCo/kb).
DSpace Durable Digital Depository
DSpace offers customizable interfaces, a full-text-search where the provided metadata or content is
indexed to be searchable and accessible with the use of a REST API enabling the data to be FAIR. With
the reliable search functionality the end-users can find and browse the desired content. DSpace allows
storing different versions of documents, adding metadata and free text, hierarchical structuring of
pages and stable resource URLs. This enables the DiSSCo partners to reference their content like
deliverables, publications and Questions and Answers (i.e. FAQs). Furthermore, an editorial workflow
modelled and role based access management in the system allow the project coordinators and
administrators to prepare content privately and to review the content which helps to conduct a
profound quality assurance prior to publication.
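To illustrate how metadata could be read through the REST API, the following minimal Python sketch builds a paged item query and extracts titles from a decoded response. It assumes that the DSpace 6 REST web application is deployed under /rest on the Knowledgebase host, which this report does not state; the sample response is hypothetical and only mirrors the key/value shape of DSpace 6 metadata entries. No network request is made.

```python
from urllib.parse import urlencode

# Assumption: the REST webapp is reachable under /rest on the Knowledgebase host.
BASE_URL = "https://know.dissco.eu/rest"


def items_url(limit=20, offset=0, expand="metadata"):
    """Build the URL for one page of items with their metadata expanded."""
    query = urlencode({"expand": expand, "limit": limit, "offset": offset})
    return f"{BASE_URL}/items?{query}"


def titles(items):
    """Collect all dc.title values from a decoded JSON list of items."""
    found = []
    for item in items:
        for entry in item.get("metadata", []):
            if entry.get("key") == "dc.title":
                found.append(entry.get("value"))
    return found


# Hypothetical response fragment, used here instead of a live request.
sample = [
    {
        "name": "MS5.1",
        "metadata": [
            {"key": "dc.title", "value": "DiSSCo Prepare Milestone report MS 5.1"},
            {"key": "dc.publisher", "value": "DiSSCo Prepare"},
        ],
    }
]

print(items_url(limit=5))
print(titles(sample))
```

Keeping URL construction and response parsing in separate functions makes it easy to swap in a real HTTP client once the actual endpoint is confirmed.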
DSpace was developed to be open source, and in such a way that institutions and organizations with
minimal resources could run it. The system is designed to run on the UNIX platform, and comprises
other open source middleware and tools, and programs written by the DSpace team. All original code
is in the Java programming language and requires an open Java development kit (OpenJDK). Other
pieces of the technology stack include a relational database management system (PostgreSQL) to
store and provide access to data points that are related to one another. More information on how to
install DSpace is provided at: https://wiki.lyrasis.org/display/DSDOC6x/Installing+DSpace.
In order to enable developers to easily pack, ship, and run DSpace as a lightweight, portable,
self-sufficient container, which can run virtually anywhere, the open OS-level virtualization software
Docker was used.
DSpace System Architecture
Fig. 8: DSpace technical architecture (https://wiki.lyrasis.org/display/DSDOC6x/Architecture)
The DSpace technical architecture is designed to have a three-layer architecture (see Fig. 8), defined
by: storage, business, and application layers. A documented API allows for future customization and
enhancement. The storage layer is responsible for physical storage of metadata and content,
implemented using the file system, as managed by PostgreSQL database tables. The business layer is
where the management of the content, users (e-person), authorization and workflow resides. The
application layer contains components for cross-reference that allows DSpace to communicate with
external resources, such as Web user interface and the Open Archives Initiative
(https://guidelines.openaire.eu/en/latest/) protocol for metadata harvesting service. Each module has
an API to allow DSpace adopters to replace or enhance that function as desired (Naik & Naik 2019¹).
First Beta version of the DiSSCo Knowledgebase
The online presentation of the content is organized in a flat hierarchy of two levels that are called
‘Projects’ and ‘Clusters’, of which the latter represent the lowest aggregation level of content items.
Users can access landing pages of individual content items using the full-text search, the faceted
browsing, and through external reference such as a persistent identifier (e.g. DOI) in order to find the
content even without the hierarchical structure. Thus, multiple approaches of exploring the content
are covered.
DSpace can accommodate any type of files uploaded to the system. Within the context of the DiSSCo
Knowledgebase only the most common document types will be relevant (e.g., XML, XSD, PDF, XLS, PPT,
JPEG), but there is no actual limitation. A lot of content is expected to be submitted in free text format
(for FAQs and Glossary) instead of using the file upload. This is beneficial for the full text search, which
is limited in the case of certain file types.
The first beta version of the Knowledgebase uses the Metadata Schema (Dublin Core) with a default
submission form, and metadata display. The items are displayed in a simple and full metadata item
record. However, multiple metadata schemas can be configured and required metadata fields selected
from a mix of configured schemas to describe your items.
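The idea of required metadata fields can be sketched in a few lines of Python. The qualified Dublin Core field names and the choice of which fields count as mandatory are assumptions made for illustration; the actual list is defined by the DiSSCo output form and the configured submission form, not by this sketch.

```python
# Assumed set of mandatory fields (illustrative only).
MANDATORY = ("dc.title", "dc.contributor.author", "dc.publisher", "dc.date.issued")


def missing_fields(record):
    """Return the mandatory fields that are absent or empty in a metadata record."""
    return [field for field in MANDATORY if not record.get(field)]


# Example record using values that appear on the cover page of this report.
record = {
    "dc.title": "DiSSCo Prepare Milestone report MS 5.1",
    "dc.contributor.author": ["von Mering, Sabine", "Pim Reis, Julia"],
    "dc.publisher": "DiSSCo Prepare",
    "dc.date.issued": "2021",
    "dc.subject": ["DiSSCo", "DSpace", "knowledge base"],
}

print(missing_fields(record))
print(missing_fields({"dc.title": "Draft"}))
```

A check of this kind could run before submission so that incomplete records are flagged to the contributor rather than to the reviewer.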
Configurable workflow and content curation when uploading an item
Workflows allow submissions to be checked before entering the items into the repository (Fig. 9). This
can be achieved by responsible e-persons (an e-person is a user who has permission to edit and
administer the cluster). If required, e-persons or a group of e-persons can be assigned to the clusters
to check for accuracy, in order to improve the metadata, or simply to decide if the content is suitable
to be archived.
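The review step described above can be modelled as a small state machine. This Python sketch is not DSpace's workflow implementation; the state names, the transition table and the `advance` helper are hypothetical and only illustrate that an item moves forward when an e-person assigned to the cluster approves or rejects it.

```python
from enum import Enum


class State(Enum):
    SUBMITTED = "submitted"
    IN_REVIEW = "in review"
    ARCHIVED = "archived"
    REJECTED = "rejected"


# Hypothetical transition table: which states may follow which.
ALLOWED = {
    State.SUBMITTED: {State.IN_REVIEW},
    State.IN_REVIEW: {State.ARCHIVED, State.REJECTED},
    State.ARCHIVED: set(),
    State.REJECTED: set(),
}


def advance(current, target, eperson, cluster_reviewers):
    """Move an item to `target` if the e-person is assigned to the cluster."""
    if eperson not in cluster_reviewers:
        raise PermissionError(f"{eperson} is not assigned to this cluster")
    if target not in ALLOWED[current]:
        raise ValueError(f"cannot go from {current.value} to {target.value}")
    return target


reviewers = {"alice"}
state = advance(State.SUBMITTED, State.IN_REVIEW, "alice", reviewers)
state = advance(state, State.ARCHIVED, "alice", reviewers)
print(state.value)
```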
¹ Naik P. G. & Naik G. R. 2019. Creating and Managing Institutional Repository Using DSpace - A Case
Study Approach. Educreation Publishing, Kolhapur.
Fig. 9: Screenshot of DiSSCo Knowledgebase submission workflow of a specific cluster.
With the feedback collected from the GitHub repository and the notes from the All Hands Meeting, a
number of improvements and adjustments were implemented into the Knowledgebase. Besides layout
changes (colours replaced to be similar to the DiSSCo Prepare branding), more general information
about DiSSCo and a section dedicated to redirect the user to the DiSSCo website were included in the
homepage.
The mandatory metadata fields required by the DiSSCo project output form used to generate the
report cover pages (currently at https://www.cognitoforms.com/DiSSCo1/DiSSCoRelatedOutput)
were added, including a metadata field for the item’s DOI (accessible by full item record view), the
publisher, citation and also the abstract to increase the searchability / usability (see Fig. 11).
Fig. 10: Screenshot of the item’s display and its related metadata with the additional fields: Keywords, Publisher,
Abstract, Citation (https://know.dissco.eu/handle/item/112).
After applying the suggested enhancements collected by project partners to the Knowledgebase, it is
necessary to make a deployment in a production server; this step makes the software product
available to the end-user. To ensure that the product is well tested prior to its release, the following
workflow is applied: firstly, the DiSSCo Knowledgebase is tested in the development test environment
and re-tested at the pre-production server. At this stage, issues are collected from the partners within
the GitHub repository (see Fig. 3) dedicated to the Knowledgebase. The last step is to make the product
available on the production server, where it is being hosted and accessed by the end-users.
Outlook & next steps
The DiSSCo Knowledgebase, even though still in beta version and under active development, has
already been useful in providing access to research outputs from various DiSSCo-linked projects.
Discussions during the DPP Round Table showed that there is consensus among project partners that
linking different resources and components should have priority over developing new functionalities
within one system.
Next steps will include further exchange and collaboration with several DPP work packages to discuss
and specify further integration of tools and e-services. This will be especially important for the
following topics (and the WPs and task groups working on them):
● training materials, integration with learning platform and DEST portal offering training
courses and workshops (WP 2)
● the Digital maturity self-assessment tool (WP3)
● Policy tool (WP 7, Task 7.3)
The extensive feedback and ideas contributed during the DiSSCo Prepare Round table meeting will be
sorted, structured and prioritized. A single point of entry will be created where project partners (and
potentially also external users) could suggest resources for inclusion in the KB. This should be done in
a structured way (e.g. in GitHub), by entering the title, the URL and maybe some additional information
on the resource.
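A structured suggestion of this kind could look like the following Python sketch. The `ResourceSuggestion` class and its checks are hypothetical; they only show that a title, a URL and optional notes are enough to validate an entry before a curator looks at it.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class ResourceSuggestion:
    """Hypothetical structure for one suggested resource."""
    title: str
    url: str
    notes: str = ""

    def problems(self):
        """Return a list of human-readable issues; empty if the entry looks usable."""
        issues = []
        if not self.title.strip():
            issues.append("title is empty")
        parsed = urlparse(self.url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            issues.append("url is not an absolute http(s) URL")
        return issues


good = ResourceSuggestion(
    title="The DiSSCo Knowledge Base (blog post)",
    url="https://dissco.tech/2020/12/18/the-dissco-knowledgebase/",
    notes="Background reading for the round table",
)
bad = ResourceSuggestion(title=" ", url="know.dissco.eu")

print(good.problems())
print(bad.problems())
```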
In the remaining months before the due date of the deliverable, the work on the DiSSCo
Knowledgebase will focus on both essential aspects towards a robust operation of the system and
additional features that have been identified as high priority. For the robust operation the
maintenance and update workflows will be revisited and consolidated, which includes efficient
backup/restore procedures, features for web analytics and technical user support according to basic
service levels. The basic support would be available at least until the end of the project. The top priority
tasks that have been requested by the community comprise the improvement of the user experience
and web design (according to the DiSSCo/ELViS style guide), facilitation of importing existing
knowledge, including deep- and cross-linking of content (e.g. via ORCIDs), well-established workflows
for content managers and contributing users (e.g. automated DOI assignment). Some high (but not
top) priority features depend on the progress of other tasks or even external aspects. One example
would be the progress on a DiSSCo wide authentication service the Knowledgebase would connect to
for Single-Sign-On authentication of users. Another example would concern feature requests regarding
DSpace’s API layer which is limited in DSpace 6, but full-fledged in DSpace 7. However, DSpace 7 is
currently in a Beta version, with active development and testing happening within the community, so
that the first release of the DiSSCo Knowledgebase will still be based on the stable DSpace version 6.
The participants of the Round Table expressed the demand to include only well curated content in the
KB. To ensure high data quality and consistency, an editorial board for the DiSSCo KB could be
established. This needs to be discussed and evaluated among project partners but might help to
develop the KB into a trusted resource for the DiSSCo community but also for (potential) external users.
Appendix 1:
Minutes of the DiSSCo Prepare Round table on “Organisation of knowledge and documentation for
stakeholders” on July 6th, 2021.
DiSSCo Prepare Round table
Organisation of knowledge and documentation for stakeholders
When: July 6, 2021 – Tuesday, 9:00-12:00 CEST (virtual meeting)
Chairs: Mareike Petersen, MfN & Wouter Addink, Naturalis
Introduction
DiSSCo aims to provide open access to the knowledge and documentation that is being produced e.g.
in the DiSSCo-linked projects. Through DiSSCo Prepare (DPP) a Knowledge Base is being created:
https://know.dissco.eu, a repository that will offer a central place to store all DiSSCo knowledge to
make this publicly accessible to DiSSCo stakeholders and anybody interested in this information. The
Knowledgebase content is initially organized around Projects which are represented by EC projects in
which DiSSCo outputs are being created (DiSSCo Prepare, ICEDIG, ENVRI-FAIR, MOBILISE, SYNTHESYS+,
BiCIKL). DPP partners have indicated through a survey which information types they may want to
provide through the knowledgebase; these were clustered in groups (see Fig. 1).
Fig. 1: Possible information types in the DiSSCo Knowledgebase.

Round table topics
1. To improve this organisation of knowledge further, a discussion is needed around the potential
stakeholders that need access to the knowledge and documentation, to discuss how the
information should be organised for each of the stakeholder groups to make it as accessible as
possible.
2. Another thing that needs to be discussed is where to make the documentation available/
provide links. Should access be provided through the Biodiversity Knowledge Centre, through
GBIF, through some of the other DiSSCo eServices like the Helpdesk, ELViS, or the Policy
Framework and Self Assessment tools?
3. The selected solution (DSpace) for the knowledgebase is not suitable for all information types
collected through the survey but focuses on the most common information type "Public
Documents and External Resources" in order to aggregate references to distributed
documents and sources in a single point of entry. For Software code, user stories and use cases
DiSSCo is using GitHub. Further discussion is needed to decide on systems for other
information types.
4. Know.dissco.eu is not a CoreTrustSeal certified data repository for long term preservation of
outputs and it would be hard to get such certification. So for this purpose the outputs may also
need to be stored elsewhere, like in Zenodo. What outputs need to be preserved and where?
Should we make use of the new Open Research Europe platform for Horizon2020 outputs?
Goal
The goal of this round table is to identify the stakeholder groups that need access to DiSSCo knowledge
and documentation, to discuss how to organise the knowledge in such a way that optimum access is
provided to these stakeholders, and to identify resources (existing elsewhere or missing) that should
be provided.
Expected outcomes
• A list of identified stakeholders
• Directions for organising the knowledge
• An initial list of resources to include per stakeholder
• Optional: identification of groups of volunteers who will collect and add identified resources
that already exist to the knowledge base
Agenda
• Round of introductions - 15 min
• General presentation - setting the scene
o Introduction - 5 min
o DiSSCo Knowledge Base - 10 min
• Identification of stakeholder groups - 10 min
• Discussion on optimal knowledge organisation for each stakeholder group - 20 min
• Discussion on places where to make knowledge available beyond the knowledge base - 20 min
• Break - 15 min
• Introduction to the breakout session - 5 min
o Breakout working session in 5 subgroups to identify documentation and outputs for
identified stakeholders that need to go into the knowledge base (both existing and not
yet existing documentation). 50 min
o Presentation of subgroup results. 20 min (5x5)
• Identifying next steps and closing - 10 min
Background reading material:
The DiSSCo Knowledge Base - https://dissco.tech/2020/12/18/the-dissco-knowledgebase/
Participants
More than 50 people participated in the workshop. Mentimeter surveys were used for people’s
introduction and to collect expectations.
Introduction of participants using Mentimeter
Introduction to the Knowledgebase
The presentation introducing the DiSSCo Knowledgebase (beta version) is available at:
https://dissco.teamwork.com/#/files/9973261
Identification of stakeholder groups
Mentimeter surveys were used to get feedback on existing knowledge sources as well as on the
added value of the DiSSCo Knowledgebase.
After the Mentimeter surveys the stakeholders were discussed further and whether also internal users
are included in the different groups identified.
Based on the Mentimeter results and discussion it was decided to focus on 5 Stakeholder Groups:
• Collection staff (curators, collections managers & data managers, etc.)
• Researchers & students
• Developers
• Policy makers & funders
• Citizens, citizen scientists & wider public
How to best organize the knowledge for different stakeholders?
• It needs to be clear: What are the boundaries of the knowledgebase?
• The Knowledgebase should behind the scene connect the different e-Services
• To improve organization, collection of questions and usage of semantics to group
information to fit the needs of users
• Should the KB be responsive to user needs? Respond to questions or is this all covered by the
Helpdesk?
• How should we link to the e-Services?
• Would a Community Forum be an added value?
• Possible need of providing (to a certain extent) different access paths to the KB for different
user profiles (e.g. the non-initiated, the insiders, etc.)
• External Users: First information hub could be the Website not the KB
• Are internal users - project members - the biggest user group?

Places where to make knowledge available?
• DSpace implementation, what is needed in addition?
o GitHub
o Training systems/delivery
• DiSSCo website and comms (Binnacle Blog? Social etc.)
• How can we reach different stakeholder groups?
• We do not need support for collaborative work on documentation, but a stable reference
point
• KB could not only be a repository, but a trusted resource for the community, guidance on
how to do things, need for community-resource they can rely on.
AOB
• Handbook on what kind of formats / information could be made available in DSpace, e.g.
HTML/XML/MD and PDF?
• Storage of the editable versions? At the moment using PDFs means we lose access to assets
(data/images/figures).
• Stakeholder Group Publisher: More Information on the Publishing functionality needed, so
we can publish more sustainable and reusable publications, have project assets be a bit more
FAIR - including presentation assets for DiSSCo in general.
Breakout Rooms
Task
Identify documentation and outputs for identified stakeholders that need to go into the knowledge
base (both existing and not yet existing documentation)
If you discuss, please keep in mind the information types identified and presented earlier as well as
systems which could be a home for the kind of knowledge collected for the Stakeholder group discussed.
Breakout Group 1 - Curators, collection & data managers
Summary:
• overview page or “directory” of most relevant services and checklists
• best practices and guidelines for special collections, incl. sampling practices
• workflow descriptions, connecting different workflows within the collections
• use of standards, mapping (collections descriptions), link between physical and digital
specimens
• provide workflows, best practices and training materials in different languages (start English
but aim for multilingual content)
• links to training materials / policy documents incl. legislation
• link to self-assessment tool on Digital maturity of institutions:
o provide information about expertise and strengths of certain institutions
o allow for synergies, e.g. working groups of collections in similar situation, facing
similar challenges (exchange of information and expertise)
Breakout Group 2 - researchers, students, teachers
What documents? | In what form? | Where? | Who?
Public Documents & External Resources (= Literature) | A general repository, different access options and filters related to the user/stakeholder | |
What is DiSSCo? What can DiSSCo provide to me? | “Where to start?” documentation (summarized, accessible language information) | Google it? | every first time user of DiSSCo
Use cases | Visuals, static documents; from researcher point of view: no need to be very interactive | |
Training material | interactive material | |
Software | | |
General comment from the breakout group: especially from the researchers' point of view, the use cases
need to be very visually present. Most researchers are asking the question: how can I benefit from
DiSSCo?
Breakout Group 3 - Developers
General points:
• no code should be stored directly in the KB
• proper metadata should be associated with repository links so that the repositories are
findable and understandable (what does the repository do/provide?)
• GitHub would work as an external resource to the KB
• could potentially pull metadata from GitHub automatically
• KB should store guidelines for both internal and external developers
• the metadata and developer guidelines need to be curated
• KB should serve as a developers interface between DiSSCo, GBIF, ALA, iDigBio, COL, etc.
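As a rough illustration of the point about pulling metadata from GitHub automatically, the Python sketch below maps the JSON that the GitHub REST API returns for a repository onto a small descriptive record for the KB. The target field names and the sample values (description, topics, license) are hypothetical; only the repository name and URL come from this report. No request is sent, so the mapping can be checked offline.

```python
def repo_to_record(repo):
    """Map a decoded GitHub repository object to a simple KB metadata record."""
    # "license" can be null in the API response, so fall back to an empty dict.
    license_info = repo.get("license") or {}
    return {
        "title": repo.get("full_name", ""),
        "description": repo.get("description") or "",
        "url": repo.get("html_url", ""),
        "license": license_info.get("spdx_id", ""),
        "keywords": list(repo.get("topics", [])),
    }


# Hypothetical sample shaped like a repository object from the GitHub REST API.
sample_repo = {
    "full_name": "DiSSCo/kb",
    "html_url": "https://github.com/DiSSCo/kb",
    "description": "Issues and feedback for the DiSSCo Knowledgebase",
    "license": None,
    "topics": ["knowledgebase", "dspace"],
}

record = repo_to_record(sample_repo)
print(record["title"], record["url"])
print(record["keywords"])
```

Records produced this way would still need curation, in line with the group's point that metadata and developer guidelines must be curated.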
Breakout Group 4 - Policy makers, funders
Policy makers can mean those who make or contribute to internal policies and governments/civil
servants/external wider policy makers. Funders could be government (national or international);
quasi-governmental bodies such as research funding bodies; or companies, private individual donors
etc with a wide range of different needs - generally though they will need some form of information
about the costs and impact of what they might fund.
Within this group we have no-one who self-identifies to this user group but many who have some
relevant experience - with internal or external policy makers and bidding to funders. DiSSCo Prepare
tasks on socio-economic indicators will be very relevant to these stakeholders.
We need to be able to find information to convey to policymakers and funders at the right time? They
are unlikely to look for it?
They want to the point summaries / answers to questions.
Could consider new formats of information such as policy briefs on key topics? What would the scope
of these be?
Discussion on the scope of policy requirements for the knowledgebase
Five major topics emerged:
Performance indicators for DiSSCo are very relevant to funders and policymakers. We want reports or
dashboards covering the breadth of DiSSCo activities. They need to answer on progress to the national
governments, but also to the EU or higher international instances, where they need to collect progress
of RIs among others.
This is associated with providing support with evidence for the RI evaluation exercises by ESFRI, which
guidelines and criteria are available at the Public Guide.
Our institutional policies are a key document set for internal policy makers, and for DiSSCo centre. We
need the knowledge base to hold these documents, or at least metadata about them, to help assure
developing policy alignment between DiSSCo institutions (this will underpin the functionality of the
Task 7.3 policy tool), i.e. holds the metadata schema, which could be used to tag/mark up policy
documents to help them be more discoverable and accessible.
Our policy standards and policy metadata schema are central to a classification of our policies and
provide a route to understand their alignment across different institutions, and with the DiSSCo
Policies.
The DiSSCo policies associated with the delivery and needs of DiSSCo Services and the requirements
for DiSSCo to use these services. Ideally these need to be described in the same way as the Institutional
Policies (e.g. via the same metadata schema) to enable us to align institutional and DiSSCo policies.
National, European and International Policies relating to biodiversity (e.g. IPBES Indicators)
addressing large societal challenges associated with our domain.
Breakout Group 5: Other external stakeholders: citizens, research infrastructures, industry
Documentation and Outputs
Identify relevant content for the (external) stakeholders
• Maybe distinguish a bit more the “citizens” according to use cases
• we won’t know all the (future) use cases and stakeholders, thus we need to react flexibly to
user needs.
o compiled information on services and communities that could help / support with
certain use cases (e.g. analytical data)
o How to curate sets of documents? (e.g. a shopping basket in the KB)
o Set of questions to recommend documents to users according to stakeholder groups
• FAQ & glossary
• DiSSCo website vs. DiSSCo KB
o website could deep link to KB
o latest information on the progress (e.g. progress reports) could live in the KB
• What to do with data that is not in the primary scope of DiSSCo?
o How to make available our data?
o How to link?
Increase Discoverability and Relevance for (external) stakeholders
• DiSSCo RI could feed back cross-linked literature etc. to the KB
o maybe ingest results of literature/resource mining in the KB
• based on usage statistics suggest content (e.g. “Other users also searched for…”, “This might
be interesting for you as well…”) based on the queries the users do.
• enrich KB with SEO metadata in order to make knowledge findable via Google and other
search engines (people will search these instead of directly on DiSSCo KB)
• for citizens: tease / highlight interesting content (stories), specimens etc.
o linking infrastructures and story telling could be a by-product from BiCIKL project
o How to curate private collections (held by citizens)? How to digitize them and
contribute?
• put together existing documents and then identify gaps in information types
o results from iDigBio
o and others ...
• include infrastructure studies (socio-technological aspects of data infrastructures) as a
resource
Identifying next steps and closing
• Share content which should be available in the KB
• Suggestion: Board of editors after the project time for new content and curation