Share of open access journal articles
published by Berlin authors from 2019:
data
Martin Hampl¹, Pamela Finke², Michaela Voigt³
May 2021
Published report: M. Kindling, J. Delasalle, P. Finke, M. Hampl, M. Neufend, M. Voigt:
Open-Access-Anteil bei Zeitschriftenartikeln von Wissenschaftlerinnen und Wissenschaftlern an
Einrichtungen des Landes Berlin: Datenauswertung für das Jahr 2019.
DOI: http://dx.doi.org/10.14279/depositonce-11774
Data: The data described here were retrieved from multiple bibliographic databases. Due to
license terms, the raw database data cannot be provided for download. Data were aggregated,
normalized and analyzed with the help of a Python script (https://github.com/tuub/oa-eval, code
documentation in English). Search queries and download settings for these databases are
documented in the (German) manual that accompanies the script. For a detailed description of the
retrieval process and the analysis steps, see the report. Data are distributed under the Creative
Commons Public Domain Dedication (CC0).
DOI: http://dx.doi.org/10.14279/depositonce-11775.
This work is distributed under the Creative Commons Public Domain Dedication (CC0).
You can copy, modify, distribute and perform the work, even for commercial purposes, all
without asking permission.
For more information see https://creativecommons.org/publicdomain/zero/1.0/.
¹ Freie Universität Berlin, Universitätsbibliothek, [email protected], ORCiD: 0000-0002-1887-7148
² Humboldt-Universität zu Berlin, Universitätsbibliothek, pamela.fink[email protected],
ORCiD: 0000-0001-9086-3202
³ Technische Universität Berlin, Universitätsbibliothek, [email protected], ORCiD: 0000-0001-9486-3189
1 General remarks
The overall goal was to analyze the publication output from nine research institutions located
in Berlin (Germany) and determine the share of open access journal articles. Journal articles
whose authors are affiliated with the following nine institutions were analyzed:
Alice Salomon Hochschule (ASH Berlin)
Beuth Hochschule für Technik Berlin (Beuth)
Charité Universitätsmedizin Berlin (Charité)
Freie Universität Berlin (FU Berlin)
Hochschule für Technik und Wirtschaft Berlin (HTW Berlin)
Hochschule für Wirtschaft und Recht Berlin (HWR Berlin)
Humboldt-Universität zu Berlin (HU Berlin)
Technische Universität Berlin (TU Berlin)
Universität der Künste (UdK Berlin)
Data were retrieved from sixteen bibliographic databases: Academic Search Ultimate (EBSCO),
Business Source Complete (via EBSCOhost), CAB Abstracts (via OvidSP), CINAHL (via EBSCOhost),
Embase (via OvidSP), IEEE Xplore, Inspec, Library and Information Science Abstracts (LISA)
(via ProQuest), ProQuest Social Sciences, GeoRef (via ProQuest), PubMed, SciFinder (CA Plus),
Scopus, Sport Discus (via EBSCOhost), TEMA and Web of Science Core Collection.
To identify articles in gold open access journals,¹ the Directory of Open Access Journals (DOAJ)
was used.² To reduce script run time, the API³ provided by DOAJ was not used. Instead,
DOAJ data were downloaded as a comma-separated file⁴ and saved as a tab-delimited file; the
file doaj.txt represents the state of DOAJ metadata as of October 28th, 2020, listing
15,413 open access journals. An article is considered to be gold OA if its journal is listed
in DOAJ.
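The ISSN lookup against the local DOAJ file could be sketched roughly as follows. This is only an illustration and not the code of the oa-eval script: the column names `issn_print` and `issn_online`, the helper names and the sample ISSNs are all assumptions, and the year check mentioned in tab. 1 is not modelled.

```python
import csv
import io

# Hypothetical column names; the real DOAJ export may label them differently.
PRINT_ISSN_COL = "issn_print"
ONLINE_ISSN_COL = "issn_online"


def normalize_issn(raw):
    """Return an ISSN as 'NNNN-NNNX', or None if the value is not usable."""
    cleaned = (raw or "").strip().upper().replace("-", "")
    if len(cleaned) != 8:
        return None
    return cleaned[:4] + "-" + cleaned[4:]


def load_doaj_issns(handle):
    """Collect every print and online ISSN from a tab-delimited DOAJ dump."""
    issns = set()
    for row in csv.DictReader(handle, delimiter="\t"):
        for column in (PRINT_ISSN_COL, ONLINE_ISSN_COL):
            issn = normalize_issn(row.get(column))
            if issn:
                issns.add(issn)
    return issns


def is_gold_oa(article_issns, doaj_issns):
    """An article counts as gold OA if any of its ISSNs is listed in DOAJ."""
    return any(normalize_issn(issn) in doaj_issns for issn in article_issns)


# Tiny in-memory stand-in for doaj.txt (made-up ISSNs, for illustration only).
sample = io.StringIO(
    "title\tissn_print\tissn_online\n"
    "Example Journal A\t1234-5678\t\n"
    "Example Journal B\t\t8765432x\n"
)
doaj_issns = load_doaj_issns(sample)
```

Loading all ISSNs into a set once keeps each article lookup cheap, which fits the stated aim of avoiding per-article API calls.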
To identify open access articles in hybrid journals,⁵ a combination of data retrieved from the
Unpaywall API⁶ and the Crossref API⁷ was used (November 2020). Unpaywall data were checked
for OA status, host type (publisher or repository) of the detected OA version, and license
information; Crossref data were checked for license information. An article is considered to be
hybrid OA if a Creative Commons licensed version is accessible via the publisher website.
¹ An open access journal publishes open access articles, i.e. all published articles are openly
available on the publisher’s website, without charge or delay.
² The analysis does not rely on Unpaywall data (‘Unpaywall[journal_is_oa]’) for detection of gold
OA articles because 1) it would leave articles without a DOI undetected and 2) samples showed
that Unpaywall data are incomplete on journal OA status.
³ DOAJ metadata: API https://doaj.org/api/v2/docs
⁴ DOAJ metadata: CSV file https://doaj.org/csv
⁵ A hybrid (open access) journal publishes both closed access and open access articles. It is
operated under a subscription business model with the (fee-based) option to make single articles
open access.
⁶ http://api.unpaywall.org/v2/
⁷ http://api.crossref.org/works/
To identify green open access articles, data from Unpaywall were used. An article is considered
to be green OA if it is detected as neither gold OA nor hybrid OA and Unpaywall
detected at least one OA version in a repository.
Tab. 1 shows which values were included to determine the open access status.
Table 1: Detection of OA status
OA status    Note
Gold OA      DOAJ data as of October 28th, 2020 (ISSN + year lookup)
Hybrid OA    according to Unpaywall (as of November 9th, 2020) and Crossref data (as of
             November 12th, 2020)
             following values must apply:
               ‘Unpaywall[is_oa]’ = TRUE,
               ‘Unpaywall[host_type]’ = ‘publisher’,
               ‘license’ = ‘CC*’
             OR
               ‘OA status’ != gold,
               ‘license’ = ‘CC*’
Green OA     according to Unpaywall data as of November 9th, 2020
             entries in ‘Unpaywall[OA Repos]’ were searched manually for ‘.com’; if Unpaywall
             incorrectly assigned ‘host_type’ = ‘repository’, the respective entry was
             corrected (changed to ‘OA status’ = closed)
             following values must apply:
               ‘OA status’ != gold OR hybrid,
               ‘Unpaywall[is_oa]’ = TRUE,
               at least one entry in ‘Unpaywall[OA Repos]’
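The rules in tab. 1 can be read as a small decision procedure. The sketch below is one possible reading and not the oa-eval implementation. The function and parameter names are hypothetical. It assumes that the first ‘license’ condition refers to the Unpaywall license and the second to the Crossref license, and that license values are either short codes such as `cc-by` or Creative Commons URLs.

```python
def is_cc_license(license_value):
    """True for values such as 'cc-by' or a creativecommons.org URL (assumed formats)."""
    if not license_value:
        return False
    value = license_value.strip().lower()
    return value.startswith("cc") or "creativecommons.org" in value


def classify_oa_status(in_doaj, upw_is_oa, upw_host_type, upw_license,
                       crossref_license, repo_urls):
    """Apply the rules in the order gold, hybrid, green, closed."""
    if in_doaj:
        return "gold"
    # First condition set: OA copy on the publisher site with a CC license (Unpaywall).
    hybrid_via_unpaywall = (
        bool(upw_is_oa) and upw_host_type == "publisher" and is_cc_license(upw_license)
    )
    # Second condition set: not gold, and a CC license (assumed here to come from Crossref).
    hybrid_via_crossref = is_cc_license(crossref_license)
    if hybrid_via_unpaywall or hybrid_via_crossref:
        return "hybrid"
    # Green: neither gold nor hybrid, Unpaywall reports OA, at least one repository copy.
    if upw_is_oa and len(repo_urls) > 0:
        return "green"
    return "closed"


def flag_for_review(repo_urls):
    """Return repository entries containing '.com' so they can be checked by hand."""
    return [url for url in repo_urls if ".com" in url]
```

Under this reading, an article that Unpaywall reports as OA on the publisher site but without a CC license and without a repository copy is classified as closed. The manual correction step described for green OA is only flagged here, not automated.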
Data on APCs for open access journals were retrieved from DOAJ (as of October 28th, 2020);
the costs were not verified manually. Since value-added taxes vary by country, publishers
usually list costs excluding VAT. The APCs listed here do not include VAT.
To determine exchange rates we consulted http://www.xe.com: exchange rates were retrieved
for the beginning of 2019 (January 1st, 2019) and for the date of analysis
(November 11th, 2020).
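Applying one of the two rate sets to APCs listed in different currencies might look like the sketch below. It assumes that amounts are converted to EUR, which the text does not state. The currency code `XXA` and all rate values are placeholders chosen for illustration, not rates taken from xe.com, and the function name is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

# Placeholder rates (EUR per one unit of currency); NOT real exchange rates.
RATES_TO_EUR = {
    "2019-01-01": {"EUR": Decimal("1"), "XXA": Decimal("0.50")},
    "2020-11-11": {"EUR": Decimal("1"), "XXA": Decimal("0.40")},
}


def apc_in_eur(amount, currency, rate_date):
    """Convert an APC (excluding VAT) to EUR using the rate set for the given date."""
    rate = RATES_TO_EUR[rate_date][currency]
    converted = Decimal(str(amount)) * rate
    # Round to whole cents.
    return converted.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
```

Keeping both rate sets side by side makes it easy to compare totals at the start of the publication year with totals at the date of analysis.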
2 Bibliographic data
Data were analyzed with regard to the following questions:
How many journal articles did Berlin-based researchers publish in 2019?
How many of these articles were published in open access journals (gold OA)?
How many of these articles have a Berlin-based corresponding author; in other words, for
how many articles did a Berlin-based author (or their institution) most likely cover
the open access fee (Article Processing Charge, APC)?
How many open access articles did researchers from Berlin publish in hybrid journals
(hybrid OA)?
How many articles from Berlin-based researchers are available via a repository as green
open access (green OA)?
For a list of available files, see tab. 2. For a list of bibliographic data available in the
file containing article data, see tab. 3.
Table 2: Overview of files
File name                             Note
OABerlin2019_data.xlsx                script output: list of articles (12,479 items)
OABerlin2019_data.csv                 script output: list of articles (12,479 items) in
                                      comma-separated format (UTF-8 encoded)
OABerlin2019_data_repositories.xlsx   list of OA versions on repositories (17,760 entries)
OABerlin2019_data_repositories.csv    list of OA versions on repositories (17,760 entries) in
                                      comma-separated format (UTF-8 encoded)
OABerlin2019_data_results.xlsx        detailed results (pivot tables)
DOAJ.txt                              script input: DOAJ metadata (tab-delimited file)
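The questions listed above come down to counting articles per OA status in the article list. A minimal sketch using only the standard library is shown below. It assumes the CSV header uses the field name ‘OA status’ as given in tab. 3 (the exact header label in the file is an assumption), and the function name and sample rows are invented for illustration.

```python
import csv
import io
from collections import Counter


def oa_shares(handle, status_column="OA status"):
    """Return the share of articles per OA status as fractions of all rows."""
    counts = Counter(
        row[status_column].strip().lower() for row in csv.DictReader(handle)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {status: count / total for status, count in counts.items()}


# In-memory stand-in for the article list (invented rows, not real data).
sample = io.StringIO(
    "index,OA status\n"
    "1,gold\n"
    "2,closed\n"
    "3,green\n"
    "4,gold\n"
)
shares = oa_shares(sample)
```

For the published file, the same function could be called with a handle such as `open("OABerlin2019_data.csv", encoding="utf-8", newline="")`.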
Table 3: Bibliographic data
Field name       Source                     Note
index            manually                   Article ID
authors          databases                  string trimmed if field length exceeds 200 characters
title            databases                  title as indexed as main title in databases; for
                                            non-English articles title might be translated to English
OA status        manually                   type of OA (gold, hybrid, green or closed); see tab. 1
                                            for overview on which values were included to determine
                                            OA status
DOI              databases, manually        if available; DOIs added and/or corrected manually
doiRA            DOI Foundation             name of the responsible DOI Registry Agency (which agency
                                            minted the DOI); the note ‘DOI does not exist’ indicates
                                            that the databases included a DOI but it was not
                                            registered correctly
journal          databases                  if available
ISSN             databases                  if available; could be ISSN for either print or
                                            electronic edition (print ISSN most likely)
eISSN            databases                  if available; could be ISSN for either print or
                                            electronic edition (electronic ISSN most likely)
publisher        databases, DOAJ, Crossref  missing publisher information added manually if
                 (data refined), manually   retrievable; publisher names normalized
publisher group  manually (data refined)    cluster publisher syndicates: publisher group based on
                                            column publisher
                                            Wiley: American Geophysical Union (AGU); International
                                            Union of Crystallography (IUCr); John Wiley and Sons;
                                            Wiley; Wiley-Blackwell; Wiley-VCH; GIT-Verlag
                                            Springer Nature: Nature Publishing Group; Springer;
                                            Springer Healthcare; Springer Heidelberg; Springer
                                            International Publishing; Springer Medizin; Springer
                                            Nature; Springer New York; Springer Singapore; Springer
                                            Netherlands; Springer-VDI-Verlag; BioMed Central (BMC)
                                            Wolters Kluwer: Medknow Publications; Ovid Technologies
                                            (Wolters Kluwer Health); Wolters Kluwer; Lippincott,
                                            Williams and Wilkins
                                            IOP Publishing: IOP Publishing; American Astronomical
                                            Society; Japan Society of Applied Physics