scieee Science in your language
[en] (orig)
Nucleic Acids Research, 2018 1
doi: 10.1093/nar/gky1183
Updating genome annotation for the microbial cell
factory
Aspergillus niger
using gene co-expression
networks
P. Sch¨
ape,M.J.Kwon
, B. Baumann, B. Gutschmann, S. Jung, S. Lenz , B. Nitsche,
N. Paege, T. Sch¨
utze, T.C. Cairns and V. Meyer *
Department of Applied and Molecular Microbiology, Institute of Biotechnology, Technische Universit¨
at Berlin,
Gustav-Meyer-Allee 25, 13355 Berlin, Germany
Received August 24, 2018; Revised October 31, 2018; Editorial Decision November 01, 2018; Accepted November 27, 2018
ABSTRACT
A significant challenge in our understanding of bio-
logical systems is the high number of genes with un-
known function in many genomes. The fungal genus
Aspergillus
contains important pathogens of hu-
mans, model organisms, and microbial cell factories.
Aspergillus niger
is used to produce organic acids,
proteins, and is a promising source of new bioactive
secondary metabolites. Out of the 14,165 open read-
ing frames predicted in the
A. niger
genome only 2%
have been experimentally verified and over 6,000 are
hypothetical. Here, we show that gene co-expression
network analysis can be used to overcome this limi-
tation. A meta-analysis of 155 transcriptomics exper-
iments generated co-expression networks for 9,579
genes (65%)ofthe
A. niger
genome. By populating
this dataset with over 1,200 gene functional exper-
iments from the genus
Aspergillus
and performing
gene ontology enrichment, we could infer biologi-
cal processes for 9,263 of
A. niger
genes, includ-
ing 2,970 hypothetical genes. Experimental valida-
tion of selected co-expression sub-networks uncov-
ered four transcription factors involved in secondary
metabolite synthesis, which were used to activate
production of multiple natural products. This study
constitutes a significant step towards systems-level
understanding of
A. niger
, and the datasets can be
used to fuel discoveries of model systems, fungal
pathogens, and biotechnology.
INTRODUCTION
The genus Aspergillus (phylum Ascomycota) is comprised
of nearly 350 species of saprophytic and ubiquitous fungi,
and includes important pathogens of humans (Aspergillus
fumigatus), model organisms (Aspergillus nidulans) and mi-
crobial cell factories (Aspergillus oryzae,Aspergillus niger).
Aspergillus niger has been exploited for over a century by
biotechnologists for the production of organic acids, pro-
teins and enzymes (1). It is the major worldwide producer
of citric acid with an estimated value of $2.6 billion in
2014, which is predicted to rise to $3.6 billion by 2020 (2).
As a prolific secretor of proteins, A. niger is used to pro-
duce various enzymes at a bulk scale (1). The first A. niger
genome was sequenced in 2007, which contained an esti-
mated 14,165 coding genes 6,000 of which were hypo-
thetical (3). More recent sequencing of additional A. niger
strains and other genomes from the genus Aspergillus (4–
9), combined with refinement of online genome analyses
portals (10–12), and comparative genomic studies amongst
the Aspergilli (5,13–15), have not significantly increased the
percentage of A. niger genes that have functional predic-
tions. While the exact number of ‘hypothetical’ genes varies
between databases and A. niger genomes, recent estimates
suggest that between 40 and 50% of the genes still remain
hypothetical (1,16). Furthermore, only 2% of its genes (n
=247) have a verified function in the Aspergillus Genome
Database (AspGD (17)). Even for the gold-standard model
organism Saccharomyces cerevisiae, 21% of its predicted
genes have dubious functional predictions (18), despite its
high genetic tractability and a research community with
>1,800 research labs worldwide. Indeed, gene functional
predictions for A. nidulans,A. fumigatus,A.oryze and other
Aspergilli typically cover 40–50% of the genome (16,17).
Such high frequency of unknown and hypothetical genes
severely limits the power of systems-level analyses. One ap-
proach to overcome this limitation involves the generation
and interrogation of gene expression networks based on
transcriptomic datasets (19–21). The hypothesis underlying
this approach is that genes which are robustly co-expressed
under diverse conditions are likely to function in the same
or closely related biological processes or pathways (22).
*To whom correspondence should be addressed. Tel: +49 30 314 72750; Fax: +49 30 314 72922; Email: vera.meyer@tu-berlin.de
The authors wish it to be known that, in their opinion, the first two authors should be regarded as Joint First Authors.
C
The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which
permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
2Nucleic Acids Research, 2018
As one example, accurate delineation of fungal secondary
metabolite biosynthetic gene clusters has been achieved
by robustly defining contiguous gene co-expression dur-
ing both in vitro (23) and infectious (24) growth, with co-
expression analysis pipelines now publicly available to non-
coders (25).
In this study, we conducted a meta-analysis of 155 pub-
licly available transcriptomics analyses for A. niger,and
used these data to generate a genome-level co-expression
network and sub-networks for >9,500 genes. To aid user in-
terpretations of gene biological process, gene sub-networks
were analysed for enriched gene ontology (GO) terms,
and integrated with information gleaned from 1,200 val-
idated genes from the genus Aspergillus. Interrogation of
selected co-expression sub-networks for verified genes and
randomly selected hypothetical genes confirmed high qual-
ity datasets that enable rapid and facile predictions of bio-
logical processes. This co-expression resource has been in-
tegrated in the functional genomic database FungiDB (10)
for use by the research community.
In order to validate that novel predictions of gene bio-
logical function were accurate, we functionally character-
ized genes which we hypothesized played a role in A. niger
natural product synthesis based on co-expression datasets.
Experimental validation included generation of null and
overexpression mutants of transcription factors present in
these sub-networks, controlled bioreactor cultivations, and
activation of secondary metabolite gene expression and
metabolite biosynthesis. In addition to demonstrating that
this study enables novel predictions of A. niger gene func-
tion, these data suggest novel mechanisms for activating
cryptic secondary metabolism can be used in natural prod-
uct discovery programs, which is urgent due to the emer-
gence of multi-resistant bacteria and fungi (26,27). As A.
niger has been shown to be a superior expression host for
medicinal drugs in g/l scale (28), such discoveries have
significant translational potential. Taken together, the co-
expression resources and experimental validation devel-
oped in this study enable high quality gene functional pre-
dictions in A. niger.
MATERIALS AND METHODS
Strains and molecular techniques
Aspergillus niger strains used in this study are summarized
in Supplementary Table S1. Media compositions, transfor-
mation of A. niger, strain purification and fungal chromo-
somal DNA isolation were described earlier (29). Standard
PCR and cloning procedures were used for the generation of
all constructs (30) and all cloned fragments were confirmed
by DNA sequencing. Correct integrations of constructs in
A. niger were verified by Southern analysis (30). In the case
of overexpressing TF1, TF2 and HD, the respective open
reading frames were cloned into the Tet-on vector pVG2.2
(31) and the resulting plasmids integrated as single or mul-
tiple copies at the pyrG locus (for details see Supplementary
Table S1). Deletion constructs were made by PCR amplifi-
cation of the 5- and 3-flanks of the respective open reading
frames (at least 0.9 kb long). N402 genomic DNA served
as template DNA. The histidine selection marker (32)was
used for selecting single deletion strains, whereas the pyrG
marker was used for the establishment of the strain deleted
in both TF1 and TF2. Details on cloning protocols, primers
used and Southern blot results can be requested from the
authors.
Gene network analysis and quality control
As of 2 March 2016, 283 microarray data (platform:
GPL6758) of A. niger covering 155 different cultivation con-
ditions were publically available at the GEO database (33),
whose processing and normalization of the arrays have been
published (34). In brief, array data in the form of CEL-files
(35) were processed using the Affymetrix analysis package
(35) (version 1.42.1) from Bioconductor (36) and expression
data were calculated for genes under each condition with
an MAS5 background correction. Pairwise correlations of
gene expression between all A. niger genes were generated
by calculating the Spearmans rank correlation coefficient
(37) using R. To assess a cut-off indicating biological rele-
vance, Spearman correlations were firstly calculated using
a pseudo random data set whereby normalized transcript
values for each individual gene were randomized amongst
the 283 arrays and 155 experimental conditions. Using this
pseudo random data set, the Spearmans rank coefficient
was calculated pairwise for all predicted A. niger genes, giv-
ing a total of 104,958,315 comparisons/calculated Spear-
mans rank coefficients, from which 52,476,536 were pos-
itively correlated. From these, only two were greater than
|0.4|and none above |0.5|. Subsequently |0.5|was taken as a
threshold for co-expression. Sub-networks were calculated
at an individual gene level using Python. All genes that were
co-expressed with individual query ORFs are reported at
FungiDB (10) and summarized in Supplementary File 1 us-
ing a |0.5|and |0.7|Spearman cut-off.
To expedite investigation of the sub-networks and their
common biological process, gene ontology enrichment
(GOE) was implemented using Python version 2.7.13. GO
terms and their hierarchical structure were downloaded
from AspGD (17). Enriched GO Biological Process terms
for all genes residing in a query sub-network were calculated
relative to the A. niger genome and statistical significance
was defined using the Fishers exact test (P-value <0.05).
For all datasets now available at FungiDB, GO enrichment
is conducted using Fisher’s exact test and Bonferroni cor-
rections (10). For informant ORFs, experimentally verified
A. niger genes were retrieved from AspGD (17). Addition-
ally, A. niger orthologs for any gene with wet lab verification
in A. fumigatus,A. nidulans or A. oryzae were identified us-
ing the ENSEMBL BLAST tool using default settings (12).
Finally, 81 secondary metabolite core enzymes (39,40)were
also defined as informant ORFs. We generated informant
ORF (‘prioritized ORF’) sub-networks, which report sig-
nificant co-expression of query genes exclusively with one
or more informant ORFs which are reported in Supplemen-
tary Table S2.
Reporter gene expression
Protocols for luciferase-based measurement of gene expres-
sion in microtiter format based on Tet-on (31)oranafp (34)
promoter systems have been published. In case of strain
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
Nucleic Acids Research, 2018 3
BBA17.6, unable to form spores, the strain was inoculated
on complete medium and allowed to grow for 7 days at
30C. Biomass was harvested using physiological salt solu-
tion and used for inoculation. All data shown are derived
from biological duplicates each measured in technical qua-
druplicates if not otherwise indicated. Raw datasets can be
requested from the authors.
Bioreactor cultivation
Medium composition and the protocol for glucose-limited
batch cultivation of A. niger in 5 l bioreactors have been
described (31). In the case of strains overexpressing TF1
(strain MJK10.22) and TF2 (strain MJK11.17), the Tet-
on system was induced with a final concentration of 10
g/ml doxycycline when the culture reached 1 g/kg dry
biomass. Samples for transcriptional and metabolome pro-
filing were taken 6h(72 h) after induction, i.e. dur-
ing exponential (post-exponential) growth phase. For con-
trol (MJK17.25) and deletion strains (MJK14.7, MJK16.5,
MJK18.1), doxycycline was added twice; once after the cul-
ture reached 1 g/kg dry biomass and 24 h before samples
were taken from post-exponential growth phase for tran-
scriptomics and metabolomics analyses. To avoid modifica-
tions in gene expression due to degradation of doxycycline
in growth media, 10 g/ml doxycycline was added every
10–12 h (five times in total) during growth of conditional
expression mutants. Growth and physiology profiles for all
seven strains cultivated in biological duplicates are summa-
rized in Supplementary File 4.
Transcriptional profiling
Total RNA extraction, RNA quality control, and RNA
sequencing were performed at GenomeScan (Leiden, the
Netherlands). Quality analysis of raw data was done as
previously described (41). In brief, 13 million reads of
150 bp were obtained from paired-end mode for each
sample. Read data were trimmed and quality controlled
with FastQC (http://www.bioinformatics.babraham.ac.uk/
projects/fastqc/). STAR (42) was used to map the reads
to the A. niger CBS 513.88 genome (http://fungi.ensembl.
org/). On average, the unique alignment rate was 95%.
Data normalization was performed with DEseq2 (43). Dif-
ferential gene expression was evaluated with Wald test with
a threshold of the Benjamini and Hochberg False Discovery
Rate (FDR) of 0.05 (44) with DEseq2. Raw and processed
data are summarized in Supplementary Table S3 and have
been deposited at the GEO database (33) under the acces-
sion number GSE119311.
Metabolome profiling
Metabolites were extracted from biomass corresponding
to 2.5 mg biomass dry weight by Metabolomic Discov-
eries GmbH (Potsdam, Germany) and identified based
on Metabolomic Discoveries’ database entries of authen-
tic standards. Liquid chromatography (LC) separation was
performed using hydrophilic interaction chromatography
with a iHILIC®-Fusion, 150 ×2.1 mm, 5 m, 200 ˚
A5
m, 200 A column (HILICON, Ume˚
a Sweden), operated
by an Agilent 1290 UPLC system (Agilent, Santa Clara,
USA). The LC mobile phase was (i) 10 mM ammonium ac-
etate (Sigma-Aldrich, USA) in water (Thermo, USA) with
5% and 95% acetonitrile (Thermo, USA) (pH 6) and (ii)
acetonitrile with 5% 10 mM ammonium acetate in 95%
water. The LC mobile phase was a linear gradient from
95% to 65% acetonitrile over 8.5 min, followed by linear
gradient from 65% to 5% acetonitrile over 1 min, 2.5 min
wash with 5% and 3 min re-equilibration with 95% ace-
tonitrile. The flow rate was 400 l/min and injection vol-
ume was 1 l. Mass spectrometry was performed using a
high-resolution 6540 QTOF/MS Detector (Agilent, Santa
Clara, USA) with a mass accuracy of <2 ppm. Spectra
were recorded in a mass range from 50 m/zto 1700 m/z
at 2 GHz in extended dynamic range in both positive and
negative ionization mode. The measured metabolite con-
centrations were normalized to the internal standard. Sig-
nificant concentration changes of metabolites in different
samples were analyzed by appropriate statistical test proce-
dures (ANOVA, paired t-test) using R. When the adjusted
Pvalue based on Benjamini and Hochberg FDR (44)was
lower than 0.05 and the fold change (log2) higher than ±1,
expression of the metabolites was considered as significantly
different.
RESULTS
The transcriptomic landscape of A. niger inferred from a gene
expression meta-analysis
We normalized and interrogated gene expression across 155
published transcriptomic analyses for the A. niger labora-
tory wildtype strain N402 (ATCC 64974) and its descen-
dants, comprising of 283 Affymetrix microarray experi-
ments in total (34). Experimental parameters include a di-
verse range of cultivation conditions (agar plate, bioreac-
tor, shake flask), developmental and morphological stages
(germination, mycelial growth, sporulation), deletion and
disruption mutants, stress conditions (antifungals, secretion
stress, pH), different carbon and nitrogen sources, starva-
tion, and co-cultivation with bacteria. These experimental
conditions represent diverse niches inhabited by A. niger
as well as industrial cultivation conditions, in addition to
(a)biotic and genetic perturbations that result in global
changes in gene expression.
In order to demonstrate that accurate values of transcript
abundance were derived from this meta-analysis, we plotted
average gene expression values for each gene throughout the
155 conditions as a function of chromosomal locus (Fig-
ure 1A). From these data, we categorized low, medium, and
highly expressed loci. Subsequently, we generated a DNA
cassette expressing a luciferase reporter gene under con-
trol of the inducible Tet-on promoter (31) and targeted it
to the 5-upstream region of two low and one high expres-
sion locus (Figure 1B). The pyrG locus present on chro-
mosome III, routinely used for gene-targeted integration in
A. niger, served as locus control for Tet-on driven medium
expression of luciferase (31). Luciferase levels measured at
these loci were confirmed to be low, medium and high in
relative terms in microtiter cultivations of the different A.
niger strains (Figure 1C) and in controlled batch cultiva-
tion at bioreactor scale (Figure 1D). These data demon-
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
4Nucleic Acids Research, 2018
Figure 1. The transcriptomic landscape of A. niger.(A) Chromosomal expression values expressed as log2 mean from 155 cultivation conditions. Blue
arrows indicated centromere position on each chromosome (3). (B) Selected chromosomal positions in chromosome III and VII for in vivo verification.
Red arrow indicates position of chosen integration site at high level expression locus, and grey arrows indicate positions of chosen integration sites at low
level expression loci. (C) Data for luciferase activity measured by luminescence in counts per second (cps) over time after induction in hours in microtiter
plates. Data for strains expressing luciferase at the locus An11g08480 (‘low gene expression locus’; strain BG1.1), An11g07580 (‘low gene expression locus’;
strain BG3.1), An12g03570 (‘medium gene expression locus’, pyrG, strain VG8.27), and An11g11310 (‘high gene expression locus’; strain BG6.14) are given.
Doxycycline was added 16 h after inoculation. (D) Cultivation of strain BG1.1 and BG6.14 in 5L bioreactors. For each strain, two batch cultivations were
performed; one was induced with 5 g/ml doxycycline at a dry biomass concentration of 2 g/kg. Cultivation in the presence of doxycycline mediated gene
expression as demonstrated by increased luminescence after induction measured in technical quintuplicate.
strate that low, medium, or high expression at these loci are
applicable for both high-throughput assays (microtiter) and
more labour and resource intensive bioreactors cultivations.
We conclude that the microarray data accurately reflect A.
niger gene expression values. Note that this transcriptomic
landscape is a significant addition to the A. niger molecu-
lar toolkit, as it facilitates rational control of gene dosage
(time of induction and absolute expression level) by targeted
locus-specific integration of a gene of interest.
Construction of high quality co-expression networks for A.
niger
Experimentally validated gene expression data from the A.
niger transcriptional meta-analysis was utilized to generate
a gene co-expression network based on Spearmans rank
correlation coefficient (37). In order to define a minimum
Spearmans rank correlation coefficient () for which we
could be confident in extracting biologically meaningful co-
expression, we conducted a preliminary quality control ex-
periment, where transcript values for each individual gene
were randomized amongst the 155 experimental conditions.
This gave a dataset with identically distributed but random-
ized expression patterns. Next, we calculated every possible
transcriptional correlation between genes on the A. niger
genome, resulting in over 100 million -values. This identi-
fied 52 million positive and 48 million negative correlations
(Figure 2). From this dataset, only two -values were above
|0.4|, and none were above |0.5|. Consequently, we took
|0.5|as a minimum cut-off for biologically meaningful co-
expression relationships. Calculations of Spearman corre-
lations using the non-randomized microarray data resulted
in over 4.5 million correlations which passed the minimum
|0.5|cut-off. From these datasets, co-expression sub-
networks for every gene in the global network were gen-
erated for both positively and negatively correlated genes
(Figure 2, Supplementary File 1). We classified them into
two groups: ‘stringent’ (|0.5|, 9,579 gene networks) and
‘highly stringent’ (|0.7|, 6,305 gene networks) and calcu-
lated enriched GO terms for each gene sub-network relative
to the A. niger genome.
Integration of co-expression networks with community-wide
experimental evidence of gene function
The Aspergillus community has functionally characterized
over a thousand genes in different species of the genus As-
pergillus, which we reasoned can be used to aid apriori
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
Nucleic Acids Research, 2018 5
Figure 2. Workflow to generate gene co-expression resources. (A) Various transcriptional signatures across 155 cultivation conditions of A. niger obtained
from 283 microarrays are schematically represented. Spearmans rank correlation coefficients were calculated pairwise between all predicted genesinthe
A. niger genome. Over 96 million correlations between gene pairs were not defined as co-expressed based on the |0.5|cut-off (black and orange lines),
whereas 2.7 million were positively correlated (black and green lines) and 1.8 million were negatively correlated (black and red lines). These >4.5 million
significant co-expression relationships are visualized as a network in (B), with genes shown as grey squares, and correlations as lines. The length of each
line is proportional to the Spearman correlation between gene pairs. Genes predicted to encode transcription factors (3) are highlighted in blue. For each
individual gene in the network (13,975 in total; this number deviates from 14,165 reported in 2007 (3), because we have omitted truncated ORFs reported
there from our analysis), sub-networks were calculated which report all significant correlations with the query gene of |0.5|(stringent dataset) and of
|0.7|(very stringent dataset). Both sub-networks were analyzed for significantly enriched GO terms (biological processes). Additionally, we highlight
co-expression between query genes and ‘informant ORFs’, which have either been experimentally validated in the Aspergilli, or are predicted to encode
key secondary metabolite biosynthetic enzymes (C).
predictions of hypothetical genes or not yet verified genes
in A. niger. In order to integrate such experimental data
with the co-expression network, we mined the Aspergillus
genome database AspGD (17) to generate a near-complete
list of ORFs that have been functionally characterized in
Aspergilli. All experimentally validated ORFs for A. niger
(n=247), A. nidulans (n =639), A. fumigatus (n =218) and
A. oryzae (n =81) were included in this dataset. Given the
strong potential of A. niger as a platform for discovery and
production of new bioactive molecules, we also included 81
putative polyketide synthase (PKS) or nonribosomal pep-
tide synthetase (NRPS) encoding genes of A. niger that re-
side in 78 predicted secondary metabolite clusters (39,40),
giving in total 1,266 prioritized ORFs. For every gene in the
A. niger genome, we calculated co-expression interactions
specifically with these 1,266 rationally prioritized ORFs. A
total of 9,263 (|0.5|) and 5,178 (|0.7|) candidate
genes had one or more correlations with prioritized ORFs
(Supplementary Table S2). These datasets thus constitute
the most comprehensive co-expression resource for a fila-
mentous fungus and are accessible at FungiDB (10).
Co-expression resources enable facile predictions of gene bi-
ological function
In order to test whether biologically meaningful interpreta-
tions of gene and network function can be extracted from
these resources, we interrogated both stringent and highly
stringent datasets for genes where the biological processes,
molecular function, and subcellular localization of encoded
proteins have been studied in fungi and which represent
the broad range of utilities and challenges posed by fungi.
From the perspective of industrial biotechnology, we inter-
rogated networks for the gene encoding the ATPase BipA,
which is required for high secretion yield of industrially use-
ful enzymes by acting as chaperone to mediate protein fold-
ing in the endoplasmic reticulum (45). With regards to po-
tential drug target discovery, we analyzed gene expression
networks for Erg11 (Cyp51), which is the molecular target
for azoles (46). For assessment of virulence in both plant
and human infecting fungi, we interrogated networks for
the NRPS SidD, which is necessary for the biosynthesis of
the siderophore triacetyl fusarinine C, and ultimately iron
acquisition during infection (47). Assessment of all con-
trol sub-networks at GO and individual gene-level revealed
striking co-expression of genes encoding proteins involved
in respective metabolic pathways, associated biological pro-
cesses, subcellular organelles, protein complexes, known
regulatory transcription factors/GTPases/chaperones, and
cognate transporters, amongst others (Figure 3). The lowest
Spearman correlation coefficient of |0.5|clearly results in bi-
ologically meaningful gene co-expression as exemplified by
the delineation of diverse yet related processes, including:
(i) orchestration of retrograde/anterograde vesicle traffick-
ing via COPI/COPII/secretion associated proteins (BipA)
(48); (ii) coordination of ergosterol biosynthesis by sterol
regulatory binding element regulators SrbA/SrbB and as-
sociation of this pathway with respiration at the mitochon-
drial membrane (Erg11) (49) and (iii) and the linking of
respective ergosterol and ornithine primary and secondary
metabolic pathways during siderophore biosynthesis via the
interdependent metabolite mevalonate (SidD) (50). With re-
gards to co-expressed genes as a function of chromosomal
location, a common feature of filamentous fungal genomes
is that genes necessary for the biosynthesis of secondary
metabolite products occur in physically linked contiguous
clusters. SidD resides in a six-gene cluster with SidJ, SidF,
SidH, SitT and MirD, all of which were represented in the
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
6Nucleic Acids Research, 2018
Figure 3. Schematic representation of co-expression networks for BipA, SidD, and Erg11 encoding genes. Genes are represented by circles, with positive
and negative correlations depicted by grey and red lines, respectively. For simplicity, protein names are given in the centre of each circle. Where names
were not available in A. niger, we used the name for either A. nidulans or S. cerevisiae ortholog. Query genes encoding BipA, SidD, or Erg11 are given in
black diamond boxes. Co-expression sub-networks are given for gene pairs passing |0.7|(A) and |0.5|(B) Spearman correlation coefficient cut-offs. Both
sub-networks were assessed for enriched GO terms, and genes were manually filtered into functional categories based on these analyses and interrogation
of research literature (C). Note that in each instance, hypothetical proteins were co-expressed with each query gene, indicating the encoded products are
somehow associated with these biological processes.
high stringency network that contained a total of only 13
genes (Figure 3).
Based on enriched GO terms from gene sub-networks
and co-expression with experimentally verified ORFs, we
could further rapidly infer biological processes for a total
of 2,970 (|0.5|) and 1,016 (|0.7|) hypothetical genes
that were positively and/or negatively associated using this
analysis as exemplarily shown for eight hypothetical genes
in Supplementary Table S4. Additionally, we interrogated
entire families of functionally related genes that have been
well characterized in the Aspergilli, including phosphatases,
chromatin remodelers, and transcription factors and were
able to assign novel biological processes for all these pre-
dicted genes as exemplarily shown for nine genes in Supple-
mentary Table S5.
Additionally, in order to confirm that hypotheses re-
garding gene function and co-expression networks were
independent of the array platform and/or strain utilised,
we compared data derived from the microarray used in
our study (specifically the Affymetrix technology for A.
niger isolate CBS 513.88), with a separate tri-species plat-
form for strain ATCC 1015 (51). This latter microarray
has been used to concomitantly compare batch cultiva-
tions of A. niger,A. oryzae and A. nidulans on glucose
and xylose media, which identified 23 genes to be a con-
served response to xylose utilisation, including the xylose
transcriptional regulator XlnR (An15g05810, (51)). De-
spite differences in strain, microarray platform, and ex-
perimental design, interrogation of the XlnR sub-network
from our study revealed strong concordance with the con-
served xylose response genes reported by Andersen et
al.(51), including those encoding an l-arabitol dehydro-
genase (An01g10920), aldose 1-epimerase (An02g09090),
glycoside hydrolase (An12g01850), xylitol dehydrogenase
(An12g00030), sugar transporter (An03g01620) and short-
chain dehydrogenase (An04g03530). We therefore conclude
that sub-networks generated in our study can be used to
define A. niger co-expression relationships and infer gene
function across strain backgrounds.
Taken together, these quality control experiments
strongly suggest that the co-expression resources developed
in this study can be used for high confidence hypothesis
generation at a variety of conceptual levels, including
biological process, metabolic pathway, protein complexes,
and individual genes.
Co-expression resources accurately predict transcription fac-
tors of the ribosomally synthesized natural product AnAFP
of A. niger
In order to provide experimental confirmation in pre-
dictions of biological processes gleaned from this co-
expression resource, we interrogated all datasets associated
with the gene encoding the A. niger antifungal peptide
AnAFP. Ribosomally synthesized antifungal peptides of
the AFP family are promising molecules for use in medical
or agricultural applications to combat human- and plant-
pathogenic fungi (52). We and others could show that ex-
pression of their cognate genes are under tight temporal and
spatial regulation in their native hosts and precedes asexual
sporulation (34,53–55).
The gene encoding AnAFP (An07g01320), is co-
expressed with 986 genes (|0.5|; 605 positively cor-
related /381 negatively correlated (34)). GO enrichment
analyses of positively correlated sub-networks uncovered
that anafp gene expression parallels with fungal secondary
metabolism, carbon limitation and autophagy (34). In to-
tal, 23 predicted transcription factors are co-expressed at
a stringent level among which were the transcription fac-
tors VelC (An04g07320) and StuA (An05g00480; Figure 4),
both of which are key regulators of asexual development
and secondary metabolism in Aspergilli, whereby VelC is
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
Nucleic Acids Research, 2018 7
Figure 4. In vivo proof of the predictive power of gene co-expression sub-networks. (A) Depicted are 23 predicted transcription factors in the anafp sub-
network which either positively (n =19) or negatively (n =4) correlate with the expression of the anafp gene. Two of these (velC and stuA) were selected
for in vivo verification by establishing velC and stuA deletion strains in an anafp::mluc reporter strain published earlier (strain PK2.9)(34). (B,C) Analysis
of anafp promoter activity in a strain deleted for velC (strain BBA17.9) and for stuA (strain BBA13.2), respectively, which was compared to the progenitor
strain PK2.9 abbreviated as ‘wt’. Note that the scale bar is different between B and C, and that StuA is thus likely a very strong repressor of anafp expression.
Data are from liquid microtiter plate cultures, where a defined amount of biomass was used for inoculation and luciferase expression monitored online
during cultivation. Expression levels are depicted for one representative example from three independent experiments. Each experiment was performed in
quintuplicate.
known as an activator, and StuA is both activator or re-
pressor (56–58). In order to confirm a regulatory function
of these transcription factors on anafp expression, we used
a reporter strain in which the anafp ORF has been replaced
with a luciferase gene. Deletion of stuA or velC in this back-
ground revealed a strong increase or decreased/delayed ac-
tivation of the anafp promoter, respectively (Figure 4). In-
terestingly, the transcription factor binding site for VelC is
unknown, whereas the binding sequence for StuA is absent
from the predicted promoter region of anafp. These data
thus indicate that the resources generated in this study en-
able accurate predictions of (in)direct regulatory proteins
even in the absence of DNA binding sites.
Co-expression resources accurately predict transcription fac-
tors of non-ribosomally synthesized natural products of A.
niger
The transcriptional activation of secondary metabolite
(SM) gene clusters in different filamentous fungi is one cur-
rent focus of the fungal research community (16)as>60%
of currently approved clinical drugs are derived from nat-
ural products (59). A. niger stands out due to its excep-
tional high number of predicted SM gene clusters in its
genome (n =78), harboring 81 core enzymes in total, such
as NRPS and PKS (39). However, only a dozen of SMs
have been identified from A. niger so far (60). Our sur-
vey of the expression data of all gene clusters under the
155 cultivation conditions uncovered that the majority of
SM core genes (53) are expressed in at least one condi-
tion (Supplementary File 2). The majority of expressed
core genes are also co-expressed with their cluster mem-
bers (Supplementary File 3). Notably, not all cluster mem-
bers are co-expressed with contiguous transcription fac-
tors. Indeed, only 30% of the gene clusters display co-
expression with contiguous transcription factors (Supple-
mentary File 3). We thus questioned which transcription
factors are regulating these SM gene clusters, and used the
co-expression dataset to assign biological processes to genes
predicted to encode transcription factors. Given the impor-
tant role of chromatin remodelers in activation and silenc-
ing of secondary metabolite clusters, we also interrogated
genes predicted to encode histone deacetylases (61). This
identified two ORFs encoding putative transcription fac-
tors: An07g07370 (TF1) and An12g07690 (TF2), and a his-
tone deacetylase (An09g06520, HD) that are positively and
negatively (TF1, TF2) or only negatively (HD) co-expressed
with numerous core SM genes (Supplementary Table S5).
Notably, all three genes do not reside in contiguous SM gene
clusters but belong to a large SM sub-network consisting
of 152 genes including 26 SM core genes (Supplementary
Table S6), whereby gene expression of TF1 and TF2 cor-
relate very strongly (=0.87). Interrogation of enriched
GO terms for both TF1 and TF2 gene sub-networks re-
vealed enrichment of fatty acid metabolism, autophagy, mi-
tochondria degradation (positively correlated) and matura-
tion of rRNA and tRNA, ribosomal assembly and amino
acid metabolism (negatively correlated). This analysis thus
allowed us to select genes for in vivo functional studies based
on a non-intuitive selection procedure. Neither TF1, TF2
nor HD have been experimentally characterized in fungi so
far. In order to confirm a regulatory function of these puta-
tive regulators on SM core gene expression, we generated (i)
single deletion strains for TF1, TF2 and HD, respectively,
(ii) a double deletion strain for TF1 and TF2, and (iii) in-
dividual conditional overexpression mutants for TF1, TF2
and HD using the Tet-on system (31). The strongest effect
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
8Nucleic Acids Research, 2018
Figure 5. Co-expression sub-networks uncover global regulators of secondary metabolism in A. niger. Controlled batch cultivations using 5 l bioreactor
systems were performed with the progenitor strain and strains deleted for HD, TF1 and/or TF2 (TF1, TF2, TF12, HD) or overexpressing either
transcription factors (TF1OE, TF2OE). Note that we excluded the strain overexpressing HD from this analysis because its gene expression is negatively
correlated with the mostly silent SM core genes. Growth and physiology profiles are given in Supplementary File 4. Samples were taken for untargeted
metabolome analyses (LC–QTOF/MS, Supplementary Table S7) and global RNA-Seq (Supplementary Table S3) during exponential and post-exponential
growth phase. Note that the maximum growth rate was for all strains 0.24 h1except for TF1OE (0.19 h1). (A) Differential gene expression for 81 predicted
secondary metabolite core genes in mutant isolates relative to the progenitor control as determined by RNA-Seq analysis. (B) Assessment of metabolite
abundance demonstrated differential abundance of hundreds of A. niger metabolites following deletion or over-expression of HD, TF1 and TF2. In total,
3,410 primary and secondary metabolites were detected in all runs, 1,260 of which were known compounds and 478 thereof were differentially expressed
upon deletion or overexpression of these putative regulators (Supplementary File 4). (C) Relative abundance of the SM aurasperone B is shown in progenitor
control and mutant isolates throughout bioreactor cultivation as an exemplar.
on the metabolome profile of A. niger was observed dur-
ing overexpression of TF1 and TF2 (Figure 5). SMs up-
regulated under these conditions included, but were not lim-
ited to aurasperones, citreoviridin D, terrein, aspernigrin A,
nigerazine A and B, pyranonigrin A and D, flavasperone,
fonsecin, O-demethylfonsecin, flaviolin, funalenone; among
the SMs down-regulated under these conditions were asper-
pyrone, L-agaridoxin and nummularine F (Figure 5, Sup-
plementary File 4, Supplementary Table S7). These physio-
logical changes were paralleled by up/down-regulation of
thousands of genes as determined by RNA-Seq analyses
(Supplementary Table S3), whereby controlled overexpres-
sion of TF1 (TF2) modulated expression of 45 (43) SM core
genes especially during post-exponential growth phase of
A. niger (Figure 5and Supplementary File 4). This strongly
suggests that both transcription factors are likely global reg-
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
Nucleic Acids Research, 2018 9
ulators modulating gene expression dynamics during late
growth stages of A. niger either directly or indirectly.
DISCUSSION
In this study, we performed a transcriptomic meta-analysis
to generate a high-quality gene co-expression network, and
used this to predict biological processes for 9,579 (65%)
of all A. niger genes including 2,970 hypothetical genes (
|0.5|). The compendium of resources developed in this work
consists of (i) gene-specific sub-networks with two stringent
Spearman cut-offs that ensure high confidence in biolog-
ically meaningful interpretations; (ii) statistically enriched
GO terms for each co-expression network, and (iii) a re-
fined list of co-expression relationships which incorporate
over 1,200 experimentally characterized ORFs to aid pre-
dictions of gene biological process based on experimental
evidence.
In order to demonstrate the utility of these resources, we
firstly interrogated these datasets at a gene-level, demon-
strating that transcription factors VelC and StuA, which are
critical components of ascomycete development and sec-
ondary metabolism, also regulate expression of the A. niger
antifungal peptide AnAFP. Our data provide further evi-
dence of the coupling between development, biosynthesis
of secondary metabolites, and secreted antifungal peptides
(34). The datasets generated in this study can also be used to
identify global regulators at the level of biological processes
as demonstrated for fungal secondary metabolism and the
two transcription factors TF1 and TF2 (which we name
MjkA and MjkB, respectively). Previously, co-expression of
contiguous genes has been used to determine the bound-
aries of secondary metabolite clusters in A. nidulans and
other fungi (23–25). Our study can be viewed as comple-
mentary to such analyses, as MjkA and MjkB are phys-
ically located outside the boundaries of any putative sec-
ondary metabolite cluster, and as such would not be identi-
fied using previous approaches (23,25). Generation of loss-
and gain-of-function mutants demonstrated that they likely
(in)directly regulate dozens of secondary metabolite loci
at the transcript and metabolite level. Interestingly, MjkA
is a Myb-like transcription factor highly conserved in As-
pergilli with orthologues present in several plant genomes.
Myb transcription factors have recently been demonstrated
to regulate plant natural product biosynthesis (62), and
our co-expression data and wet lab experiments suggest
that titratable control of MjkA is a promising strategy for
the activation of ascomycete secondary metabolism during
drug discovery programs. With regards to the application
of our co-expression approach to predict gene biological
processes in other fungi, interrogation of the GEO database
(33) demonstrates that several hundred global gene expres-
sion experiments are available for industrial cell factories
(e.g. A. oryzae,Trichoderma reesei) and human or plant in-
fecting fungi (e.g. A. fumigatus,Cryptococcus neoformans,
Candida albicans,Magnaporthe oryzae), indicating that our
approach can be broadly applied for industrial, medically,
and agriculturally relevant fungi. As the financial costs for
gene expression profiling continues to decline, this study
paves the way for prediction of gene biological function us-
ing co-expression network analyses throughout the fungal
kingdom.
DATA AVAILABILITY
All datasets generated and/or analyzed during this study
are available at FungiDB (http://fungidb.org/fungidb/).
Spearmans correlation coefficients for gene co-expression
sub-networks will also be made available by the correspond-
ing author upon request. RNA seq data have been deposited
at the Gene expression Omnibus under accession number
GSE119311.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
ACKNOWLEDGEMENTS
The authors wish to acknowledge all members of the
FungiDB project for providing the bioinformatics infras-
tructure to integrate the omic’s datasets of this study. Carina
Feldle is acknowledged for her assistance during bioreactor
cultivations.
Authors’ contributions: P.S. performed the transcriptome
meta-analyses, constructed the co-expression network and
executed in silico quality analyses. M.J.K. analyzed sub-
networks related to secondary metabolism. M.J.K., S.J.
and N.P. generated deletion and overexpression strains
and characterized them. M.J.K. and T.S. performed biore-
actor cultivations. M.J.K. analyzed transcriptome and
metabolome data derived from deletion and overexpression
strains. B.B. and B.G. contributed to molecular analyses,
B.N. and S.L. contributed to bioinformatics analyses. T.C.
contributed to bioinformatics analyses and co-wrote the fi-
nal text. V.M. initiated this study, coordinated the project
and co-wrote the final text. All authors read and approved
the final manuscript.
FUNDING
European Commission (funding by the Marie Curie Inter-
national Training Network QuantFung, FP7-People-2013-
ITN) [607332]. Funding for open access charge: Technische
Universit¨
at Berlin.
Conflict of interest statement. None declared.
REFERENCES
1. Cairns,T.C., Nai,C. and Meyer,V. (2018) How a fungus shapes
biotechnology: 100 years of Aspergillus niger research. Fungal Biol.
Biotechnol.,5, 13.
2. Show,P.L., Oladele,K.O., Siew,Q.Y., Aziz Zakry,F.A., Lan,J.C.-W.
and Ling,T.C. (2015) Overview of citric acid production from
Aspergillus niger.Front. Life Sci.,8, 271–283.
3. Pel,H.J., Winde,J.H., Archer,D.B., Dyer,P.S., Hofmann,G.,
Schaap,P.J., Turner,G., de Vries,R.P., Albang,R., Albermann,K. et al.
(2007) Genome sequencing and analysis of the versatile cell factory
Aspergillus niger CBS 513.88. Nat. Biotechnol.,25, 221–231.
4. Gong,W., Cheng,Z., Zhang,H., Liu,L., Gao,P. and Wang,L. (2016)
Draft genome sequence of Aspergillus niger strain An76. Genome
Announc.,4, e01700–e01715.
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
10 Nucleic Acids Research, 2018
5. de Vries,R.P., Riley,R., Wiebenga,A., Aguilar-Osorio,G., Amillis,S.,
Uchima,C.A., Anderluh,G., Asadollahi,M., Askin,M., Barry,K. et al.
(2017) Comparative genomics reveals high biological diversity and
specific adaptations in the industrially and medically important
fungal genus Aspergillus. Genome Biol.,18, 28.
6. Wang,B., Lv,Y., Li,X., Lin,Y., Deng,H. and Pan,L. (2018) Profiling of
secondary metabolite gene clusters regulated by LaeA in Aspergillus
niger FGSC A1279 based on genome sequencing and transcriptome
analysis. Res. Microbiol.,169, 67–77.
7. Singh,N.K., Blachowicz,A., Romsdahl,J., Wang,C., Torok,T. and
Venkateswaran,K. (2017) Draft genome sequences of several fungal
strains selected for exposure to microgravity at the international space
station. Genome Announc.,5, e01602–e01616.
8. Yin,C., Wang,B., He,P., Lin,Y. and Pan,L. (2014) Genomic analysis
of the aconidial and high-performance protein producer, industrially
relevant Aspergillus niger SH2 strain. Gene,541, 107–114.
9. Paul,S., Lude˜
na,Y., Villena,G.K., Yu,F., Sherman,D.H. and
Guti´
errez-Correa,M. (2017) High-quality draft genome sequence of a
biofilm forming lignocellulolytic Aspergillus niger strain ATCC
10864. Stand Genomic Sci.,12,9.
10. Stajich,J.E., Harris,T., Brunk,B.P., Brestelli,J., Fischer,S., Harb,O.S.,
Kissinger,J.C., Li,W., Nayak,V., Pinney,D.F. et al. (2012) FungiDB:
an integrated functional genomics database for fungi. Nucleic Acids
Res.40, D675–D681.
11. Grigoriev,I.V., Nikitin,R., Haridas,S., Kuo,A., Ohm,R., Otillar,R.,
Riley,R., Salamov,A., Zhao,X., Korzeniewski,F. et al. (2014)
MycoCosm portal: gearing up for 1000 fungal genomes. Nucleic
Acids Res.,42, D699–D704.
12. Zerbino,D.R., Achuthan,P., Akanni,W., Amode,M.R., Barrell,D.,
Bhai,J., Billis,K., Cummins,C., Gall,A., Gir´
on,C.G. et al. (2018)
Ensembl 2018. Nucleic Acids Res.,46, D754–D761.
13. Yin,X., Shin,H.D., Li,J., Du,G., Liu,L. and Chen,J. (2017)
Comparative genomics and transcriptome analysis of Aspergillus
niger and metabolic engineering for citrate production. Sci. Rep.,7,
41040.
14. Andersen,M.R., Salazar,M.P., Schaap,P.J., Van De
Vondervoort,P.J.I., Culley,D., Thykaer,J., Frisvad,J.C., Nielsen,K.F.,
Albang,R., Albermann,K. et al. (2011) Comparative genomics of
citric-acid-producing Aspergillus niger ATCC 1015 versus
enzyme-producing CBS 513.88. Genome Res.,21, 885–897.
15. Vesth,T.C., Nybo,J.L., Theobald,S., Frisvad,J.C., Larsen,T.O.,
Nielsen,K.F., Hoof,J.B., Brandl,J., Salamov,A., Riley,R. et al. (2018)
Investigation of inter- and intraspecies variation through genome
sequencing of Aspergillus section Nigri. Nat. Genet.,
doi:10.1038/s41588-018-0246-1.
16. Meyer,V., Andersen,M.R., Brakhage,A.A., Braus,G.H.,
Caddick,M.X., Cairns,C.T., de Vries,R.P., Haarmann,T., Hansen,K.,
Hertz-Fowler,K. et al. (2016) Current challenges of research on
filamentous fungi in relation to human welfare and a sustainable
bio-economy: a white paper. Fungal Biol. Biotechnol.,3, 1–17.
17. Cerqueira,G.C., Arnaud,M.B., Inglis,D.O., Skrzypek,M.S.,
Binkley,G., Simison,M., Miyasato,S.R., Binkley,J., Orvis,J., Shah,P.
et al. (2014) The Aspergillus Genome Database: Multispecies
curation and incorporation of RNA-Seq data to improve structural
gene annotations. Nucleic Acids Res.,42, D705–D710.
18. Cherry,J.M., Hong,E.L., Amundsen,C., Balakrishnan,R., Binkley,G.,
Chan,E.T., Christie,K.R., Costanzo,M.C., Dwight,S.S., Engel,S.R.
et al. (2012) Saccharomyces Genome Database: The genomics
resource of budding yeast. Nucleic Acids Res.,40, D700–D705.
19. Gov,E. and Arga,K.Y. (2017) Differential co-expression analysis
reveals a novel prognostic gene module in ovarian cancer. Sci. Rep.,7,
4996.
20. van Dam,S., V˜
osa,U., van der Graaf,A., Franke,L. and de
Magalh˜
aes,J.P. (2017) Gene co-expression analysis for functional
classification and gene–disease predictions. Brief. Bioinform.,19,
575–592.
21. Hsu,C.-L., Juan,H.-F. and Huang,H.-C. (2015) Functional analysis
and characterization of differential coexpression networks. Sci. Rep.,
5, 13295.
22. Dutkowski,J., Kramer,M., Surma,M.A., Balakrishnan,R.,
Cherry,J.M., Krogan,N.J. and Ideker,T. (2013) A gene ontology
inferred from molecular networks. Nat. Biotechnol.,31, 38–45.
23. Andersen,M.R., Nielsen,J.B., Klitgaard,A., Petersen,L.M.,
Zachariasen,M., Hansen,T.J., Blicher,L.H., Gotfredsen,C.H.,
Larsen,T.O., Nielsen,K.F. et al. (2013) Accurate prediction of
secondary metabolite gene clusters in filamentous fungi. Proc. Natl.
Acad. Sci. U.S.A.,110, E99–E107.
24. Cairns,T. and Meyer,V. (2017) In silico prediction and
characterization of secondary metabolite biosynthetic gene clusters in
the wheat pathogen Zymoseptoria tritici.BMC Genomics,18, 631.
25. Vesth,T.C., Brandl,J. and Andersen,M.R. (2016) FunGeneClusterS:
Predicting fungal gene clusters from genome and transcriptome data.
Synth. Syst. Biotechnol.,1, 122–129.
26. Cooper,M.A. and Shlaes,D. (2011) Fix the antibiotics pipeline.
Nature,472, 32.
27. Macheleidt,J., Mattern,D.J., Fischer,J., Netzker,T., Weber,J.,
Schroeckh,V., Valiante,V. and Brakhage,A.A. (2016) Regulation and
Role of Fungal Secondary Metabolites. Annu. Rev. Genet.,50,
371–392.
28. Boecker,S., Gr¨
atz,S., Kerwat,D., Adam,L., Schirmer,D., Richter,L.,
Petras,D., S¨ussmuth,R.D. and Meyer,V. (2018) Aspergillus niger is a
superior expression host for the production of bioactive fungal
cyclodepsipeptides. Fungal Biol. Biotechnol.,5,4.
29. Meyer,V., Ram,A.F.J. and Punt,P.J. (2010) Genetics, genetic
manipulation, and approaches to strain improvement of filamentous
fungi. In: Manual of Industrial Microbiology and Biotechnology.3rd
edn. Wiley, NY, pp. 318–329.
30. Green,M.R. and Sambrook,J. (2012) Molecular Cloning: A
Laboratory Manual. Cold Spring Harbor Laboratory Press, NY, Vol.
1-3, pp. 1–2028.
31. Meyer,V., Wanka,F., van Gent,J., Arentshorst,M., van den
Hondel,C.A. and Ram,A.F. (2011) Fungal gene expression on
demand: an inducible, tunable, and metabolism-independent
expression system for Aspergillus niger.Appl. Environ. Microbiol.,77,
2975–2983.
32. Fiedler,M.R.M., Gensheimer,T., Kubisch,C. and Meyer,V. (2017)
HisB as novel selection marker for gene targeting approaches in
Aspergillus niger.BMC Microbiol.,17, 57.
33. Barrett,T., Wilhite,S.E., Ledoux,P., Evangelista,C., Kim,I.F.,
Tomashevsky,M., Marshall,K.A., Phillippy,K.H., Sherman,P.M.,
Holko,M. et al. (2013) NCBI GEO: Archive for functional genomics
data sets - Update. Nucleic Acids Res.,41, D991–D995.
34. Paege,N., Jung,S., Sch¨
ape,P., M¨uller-Hagen,D., Ouedraogo,J.P.,
Heiderich,C., Jedamzick,J., Nitsche,B.M., van den Honde,C.A.,
Ram,A.F. et al. (2016) A transcriptome meta-analysis proposes novel
biological roles for the antifungal protein Anafp in Aspergillus niger.
PLoS One,11, e0165755.
35. Gautier,L., Cope,L., Bolstad,B.M. and Irizarry,R.A. (2004)
affy––analysis of Affymetrix GeneChip data at the probe level.
Bioinformatics,20, 307–315.
36. Gentleman,R.C., Carey,V.J., Bates,D.M., Bolstad,B., Dettling,M.,
Dudoit,S., Ellis,B., Gautier,L., Ge,Y., Gentry,J. et al. (2004)
Bioconductor: open software development for computational biology
and bioinformatics. Genome Biol.,5, R80–R80.
37. Spearman,C. (1987) The proof and measurement of association
between two things. By C. Spearman, 1904. Am. J. Psychol.,100,
441–471.
39. Inglis,D.O., Binkley,J., Skrzypek,M.S., Arnaud,M.B.,
Cerqueira,G.C., Shah,P., Wymore,F., Wortman,J.R. and Sherlock,G.
(2013) Comprehensive annotation of secondary metabolite
biosynthetic genes and gene clusters of Aspergillusnidulans,A.
fumigatus,A. niger and A. oryzae.BMC Microbiol.,13, 91.
40. Sanchez,J.F. and Wang,C.C.C. (2012) The chemical identification and
analysis of Aspergillus nidulans secondary metabolites. In: Fungal
Secondary Metabolism - Methods and Protocols, pp. 97–109.
41. Park,J., Hulsman,M., Arentshorst,M., Breeman,M., Alazi,E.,
Lagendijk,E.L., Rocha,M.C., Malavazi,I., Nitsche,B.M., van den
Hondel,C.A. et al. (2016) Transcriptomic and molecular genetic
analysis of the cell wall salvage response of Aspergillus niger to the
absence of galactofuranose synthesis. Cell Microbiol.,18, 1268–1284.
42. Dobin,A., Davis,C.A., Schlesinger,F., Drenkow,J., Zaleski,C., Jha,S.,
Batut,P., Chaisson,M. and Gingeras,T.R. (2013) STAR: Ultrafast
universal RNA-seq aligner. Bioinformatics,29, 15–21.
43. Love,M.I., Huber,W. and Anders,S. (2014) Moderated estimation of
fold change and dispersion for RNA-seq data with DESeq2. Genome
Biol.,15, 550.
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019
Nucleic Acids Research, 2018 11
44. Benjamini,Y. and Hochberg,Y. (1995) Controlling the false discovery
rate: a practical and powerful approach to multiple testing. J. R. Stat.
Soc. Ser. B,57, 289–300.
45. Punt,P.J., van Gemeren,I.A., Drint-Kuijvenhoven,J., Hessing,J.G.,
van Muijlwijk-Harteveld,G.M., Beijersbergen,A., Verrips,C.T. and
van den Hondel,C.A. (1998) Analysis of the role of the gene bipA,
encoding the major endoplasmic reticulum chaperone protein in the
secretion of homologous and heterologous proteins in black
Aspergilli.Appl. Microbiol. Biotechnol.,50, 447–454.
46. Hargrove,T.Y., Wawrzak,Z., Lamb,D.C., Guengerich,F.P. and
Lepesheva,G.I. (2015) Structure-functional characterization of
cytochrome P450 Sterol 14-alpha-Demethylase (CYP51B) from
Aspergillus fumigatus and molecular basis for the development of
antifungal drugs. J. Biol. Chem.,290, 23916–23934.
47. Scharf,D.H., Heinekamp,T. and Brakhage,A.A. (2014) Human and
plant fungal Pathogens: The role of secondary metabolites. PLOS
Pathog,10, e1003859.
48. Hoang,H.D., Maruyama,J.I. and Kitamoto,K. (2015) Modulating
endoplasmic reticulum-Golgi cargo receptors for improving secretion
of carrier-fused heterologous proteins in the filamentous fungus
Aspergillus oryzae.Appl. Environ. Microbiol.,81, 533–543.
49. Dhingra,S. and Cramer,R.A. (2017) Regulation of sterol biosynthesis
in the human fungal pathogen Aspergillus fumigatus: Opportunities
for therapeutic development. Front. Microbiol.,8, 92.
50. Yasmin,S., Alcazar-Fuoli,L., Grundlinger,M., Puempel,T., Cairns,T.,
Blatzer,M., Lopez,J.F., Grimalt,J.O., Bignell,E., Haas,H. et al. (2012)
Mevalonate governs interdependency of ergosterol and siderophore
biosyntheses in the fungal pathogen Aspergillus fumigatus.Proc. Natl.
Acad. Sci. U.S.A.,109, E497–E504.
51. Andersen,M.R., Vongsangnak,W., Panagiotou,G., Salazar,M.P.,
Lehmann,L. and Nielsen,J. (2008) A trispecies Aspergillus
microarray: comparative transcriptomics of three Aspergillus species.
Proc. Natl. Acad. Sci.,105, 4387–4392.
52. Meyer,V. (2008) A small protein that fights fungi: AFP as a new
promising antifungal agent of biotechnological value. App. Microbiol.
Biotechnol.,78, 17–28.
53. Meyer,V., Wedde,M. and Stahl,U. (2001) Transcriptional regulation
of the antifungal protein in Aspergillus giganteus.Mol. Genet.
Genomics,266, 747–757.
54. Heged¨us,N., Sigl,C., Zadra,I., P´
ocsi,I. and Marx,F. (2011) The paf
gene product modulates asexual development in Penicillium
chrysogenum.J. Basic Microbiol.,51, 253–262.
55. Meyer,V. and Jung,S. (2018) Antifungal peptides of the AFP family
revisited: are these cannibal toxins? Microorganisms,6, E50.
56. Sheppard,D.C. (2005) The Aspergillus fumigatus StuA protein
governs the Up-Regulation of a discrete transcriptional program
during the acquisition of developmental competence. Mol. Biol. Cell,
16, 5866–5879.
57. Hu,P., Wang,Y., Zhou,J., Pan,Y. and Liu,G. (2015) AcstuA, which
encodes an APSES transcription regulator, is involved in conidiation,
cephalosporin biosynthesis and cell wall integrity of Acremonium
chrysogenum.Fungal Genet. Biol.,83, 26–40.
58. Clutterbuck,A.J. (1969) A mutational analysis of conidial
development in Aspergillus nidulans.Genetics,63, 317–327.
59. Newman,D.J. and Cragg,G.M. (2012) Natural products as sources of
new drugs over the 30 years from 1981 to 2010. J. Nat. Products,75,
311–335.
60. Nielsen,K.F., Mogensen,J.M., Johansen,M., Larsen,T.O. and
Frisvad,J.C. (2009) Review of secondary metabolites and mycotoxins
from the Aspergillus niger group. Anal. Bioanalyt. Chem.,395,
1225–1242.
61. Bayram, ¨
O., Krappmann,S., Ni,M., Jin,W.B., Helmstaedt,K.,
Valerius,O., Braus-Stromeyer,S., Kwon,N.J., Keller,N.P., Yu,J.H. et al.
(2008) VelB/VeA/LaeA complex coordinates light signal with fungal
development and secondary metabolism. Science,320, 1504–1506.
62. Liu,J., Osbourn,A. and Ma,P. (2015) MYB transcription factors as
regulators of phenylpropanoid metabolism in plants. Mol. Plant,8,
689–708.
Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gky1183/5213907 by Technische Universitaet Berlin user on 18 January 2019