Data-driven analysis for multimodal neuroimaging [original]

Data-driven analysis for

multimodal neuroimaging

vorgelegt von

Felix Bießmann

Von der Fakultät IV — Elektrotechnik und Informatik

der Technischen Universität Berlin

zur Erlangung des akademischen Grades

Doktor der Naturwissenschaen (Dr. rer. nat.)

genehmigte Dissertation

Promotionsausschuss:

Vorsitzender: Prof. Dr. Olaf Hellwich (Technische Universität Berlin)

1. Gutachter: Prof. Dr. Klaus-Robert Müller (Technische Universität Berlin)

2. Gutachter: Prof. Dr. Nikos K Logothetis (MPI Tübingen)

3. Gutachter: Prof. Dr. Tom Eichele (University of Bergen)

Tag der wissenschalichen Aussprache: 07.12.2011

Berlin, 2012

D

In memoriam Florian Perleth

Acknowledgements

I grateful for the support of many people during the past few years. Above

all I thank my supervisors Prof. Dr. Klaus-Robert Müller, Prof. Dr. Gregor

Rainer and Prof. Dr. Nikos K. Logothetis. Prof. Dr. Müller always encouraged

me with scientiﬁc enthusiasm and helped in countless stimulating discussions with

experienced advice on theoretical and practical aspects of statistical learning the-

ory. With his Machine Learning Group he created an inspiringly interdisciplinary

and friendly atmosphere – and gave me the opportunity to pursue research interest

in neuroscience as well as in artiﬁcial intelligence in the best way I could imagine.

Furthermore I am very grateful to Prof. Dr. Gregor Rainer for continuous support

and for valuable insights into the experimental side of neuroscience. Of course I

owe my utmost gratitude to Prof. Dr. Nikos K. Logothetis. His pioneering work

on the neurophysiological basis of the hemodynamic signal is the starting point of

this dissertation. I proﬁted enormously from his scientiﬁc advice. Without his sup-

port, this dissertation would not have been possible. Also I would like to thank

Dr. Yusuke Murayama for scientiﬁc advice, technical help and his interest in new

analysis methods. Moreover I am indebted to the outstanding experimental exper-

tise of Axel Öltermann, Dr. Marc Augath, Dr. Jozien Goense, Dr. Alexander Rauch,

Prof. Dr. Robert Kretz, Jule Veit and Anwesha Bhattacharyya. By the same token

I would like to thank Dr. Arthur Gretton, Dr. Jakob Macke and Prof. Dr. Matthias

Bethge for their help on mathematical problems during my time in Tübingen. I also

thank Dr. Andreas Harth foran interesting research project outside of neuroscience.

I was lucky to meet many friendly, creative and bright minds who helped a lot for

calibrating my sense of whats right and wrong in science and beyond, among them

Michael Gaebler, Dr. David Greenberg, Daniel Lebrecht, Dr. Dr. Franz Kiraly , Ste-

fan Haufe, Paul von Bünau and Jule Veit. Special thanks go to Frank C. Meinecke

forconstant andpatientsupervision. Hisexperienced skepticism, hissharp thinking

and his talent to explain complex concepts in a comprehensive way were invaluable

tome. TohimIalsoowethelayoutofthisthesis. AlsoIwouldliketothankValentina

Mosienko for her patience and support. Finally I owe my deepest gratitude to my

family. e most important decisions which led to this thesis were not taken by me,

but by my parents. ank you for making me curious.

Abstract

N, the measurement, analysis and visualization of neural activity,

contributed considerably to our understanding of information processing in

the brain. e availability of non-invasive neuroimaging devices such as functional

magnetic resonance imaging (fMRI) has been increasing rapidly throughout the last

two decades. Nowadays every interested student can obtain non-invasively high res-

olution image time series of the blood-oxygen level dependent (BOLD) signal using

fMRI. How exactly neural activity is reﬂected in the BOLD contrast is still subject of

activeresearch. Fora moreaccurateinterpretationof the fMRI signaland the under-

lying neurovascular coupling mechanisms, combined measurements of intracranial

neural activity and fMRI signals are indispensable. Such simultaneous measure-

ments have become technically possible – however appropriate analysis methods are

still lacking. Classical analysisapproachesrelyonsimplifying assumptionsaboutthe

neurovascular coupling dynamics. ese assumptions are convenient but numerous

studies have provided empirical evidence against them.

In this dissertation a novel analysis method, termed temporal kernel Canoni-

cal Correlation Analysis (tkCCA), will be developed, tested on artiﬁcial data and

applied to experimental data in order to investigate neurovascular coupling mech-

anisms. TkCCA estimates dependency structures between high dimensional data

with complex temporal coupling dynamics. e important advantages of tkCCA

compared to standard methods are a) tkCCA can be directly applied to multimodal

data, b) tkCCA is very eﬃcient for high dimensional data with few data points (as is

the case for fMRI) and c) tkCCA does not make use of restrictive assumptions about

thedatageneratingprocess. InparticulartkCCAcanbeusedtoanalyzehighdimen-

sional simultaneous measurements of neural activity and fMRI signals. Predictions

of neural activity using tkCCA are better than when using classical methods. Basic

research as well as clinical applications can proﬁt from this more accurate predic-

tion. Besides tkCCA is readily applicable to other domains in which data streams

have high dimensional features that are non-instantaneously coupled, such as data

from social networks in the World Wide Web.

viii

Zusammenfassung

B Verfahren in den Neurowissenschaen haben unser Verständnis

von Informationsverarbeitung im Hirn entscheidend geprägt. Die schnelle

Verbreitung von nicht-invasiven bildgebenden Verfahren wie etwa der funktionellen

Magnet-Resonanz Tomographie (fMRT) in den letzten beiden Dekaden erlaubt es

heutzutage jedem interessierten Studenten nicht-invasiv den Blutsauerstoﬀgehalt

im Hirn zu messen und so räumlich hochaufgelöste Zeitreihen von Hirnaktivität

aufzuzeichnen. Wie genau sich jedoch neuronale Aktivität im Blut-Sauerstoﬀ Gehalt

abhängigem Kontrast (englisch: Blood-oxygen level dependent oder BOLD contrast)

wiederspiegelt, isttrotzintensiverForschungan derNeurovaskulären Kopplung nach

wie vor aktuelles Forschungsthema. Für ein besseres Verständnis des fMRT Signals

sind kombinierte Messungen von intrakranialer neuronaler Aktivität und fMRT

Signalen zwingend erforderlich. Diese multimodalen Simultanmessungen sind in-

zwischen technisch möglich. Jedoch fehlen geeignete Analysemethoden. Gängige

Verfahren basieren auf vereinfachenden Annahmen über die neurovaskuäre Kop-

plungsdynamik. Diese Annahmensind zwarpraktisch, erwiesen sichaber inzahlre-

ichen Studien als falsch.

In dieser Dissertation wird ein neuartiges Analyseverfahren, temporal kernel

CanonicalCorrelationAnalysis(tkCCA),entwickelt,getestedundangewandt. TkCCA

schätztAbhängigkeitsstruktureninhochdimensionalenDatenmitnicht-instantaner

Kopplung. DieentscheidendenVorteilevontkCCAgegenüberherkömmlichenMeth-

oden sinda) direkte Anwendbarkeit aufmultimodale Daten, b)Eﬃzienz bei hochdi-

mensionalen Daten mit wenig Datenpunkten und c) Verzicht auf einschränkende

Annahmen über die generativen Modelle der gemessenen Daten. Insbesondere er-

laubttkCCA die Analysehochdimensionalermultimodaler Simultanmessungenvon

neuronaler Aktivität und fMRT Signalen. Mit Hilfe von tkCCA können neuronale

Signale besser aus multivariaten fMRI Messungen vorhergesagt werden. Davon

könnensowohlGrundlagenforschungalsauchklinischeDiagnostikproﬁtieren. Da-

rüberhinausisttkCCAdirektanwendbaraufandereArtenhochdimensionalerDaten-

ströme mit nicht-instantanen Abhängigkeiten, wie etwa in sozialen Netzwerken im

World Wide Web.

Contents

1 Introduction 1

I Unimodal Neuroimaging

2 A short history of neuroimaging 9

3 Unimodal neuroimaging 11

4 Unimodal analysis approaches 19

II Multimodal Neuroimaging

5 Multimodal neuroimaging 35

6 Multimodal analysis approaches 45

7 Data-driven multimodal analysis 53

8 Model-free analysis of neurovascular coupling 73

9 Summary and outlook 103

Appendix

A Mathematical preliminaries 109

B Experimental protocols 113

xii CONTENTS

Chapter 1

Introduction

… it is in the brain that everything takes place.

Oscar Wilde

T brain is an exceptionally interesting organ. Like no other organ it is in the

focus of attention of a broad spectrum of scientiﬁc disciplines, ranging from

natural science over psychology to philosophy. Many of the insights we have about

the brain were gained by measuring its electrical and metabolic activity. Measure-

ment of brain activity and the visualization thereof using appropriate analysis tech-

niques is called neuroimaging. Various neuroimaging techniques have been devel- Measurement, analysis

and visualization of

brain activity is called

neuroimaging

oped, but each method has its speciﬁc technological and physiological limits. Some

techniques, such as electroencephalography (EEG), can visualize brain activity at a

high temporal resolution, but only at a very poor spatial resolution. Other meth-

ods, such as measurements of the blood-oxygen level dependent (BOLD) signal for

instance, canimage thewholebrain atonce, inparticular deepstructuresthatcannot

be measured with EEG, but they cannot give insights in the fast temporal dynam-

ics of brain activity. is is why combinations of multiple neuroimaging modalities

have become popular. Multimodal imaging setups can take advantage of comple-

mentary views on neural activity and enhance our understanding about how neural

information processing is reﬂected in each modality. In order to exploit the poten-

tial ofmultimodalmethods, dedicated analysismethodsareneeded. Manysolutions

to this data integration problem have been proposed. However the complex physi-

ological processes that give rise to the BOLD contrast are diﬃcult to model. us,

every analysis method for hemodynamic neuroimaging data has to make simpli-

fying assumptions about the data generating process. is allows for more eﬃcient

computations and easier interpretation of the results. However due to these assump-

tionssomeaspectsofbrainactivitymightbelostin the analysis. Inthis dissertationa

novel analysis framework for multimodal neuroimaging data will be proposed. Ap-

plying this analysis method on multimodal neural data, a simplifying assumption

underlying many studies based on BOLD contrast will be tested empirically.

2CHAPTER 1. INTRODUCTION

What is measured with fMRI? One of the most important applications of multi-

modal neuroimaging is the investigation of the neurovascular coupling mechanisms,

the relationship between brain activity and the hemodynamic signal. While accu-e relationship

between brain activity

and its hemodynamic

response is called

neurovascular coupling

rate measurements of brain activity require intracranial electrodes, hemodynamic

signalscanbemeasurednon-invasivelyusingforinstancefMRI [Ogawaetal.,].

ehopethatnon-invasivetechniqueswilleventuallyreplaceinvasiveneuralrecord-

ings is the basis for the enormous success of fMRI in recent years [Friston, ].

However the exact relationship between neural activity and fMRI signals is still sub-

ject of active research [Logothetis, ]. e physiological processes underlying

the generation of the BOLD signal are still not suﬃciently well understood to model

them accurately. Many detailed models have been proposed, but they are diﬃcult

to test empirically. A few reasons for this are: Recording the quantities involved in

neurovascular coupling is technically challenging. Large scale recordings of all elec-

trical and chemical forces involved cannot be obtained yet. Another reason is the

computational complexity of the analysis. Classical statistical methods are not de-

signed for state of the art multimodal data with tens of thousands of dimensions and

only a few data points – a classical problem faced by researchers working on fMRI

data. Technical advances in the ﬁeld of multimodal recordings enable researchers to

obtain high resolution hemodynamic measurements simultaneously with intracra-

nial neurophysiological recordings [Logothetis et al., , Oeltermann et al., ,

Goense and Logothetis, ]. For these data, novel analysis methods are needed

[Dale and Halgren, , Friston, ].

Anovelanalysisframework Inthisdissertationanovelanalysisframeworktermed

temporal kernel canonical correlation analysis (tkCCA) is proposed. It is tailored

to the speciﬁc needs in multimodal neuroimaging: Scalability to high dimensional

data and the ability to account for arbitrary non-instantaneous coupling mecha-

nisms while making only minimal assumptions about the data generating process

and the coupling mechanisms. In order to meet these requirements, tkCCA com-

bines well established statistical learning techniques with modern machine learning

methods.

Does spatiotemporal variability of fMRI signals contain neural information?

Aer introduction of tkCCA and validation thereof on synthetic data, the method

is used to test a simplifying assumption underlying many neuroimaging methods, in

particular multimodal analyses. Most neuroimaging methods assume that the spa-

tial dynamics of the hemodynamic response to neural activation is separable from

the temporal dynamics of the response. Non-separable spatiotemporal variability of

the hemodynamic response is thus neglected in these methods. Empirical evidence

suggests that this spatiotemporal separability assumption is a good approximation:

When considering the voxels in an fMRI image sequence located around a neural

ensemble of interest aer that ensemble was stimulated, the temporal dynamics are

very similar. However there is substantial evidence showing that the hemodynamic

response varies across brain regions [Aguirre et al., ] or diﬀerent cortical lay-

ers [Yacoub et al., ]. Taking this variance into account can reveal areas of ac-

tivation that would have been overlooked by methods that assume spatiotemporal

separability [Mourão-Miranda et al., , Lu et al., ]. e scientiﬁc hypothesis

underlying the spatiotemporal separability assumption is: Does the spatiotemporal

variability of the hemodynamic response carry information about neural signals?

While the above studies provide only indirect evidence, tkCCA applied to simulta-

neous recordings of neural and hemodynamic activity can directly test this hypoth-

esis.

Overview of this dissertation

e structure of this thesis is divided in two parts. e ﬁrst part will give a short

overview over popular neuroimaging methods in chapters ,  and . e second

part will deal with the combination of neuroimaging methods in multimodal se-

tups. Chapter  gives an introduction to multimodal neuroimaging setups and some

motivating examples of their application in basic neuroscientiﬁc research and clin-

ical application. Common analysis standards established in the literature will be

reviewed in chapter .

e main part of this thesis is chapter  in which the proposed algorithm is de-

veloped. In collaboration with the Max-Planck Institute for Biological Cybernetics,

Tübingen, we applied tkCCA for estimation of neurovascular coupling dynamics

and high accuracy prediction of neural activity from simultaneously recorded fMRI

signals. Chapter  will show applications of the proposed algorithm on artiﬁcial and

real data.

Own contributions

e algorithm and its application to multimodal neural data have been published

in internationally renown journals in the machine learning and neuroimaging com-

munity; a detailed overview is given on the next page. Early stages of this work have

been presented in form of conference abstracts at the Forum of the European Neu-

roscience Society (FENS) in Geneva [Bießmann et al., ], the Computational

Neuroscience Society (CNS) in Berlin [Bießmann et al., ] and the Computa-

tional and Systems Neuroscience (COSYNE) Conference in Salt Lake City [Bieß-

mann et al., a]. is work proﬁted from scientiﬁc exchange aer presentation

of preliminary results at the RIKEN Brain Science Institute (Tokyo), the Cognitive

4CHAPTER 1. INTRODUCTION

and Neurobiological Imaging Lab at Stanford University and the Redwood Cen-

ter for eoretical Neuroscience at University of California, Berkeley. Parts of this

thesis are based on a review article on multimodal neuroimaging analysis methods

[Bießmann et al., b]. Next to the work on multimodal neuroimaging we also

explored applications in other domains such data mining [Bießmann and Harth,

]. A summary of the work that is published as peer reviewed manuscripts is

given in the following.

Peer reviewed manuscripts

. Bießmann, Meinecke, Gretton, Rauch, Rainer, Logothetis, and Müller, Tem-

poral Kernel CCA and its Application in Multimodal Neuronal Data Analysis,

Machine Learning, 

Summary Temporal kernel canonical correlation analysis (tkCCA) was

proposed and tested in simulations and preliminary data from simulta-

neousmeasurementsofneurophysiological recordingsandfMRIsignals

during sensory stimulation.

Contribution I developed parts of the recording soware, did all analy-

ses and wrote the manuscript.

. Murayama, Bießmann, Meinecke, Müller, Augath, Oeltermann, and Logo-

thetis, Relationship between neural and hemodynamic signals during sponta-

neous activity studied with temporal kernel CCA,Magnetic Resonance Imag-

ing, 

Summary In this manuscript we showed that tkCCA can robustly esti-

mate neurovascular coupling dynamics from recordings of spontaneous

activity in primary visual cortex.

Contribution Ididpartsoftheanalysesandwrotepartsofthemanuscript.

. Bießmann and Harth, Analysing Dependency Dynamics in Web Data,Pro-

ceedings of AAAI Symposium, 

Summary An alternative application of tkCCA is illustrated on social

networkdata; usingtimeresolvedlisteningbehaviorofusersinmyfriend

subgraph on http://last.fm I extracted music trends and identiﬁed users

who where ahead of this trend and those who lagged behind.

Contribution I collected the data, analyzed it and wrote the manuscript.

. Bießmann, Plis, Meinecke, Eichele, and Müller, Analysis of Multimodal Neu-

roimaging Data,IEEE Reviews in Biomedical Engineering, 

Summary is review summarizes the state of the art in multimodal

neuroimaging analysis with a special focus on data driven methods.

Contribution I wrote major parts of the manuscript and provided ap-

plication examples from artiﬁcial and real data.

. Bießmann, Murayama, Logothetis, Müller, and Meinecke, Improved Decoding

of Neural Activity from fMRI Signals: Towards Non-Separable Spatiotemporal

Deconvolutions, in revision

Summary In this manuscript we show that the spatiotemporal variabil-

ity of the hemodynamic response carries information about neural ac-

tivity that most fMRI analysis methods neglect.

Contribution I analyzed the data and wrote the paper.

. Bhattacharyya, Bießmann, Veit, Kretz, and Rainer, Functional and laminar

dissociations between muscarinic and nicotinic cholinergic neuromodulation in

the tree shrew primary visual cortex, to appear in European Journal of Neuro-

science

Summary In this paper we showed diﬀerential eﬀects of the neuromod-

ulator Acetylcholine (ACh) on visual information processing in diﬀer-

ent layers of primary visual cortex.

Contribution I developed parts of the stimulus presentation soware,

helped recording some of the data, contributed to the data analysis and

wrote minor parts of the manuscript.

. Rauch, Zhang, Bießmann, Meinecke, Goense, Müller, Rainer, and Logothetis,

Baseline BOLD signal shi in macaque primary visual cortex (V) aer local

application of Acetylcholine, in preparation

Summary In this paper we showed diﬀerential eﬀects of the neuromod-

ulator Acetylcholine (ACh) on visual information processing as mea-

sured with neurophysiological recordings and hemodynamic activity.

e BOLD signal exhibited a pronounced increase in baseline activity,

while the eﬀects on neural activity were rather heterogeneous. A series

of additional experiments outside of the scanner helped to resolve the

heterogeneous eﬀects on neural activity: In [Bhattacharyya et al., ]

we could show that the eﬀects of ACh can be dissociated with respect to

laminar position and receptor type.

Contribution I developed the realtime recording system, helped record-

ing some of the data, performed all analyses and wrote minor parts of

the manuscript.

6CHAPTER 1. INTRODUCTION

Part I

Unimodal Neuroimaging

Chapter 2

A short history of neuroimaging

Es war von vornherein zu erwarten, daß auch im

Zentralnervensystem […] bioelektrische

Erscheinungen nachweisbar seien.

Berger [], Über das

Elektroenkephalogramm des Menschen

S the late th century researchers explore how cognitive functions are re-

ﬂected in measurements of neural activity [Caton, ]. Using diﬀerent neu-

roimaging techniques, disciplines such as psychology, biology and medicine accu-

mulated knowledge about how what we perceive, feel, think and do is related to the

complex activity patterns of neurons in our central nervous system. Table . high-

lights a selection of important achievements in the history of (multimodal) neu-

roimaging. Animal studies have been indispensable for a better understanding of

how neural activity is related to the outside world, some examples are [Mountcas-

tle, , Hubel and Wiesel, ]. But the most fascinating mental phenomena,

such as higher cognitive functions like language or reasoning, are diﬃcult to study

with animals. Oen long training phases are needed to establish an experimental

paradigm. Humans have a clear advantage here: We can use language to commu-

nicate what a subject is supposed to do in an experiment. Human neuroscientiﬁc

experiments rely on non-invasive measurements of brain activity. ey can be ob-

tained for instance by electroencephalography (EEG) [Berger, ], magnetoen-

cephalography (MEG) [Cohen, ], near infrared spectroscopy (NIRS) [Jöbsis,

] or functional magnetic resonance imaging (fMRI) [Ogawa et al., ]. e

quantities measured by each of these modalities have diﬀerent physiological origins

and thus diﬀerent limitations and advantages. And each modality reﬂects neural ac-

tivity at a diﬀerent spatiotemporal scale. A short summary of various neuroimaging

techniques is given in the next chapter.

10 CHAPTER 2. A SHORT HISTORY OF NEUROIMAGING

 Caton Electrocorticography (ECoG) reveals electrical ac-

tivity in the brain in response to visual stimulation

 Berger Electroencephalography (EEG) reveals brain oscilla-

tionsbetween −Hz(α-rhythms)in occipital areas

associated with states of vigilance

 Mountcastle Detailed topographic mapping of sensory modalities

using intracranial electrophysiology

 Cohen Magnetoencephalography (MEG) for measuring α

activity in occipital cortex

 Jöbsis Near infrared spectroscopy (NIRS) for non-invasive

imaging of brain activity

 Grinvald et al. Combined intrinsic optical signal imaging and in-

tracranial electrophysiology to investigate neurovas-

cular coupling

 Ogawa et al. Functional magnetic resonance imaging (fMRI) to

visualize brain activity

 Dale et al. Sequential fMRI and MEG

 Lemieux et al. Simultaneous fMRI and EEG

 Logothetis et al. Simultaneous fMRI and intracranial

microelectrode recordings

Table .: A short history of (multimodal) neuroimaging methods

Chapter 3

Unimodal neuroimaging

M neuroimaging modalities measure either electrophysiological or hemo-

dynamic signals. A widely used neuroimaging technique for electrophysi-

ological activity with an exquisite spatial and temporal resolution are intracranial

microelectrode recordings. Among hemodynamic modalities, fMRI became most

popular [Friston, ]. Electrophysiological recordings pick up changes in electro-

magnetic ﬁelds induced by neural activity; energy consumption of neural activity

is correlated with blood oxygenation. e hemodynamic signal thus reﬂects neural

activity indirectly. Both electrophysiological and hemodynamic signals can be mea-

sured invasively and non-invasively. Electrodes can be placed directly in the neural Neural activity is

reﬂected directly in

electrical ﬁeld changes

and indirectly in the

amount of oxygen

bound to hemoglobin

molecules; both can be

measured invasively

and non-invasively;

tissue, on the cortical surface (electrocorticograms or ECoG) [Caton, ] or on

the skull (electroencephalograms or EEG) [Berger, ]. Neural activity is also re-

ﬂected in the magnetic ﬁeld ﬂuctuations which can be measured non-invasively by

magnetoencephalograms (MEG) [Cohen, ]. EEG and MEG measure changes

in electrical andmagnetic ﬁelds, respectively, onthe scalp surface. eorigin of EEG

and MEG signals areelectrical dipoles in the cortex emerging from synchronized ac-

tivity of neighboring neurons with elongated shape [Murakami and Okada, ].

e strongest signal in EEG recordings arises from dipoles oriented perpendicular

to the scalp surface [Nunez and Srinivasan, ]. MEG in contrast is most sensi-

tive to cortical dipoles tangential to the scalp [da Silva and Niedermayer, ]. e

magnetic ﬁelds measured with MEG are not aﬀected by volume conduction and can

resolve ﬁner structures than EEG [Grynszpan and Geselowitz, , Cuﬃn and Co-

hen, ]. Hemodynamic activity can be measured semi-invasively using intrinsic

optical imaging [Grinvald et al., ] ornon-invasively by functional magnetic res-

onance imaging (fMRI) [Ogawa et al., ] and near infrared spectroscopy (NIRS)

[Jöbsis, ]. In NIRS infrared light is sent through the scull and the cortical tissue;

as oxygenated blood has diﬀerent wavelength absorption characteristics than deoxy-

genated blood, neural activity can be measured by examining the reﬂected light. e

spatial resolution is much lower than fMRI and imaging is restricted to the cortical

surface but the temporal resolution can be higher than fMRI; besides the setup is

much cheaper and less complex than that of fMRI. Next to these non-invasive neu-

roimaging techniques, there are semi-invasive techniques that require opening the

12 CHAPTER 3. UNIMODAL NEUROIMAGING

Measurement Origin Resolution Invasive

Spatial Temporal

Electrophysiological activity

Intracranial

recordings

single cell /

population activity high high yes

EEG cortical dipoles

orthogonal to scalp low high no

ECoG cortical dipoles

orthogonal to scalp medium high yes

MEG cortical dipoles

tangential to scalp low high no

Hemodynamic activity

Optical imaging blood oxygenation

and volume high high yes

NIRS blood oxygenation

and volume low medium no

fMRI blood oxygenation high low no

Table .: Simpliﬁed overview of popular neuroimaging modalities;

skull but no penetration of neural tissue as in the case of intracranial electrophysi-

ological recordings. Semi-invasive preparations such as ECoG and intrinsic optical

imaging oﬀer a higher resolution than completely non-invasive methods and bear

fewer risks than invasive recordings. ECoG measures electrical oscillations between

electrodes directly on the cortex. Similarly hemodynamic activity can be imaged

with optical imaging in minimally invasive preparations also in humans [Arthur

and Nader, ].

3.1 Electrophysiogical measurements

e computations carried out by our brain are reﬂected in changes of electrical po-

tentials across the cell membrane of neurons [Hodgkin and Huxley, ]. Under-

standing neural computations requires measurements of electrophysiological activ-

ity. is can be done at various levels, invasively, semi-invasively with ECoG or

non-invasively with EEG or MEG. e most detailed measurements of neural activ-

ity can be obtained with invasive electrophysiological recordings.

Physiologicalorigin Invasiveelectrophysiologicalrecordingsmeasurethechanges

in electrical ﬁelds emerging from neural activity. Neurons communicate mainly via

activation of chemosensitive ion channels located on the (post-)synapse, illustrated

as black dots in ﬁgure .. If a neuron releases neurotransmitters at the synapse, ion

channels on the postsynaptic neuron open. e opening allows ions to ﬂow down

their electrochemical gradient. is results in a depolarization of the dendritic part

of the cell with respect to the extracellular medium (indicated by red plus sign in

3.1. ELECTROPHYSIOGICAL MEASUREMENTS 13

Dendrites

(Input)

Soma

Axon

(Output)

Neurophys WT05/06

T01

Electric Dipole

ground: V ≡0

∆V > 0

∆V < 0

The dependence of the potential polarity is due to the fact that the potential differences measured

with respect to ground depend on the electrode position within the electric field of a dipole.

Activated neurons generate transient electric dipols due to inhomogeneous charge distributions.

Neurophys WT05/06

T01

Electric Dipole

ground: V ≡0

∆V > 0

∆V < 0

The dependence of the potential polarity is due to the fact that the potential differences measured

with respect to ground depend on the electrode position within the electric field of a dipole.

Activated neurons generate transient electric dipols due to inhomogeneous charge distributions.

Synapse

Figure .: Sketch of a neuron: In grey the axon of another cell forming a synapse

with the neuron in black. During rest, the intracellular medium is approximately

at −mV with respect to the extracellular medium; neurotransmitters released at

the synapse result ina depolarization inthedendrite, thecellbecomesanelectrical

dipole: the soma is negatively charged relative to the dendrite; if many neurons

are arranged in parallel and receive synchronized dendritic input, this dipole will

become stronger – eventually strong enough give rise to electromagnetic ﬁelds

that can be measured outside of the scull using EEG;

the post synapse and a blue minus sign in the extracellular medium). Synchronized

depolarization of large ensembles of neighboring neurons result in the generation

of electromagnetic dipoles. e ﬁelds emerging from these dipoles can be measured

extracellularly with electrodes, onthe cortical surfacewith ECoG oroutsidetheskull

with non-invasive EEG or MEG measurements. e strength of the measured signal

depends critically on the dipole generating neurons, their shape and their spatial ar-

rangement in an ensemble. Aspiny inhibitory interneurons with radially symmetric

dendritic trees around their somata for instance will form dipoles that are diﬃcult

to measure in extracellular recordings. Other cells, such as large pyramidal neu-

rons, have a bipolar shape with the some on one end and the dendrite elongating

in parallel to neighboring neurons (see ﬁg. .). ese pyramidal ensembles gen-

erate dipoles with are much stronger in electrophysiological recordings [Murakami

and Okada, ]. e strongest signal in EEG recordings arises from dipoles ori-

ented perpendicular to the scalp surface, the strongest signal in MEG recordings are

dipoles tangentially to the scalp [da Silva and Niedermayer, , Nunez and Srini-

vasan, ]. Invasive electrical recordings oﬀer the highest spatiotemporal resolu-

tion. Two aspects are diﬀerentiated in intracranial electrophysiological recordings,

fast discharges – spikes or action potentials (APs) – and low frequency content, oen

called local ﬁeld potentials or LFP. Neuronal spikes are associated with the output of

the computations carried out by a neuron. e high frequency spectrum of neu-

rophysiological recordings containing the spiking activity of many single units is

called multi-unit activity (MUA). As the electrode is at a ﬁxed position relative to

14 CHAPTER 3. UNIMODAL NEUROIMAGING

the cells in its surround, action potentials of diﬀerent cells can be diﬀerentiated by

their distinct AP shapes recorded at the electrode. Classifying single cell units is

called spike sorting. e LFP is hypothesized to reﬂect subthreshold membrane os-Local Field Potentials

(LFPs) are slow oscil-

lations in neurophy-

siological signals; fast

(above 1KHz) oscilla-

tions are called multi-

unit activity (MUA)

cillations (the input to or state of a neuron before it is sending out spikes). Metabolic

signals of brain activity, such as the BOLD contrast, has been reported to be more

correlated with LFPs than spikes [Logothetis et al., , Goense and Logothetis,

]. is ﬁnding is consistent with the larger number of mitochondria¹ found in

the dendritic parts of neurons (the input site) as compared to the axons (the output

part) [Wong-Riley, , Attwell and Laughlin, ].

Signalpropertiesandlimitations Electrophysiologicalmeasurementshaveahigh

spatiotemporal resolution. Intracranial recordings have the potential to resolve sin-

gle cell activity. However the eﬀective spatial resolution depends on the number

and arrangement of electrodes. For intracranial neurophysiology the temporal res-

olution is typically around Khz. Depending on the number of electrodes up to

 single cells can be recorded at once [Lehev and Nicolelis, ]. Recently there

has been increased interest in LFP signals. ey are easy to measure, with much

lower sampling rates than spikes, but exhibit a similar sensitivity for sensory fea-

tures compared to spikes in early visual cortex [Xing et al., ]. Also the LFP is

shown to correlate better than spikes with the fMRI signal [Logothetis et al., ,

Goense and Logothetis, ]; for many applications the high temporal resolution

of spikes is not needed and LFPs represent a promising alternative [Waldert et al.,

]. e spatial resolution of LFPs is on the order of hundreds of micrometers

[Katzner et al., ]. Non-invasive electrophysiological measures like EEG have a

lower spatial resolution on the order of several millimeters as measured by compar-

ing EEG based dipole estimates with fMRI activation centers [Im et al., ]. Al-

thoughthereis evidencethatthephase ofthe EEG oscillationsis coupledtointracra-

nial spikes [Whittingstall and Logothetis, ] neuronal spikes cannot be detected

directly using EEG. Hence EEG as well as MEG is typically sampled at frequencies

below KHz.

3.2 Hemodynamic measurements

Neural activity is reﬂected indirectly in the hemodynamic response. Hemodynamic

activity in the entire brain can be measured non-invasively using fMRI. Nowadays

commercially available fMRI setups can easily be operated without any knowledge

about MRI physics. us fMRI has become by far the most popular neuroimaging

technique [Friston, ]. It is important to keep in mind that fMRI does not mea-

sure neural activity directly. e relationship between neural and hemodynamic

¹Mitochondria are the cellular power plants and provide the energy needed for the ion pumps pre-

serving electrochemical gradients across neuronal membranes.

3.2. HEMODYNAMIC MEASUREMENTS 15

signals is still subject of active research [Logothetis, ]. As more and more non-

technically minded researchers are making use of this powerful technique, recent

studies highlighted the importance of solid statistical standards for fMRI analysis

[Kriegeskorte et al., , Bennett et al., , Vul et al., ].

Physiological origin Neural activity in the brain consumes energy which is de-

livered by the blood stream. e blood supply is controlled by the highly complex

cortical microvasculature, shown in ﬁg. . as a vascular corrosion cast² of primary

visual cortex of the macaque monkey. Most importantly the vascular system has to

ensure that there is always enough oxygen delivered to the neurons. A blockage of

the arterial vessels on the cortical surface can aﬀect whole cortical columns under-

neath and thus can have devastating consequences on cognitive functions. Oxygen

is carried through the blood stream via erythrocytes, the red blood cells. e cell

plasma of erythrocytes is rich in hemoglobin molecules, which transport oxygen.

e important aspect for neuroimaging is that deoxygenated hemoglobin (HbR) Oxygenated and deoxy-

genated blood have

diﬀerent magnetic and

light absorption

properties

and oxygenated hemoglobin (HbO) have diﬀerent magnetic and light absorption

properties. is gives rise to the blood-oxygen level dependent (BOLD) signal. e

BOLD contrast is a complex combination of blood oxygenation, blood ﬂow and

blood volume [Buxton et al., ]. Hemodynamic signals can be measured in-

vasively using intrinsic signal optical imaging (ISOI) [Grinvald et al., , Frostig

et al., ] as well as non-invasively using functional magnetic resonance imaging

(fMRI) [Ogawa et al., ] or near infrared spectroscopy (NIRS) [Jöbsis, ].

Signal Properties and Limitations e temporal resolution of intrinsic optical

imaging data is typically around Hz (see e.g. [Berwick et al., ]). While op-

tical imaging can visualize hemodynamic signals on the cortical surface at a high

resolution, the depth resolution is rather poor. Intrinsic optical imaging requires

opening (or at least removing parts of) the skull. Although initial applications of

optical imaging were in basic neuroscience research [Grinvald et al., , Frostig

et al., ] its usefulness for diagnostic purposes is also being explored in human

patients [Pouratian et al., ]. NIRS operates with infrared light which can travel

through the intact skull, measurements can thus be taken non-invasively. is non-

invasiveness comes at the price of a poor spatial resolution. e advantage is that

NIRS setups are simple, low-cost and portable [Villringer and Chance, , Wolf

et al., ]. e advantages of optical imaging and NIRS are combined in fMRI:

Like NIRS, fMRI measurements are non-invasive. And like optical imaging, the spa-

tial resolution of fMRI measurements is high; in contrast to optical imaging, fMRI

can image deep subcortical structures. Using specialized imaging protocols and

hardware, fMRI can resolve cortical laminae [Goense and Logothetis, ]. e

²In vascular corrosion casts vessels are perfused with plastic and surrounding tissue is removed; the

negative of the vascular system can be imaged at high resolution with electron microscopy;

16 CHAPTER 3. UNIMODAL NEUROIMAGING

Figure .: Vascular system in primary visual cortex of the macaque monkey;

veins are shown in blue, arteries in red, micro vessels in grey (taken from [Keller

et al., ] with kind permission of Anna Lena Keller, MPI Biological Cybernet-

ics, Tübingen)

point spread function of fMRI signals is on the order of mm [Shmuel et al., ,

Sirotin et al., ]. Of course the neural signal is blurred by the vascular structure,

but this can actually be helpful for decoding the neural signal as the microvascula-

ture has a functional meaning. Recent work demonstrates a laminar speciﬁcity of

the ratio of microvasculature-to-cell-density [Weber et al., , Tian et al., ].

Incorporating these insights into detailed models of neurovascular coupling allowsCortical

microvasculature gives

rise to complex

hemodynamic

activation patterns

for a more detailed analysis of fMRI signals [Boas et al., , Guibert et al., ].

e temporal resolution of fMRI can be below Hz, but high temporal resolution

comes at the price of poor signal to noise ratio of the image sequence. In human

experiments the spatial resolution of fMRI signals is typically much lower and the

temporal resolution is usually below Hz; an advantage of human fMRI is that single

subjects’ brain scans can be co-registered with template brains. is makes multi-

subject analysis possible. But the co-registration requires spatial smoothing which

reduces the eﬀective spatial resolution of human fMRI scans. To summarize, the

best spatiotemporal resolution for whole brain imaging is obtained by fMRI. As

fMRI requires hardware which is expensive to maintain and is not portable, opti-

cal imaging methods are a sensible alternative for measuring hemodynamic activity

in many applications.

3.3. PRIMARY VISUAL CORTEX: A WELL STUDIED BRAIN REGION 17

3.3 Primary visual cortex: A well studied brain region

When investigating neurovascular coupling, it is reasonable to choose a brain re-

gion that is well characterized. Testing hypotheses about how multiple modalities

are related is easier if the functional and anatomical properties of the brain region

measured are well understood. In terms of the functional characterization the best

studied sensory modality is undoubtedly the visual sense. In their seminal work Neurons in primary

visual cortex (V1)

respond selectively to

bars of light at a certain

orientation

[Hubel and Wiesel, ] found that cortical neurons in occipital regions respond

very selectively to bars of light of a certain orientation when presented in the vi-

sual ﬁeld. In  the authors received the Nobel Prize. As the primary visual cor-

tex (also called V) is one of the best understood brain areas we will focus on data

recorded in this brain region throughout the entire dissertation. An example of the

cortical functions ﬁrst described in [Hubel and Wiesel, ] is shown in ﬁgure ..

e data was taken from [Bhattacharyya et al., ]. When placing a microelec-

trode in V and presenting driing gratings of diﬀerent orientation (see ﬁg. .A)

to the animal, cortical neurons will emit more action potentials than during rest.

Cells in V are selective for gratings of a certain orientation: A cell will emit more

spikes when the preferred orientation was shown as opposed to when the orthogonal

orientation is shown (see ﬁg. .B).

100

Spikerate [Hz]

Preferred

Orthogonal

0 0.5 1 1.5 2 0

Time [s]

45°135°

225°315°

100

Spikerate [Hz]

mAChR Agonist

Preferred

Orthogonal

−0.5

Time [s]

OTI 6.6

45°135°

225°315°

100

120

Spikerate [Hz]

Recover

Preferred

Orthogonal

−0.5

Time [s]

OTI 1.7

45°135°

225°315°

Orthogonal (45°)

Preferred (135°)

Figure .: Selectivity of neurons recorded in primary visual cortex in response

to moving gratings of diﬀerent orientation; A: Peristimulus time histogram (top

panel) and single trial spike raster plots (bottom panel) of a cortical cell’s spiking

responsetoa preferredstimulus(gratingwith◦angle)andorthogonalstimulus

(◦); stimulus on- and oﬀset is marked by vertical black dashed line; highest

spike rates were recorded when the preferred orientation was shown, lowest spike

rates when the orthogonal orientation was shown; B: Polar plot summarizing the

responses (in a s window) to all stimulus orientations (in degree); black circles

indicate Hz increments in spike rate, a light blue inner polygon denotes median

ﬁring rate for a given orientation, a grey tube indicates variance across trials;

18 CHAPTER 3. UNIMODAL NEUROIMAGING

Chapter 4

Unimodal analysis approaches

Tchapterwill recapitulatepopularanalysisconceptsforunimodalneuroimag-

ing data in order to set the stage for multimodal extensions in later chapters.

From a data analyst’s point of view, unimodal analysis methods can be categorized

into three classes:

•Supervised methods

Are concerned with regression between measurements

and/or experimentally controlled variables.

•Unsupervised methods

Find structure in measurements in an explorative fashion.

•Model driven methods

Fit physiological models to measurements.

Many neuroimaging studies combine methods from more than one of these

three classes. us the analysis of a particular neuroimaging study is oen

diﬃcult to assign to one of these three classes. Nonetheless this categorization can

be helpful. Each of these classes has advantages and drawbacks. Not all methods

can be applied to every data set. For instance supervised methods usually require

an experimentally controlled variable as regressor, which is not available in every

experimental setting. And some scientiﬁc questions can only be addressed with

methods from one of the three. For example in order to test a scientiﬁc hypothe-

sis about a certain biological process it is oen convenient to formulate a model and

compare the model predictions with experimental data. While an exhaustive review

of these categories is beyond the scope of this dissertation, a basic understanding of

standard neuroimaging analyses will be helpful in later chapters. e following sec-

tions will give a short introduction to some methods popular in classical unimodal

neuroimaging analyses.

20 CHAPTER 4. UNIMODAL ANALYSIS APPROACHES

4.1 Supervised Methods

Supervised methods have in common that a regressor x∈

Uis used to explain a

target variable y∈



f(x) = ˆ

y+ε(.)

where f(.)denotes a function that maps the regressor onto an estimateˆ

yof the target

variable yand εdenotes noise variance that is not captured by f(.). is function is

chosen such that some diﬀerence measure between target variable and its estimate

is minimized

argmin

f(.)(∥ˆ

y−y∥p), (.)

where the distance measure oen is the pnorm ∥x∥p=p

√∑xp. Typically one

choses p= such that eq. (.) minimizes the least-squares error [Legendre, ].

During the last decades two main streams of supervised neuroimaging data analysis

have emerged, mass-univariate methods, also known as statistical parametric maps

(SPMs) [Friston and Buchel, ], and multivariate pattern analysis (MPA) [Cox

and Savoy, , Haynes and Rees, , Norman et al., ]. Both are based on

(mostly linear) regression, only the role of target variables and regressors are dif-

ferent. ere is an ongoing debate as to whether mass-univariate SPMs are better

than MPA approaches; the main argument for SPMs is that brain activation can be

better localized with mass-univariate methods [Kiebel and Friston, ]; other au-

thors advocate the higher sensitivity of multivariate methods [Kriegeskorte et al.,

]. While the methods used for SPMs and MPA approaches are similar, theyA hemodynamic

response function

(HRF) models the

temporal dynamics of

the fMRI signal in

response to a neural

stimulus

rely on diﬀerent assumptions about the data generating process. Some assumptions

however the two approaches have in common. For instance all supervised settings,

when applied to fMRI data, need to correct for the hemodynamic lag when corre-

lating fMRI signals with a stimulus. is is usually done by convolving the stimulus

regressor with a canonical hemodynamic response function (HRF). An example of

a typical HRF as implemented in the analysis soware SPM [Friston and Buchel,

] is shown in ﬁgure .. It is based on a biomechanical model of the hemody-

namic response, see [Buxton et al., , Friston et al., ] and section .. e

HRF was generated using the MATLAB function spm_hrf.m in SPM¹. is HRF

is commonly assumed to be the same for all voxels and subjects, hence canonical.

While this is a convenient assumption, there is evidence for a considerable variabil-

ity of HRFs across subjects and brain regions [Aguirre et al., , Handwerker et al.,

] – neglecting this variability will lead to poor sensitivity to hemodynamic ac-

tivation which does not follow the canonical HRF dynamics. Chapter  will show

an alternative approach without these assumptions.

¹e parameters used were the default parameters, delay of response (relative to onset) s, delay

of undershoot (relative to onset) s, dispersion of response s, dispersion of undershoot s, ratio of

response to undershoot s, onset (seconds) s, s, temporal resolution Hz

4.1. SUPERVISED METHODS 21

5 10 15 20 25 30 35 40 45 50

Time [s]

Activity [a.u.]

Artificial measurements

Stimulus

Stimulus * HRF

0 5 10 15 20

Temporal delay [s]

Neuron

HRF

Figure .: Example of a canonical hemodynamic response function (HRF); in

order to construct a regressor for the GLM analysis, a stimulus time series (black,

le panel) is convolved (∗) with the canonical HRF (grey, right panel); the result-

ing hypothesized fMRI time series is a smoothed version of the stimulus;

Mass-univariate methods In mass-univariate methods one treats single voxel or

dipole time series as target variable yand predicts their time course from a linear

combination of multivariate regressors xcontaining experimentally controlled vari-

ables and parameters that are not of interest but account for variance y. If the time

course is explained well by the regressor of interest, that voxel will have a high weight

in the SPM. Most SPM analyses are based on the so called general linear model Statistical parametric

maps (SPMs) visualize

activation patterns that

are correlated with an

experimental stimulus

(GLM). GLMs are the most oen used class of supervised methods for ﬁnding sta-

tistical parametric maps (SPMs) of neural activation [Friston et al., , Friston

and Buchel, ]. For a given data set the target variable y∈

L(e.g. a single voxel

time course of length L) is modeled as a linear combination of all Nregressors, each

weighted by a coeﬃcient stored in a vector β∈

N, plus some gaussian i.i.d. error

ε∼ N(, )

y=Xβ +ε. (.)

e L×Nmatrix X= [xx. . . xN]containing the time series of Nregressors

xiof length Las column vectors is called design matrix. A typical GLM analysis General linear models

(GLMs) are the most

oen used method to

compute SPMs

includes as regressors all experimentally controlled parameters and additionally so

called nuisance regressors, which are not of interest in the analysis but explain some

of the variance in the data. In fMRI data, such a nuisance regressor could be for

instance head movement within the scanner. Including an estimate of movement

along each axis (pitch, roll, yaw) into the design matrix will improve the ﬁt of the

GLM. In SPM analyses for fMRI data yis a single voxel time course and the same de-

sign matrix Xis ﬁtted to all voxels separately. is approach is called mass-univariate

analysis. Equation (.) can be solved by the ordinary least squares (OLS) solution

β= (X⊤X)−X⊤y⊤. (.)

e magnitude of the entries in the vector βcan now be subjected to statistical tests.

Voxels with values of βthat are signiﬁcantly diﬀerent from zero will be considered

22 CHAPTER 4. UNIMODAL ANALYSIS APPROACHES

in bands below 24 Hz (δ/θ,α,βbands) makes smaller

contributions to the BOLD signal changes. The estimated

optimal spatial filter (Fig. 2B), on the other hand, shows

that voxels within the gray matter (red stripe) of V1 make a

larger contribution to the maximized correlation between

the neural and fMRI signals. In the present study the γand

MUA contributions were greater than those of spiking

activity, which commonly represents a small group of cells.

The strong contribution of the γand MUA bands was

observed in most cases (see also Fig. 3A), and it confirms

previous results showing that the highest correlation

between the neural and the BOLD responses occurred in

the 30- to 140-Hz range of the neural signal in intracranial

recordings in the anesthetized monkey [11] and in human

patients [18,19].

In order to compare tkCCA with conventional correlation

analysis, we estimated the canonical correlogram as defined

in Eq. (3). This canonical correlogram is similar to the

univariate cross-correlogram, except that it is strictly

positive. However, the right sign of correlation can be

identified by looking at the signs of the projection

coefficients (see Methods section). Much as with the

correlogram computed in mass-univariate approaches, the

peak of the correlogram was found at a lag of about 5 s,

suggesting that, at least in area V1, the time-to-peak of the

BOLD response occurs 5 s after the onset of neural activity.

Fig. 1. Recording hardware, activation maps and analysis principles for combined physiology-fMRI experiments. (A) The upper column shows anatomical

images and the multichannel electrode used in this study. Blue arrows represent positions of recording sites on the electrode; 7 out of 10 channels are visible. The

lower column shows the BOLD responses (Pb.001) tested by means of a full-field visual stimulus. Note the minimal susceptibility artifacts caused by the

electrode; no distortions are evident in the activation map. (B) Spontaneous changes in the amplitude of different frequency bands. Depicted are the bands δ/θ,α,

β,γL, γ,γH, γVH, MUA (multi-unit activity) with 1–8, 8–12, 12–24, 24–40, 40–60, 60–100, 120–250 and 1000–3000 Hz, respectively. Spikes were

extracted by detecting zero crossings and thresholding the high-pass signal. Arrows show examples of synchronous γand spiking activation. The entire

frequency–time matrix was used for tkCCA. (C) Spontaneous changes in the BOLD signal in the regions of V1, defined on the basis of anatomical criteria in each

slice. The upper trace shows the average time course of BOLD fluctuations from all voxels seen below.

5Y. Murayama et al. / Magnetic Resonance Imaging xx (2010) xxx–xxx

ARTICLE IN PRESS