Document [original]

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

EVALUATING THE ACCURACY OF THE AMBISONIC REPRODUCTION OF MEASURED

SOUNDFIELDS

Sam Clapp

Graduate Program in Architectural Acoustics

Rensselaer Polytechnic Institute

Troy, New York, USA

[email protected]

Anne Guthrie

Arup Acoustics

New York, NY

[email protected]

Jonas Braasch

Graduate Program in Architectural Acoustics

Rensselaer Polytechnic Institute

Troy, New York, USA

[email protected]

Ning Xiang

Graduate Program in Architectural Acoustics,

Rensselaer Polytechnic Institute

Troy, New York, USA

[email protected]

ABSTRACT

A spherical microphone array can encode a measured soundfield

into its spherical harmonic components. Such an array will be sub-

ject to limitations on the highest spherical harmonic order it can

encode and encoding accuracy at different frequencies. Ambison-

ics is a system designed to reproduce the spherical harmonic com-

ponents of a measured or virtual soundfield using multiple loud-

speakers. In ambisonic systems, the size of the sweet spot is wave-

length dependent, and thus decreases in size with an increase in

frequency. This paper examines how to reconcile the limitations

of the recording and playback stages to arrive at the optimum am-

bisonic decoding scheme for a given spherical array design. In

addition, binaural models are used to evaluate these systems per-

ceptually.

1. INTRODUCTION

Spherical microphone arrays have been studied extensively for beam-

forming [1, 2], source localization, and other applications. Like-

wise, much theoretical and practical work has been done on the

optimum methods for ambisonic decoding [3, 4, 5]. Both systems

use the concept of spherical harmonics: spherical microphone ar-

rays can decompose a soundfield into its spherical harmonic com-

ponents, while ambisonic decoding can reconstruct the spherical

harmonic components of a soundfield for a listener.

The performance of spherical microphone arrays is frequency-

dependent, affected primarily by the array’s size and number of

sensors. In many applications, such as beamforming and direction-

of-arrival (DOA) estimation, a narrow frequency band is used,

where the array’s performance is optimum. However, restricting

oneself to a narrow frequency band is untenable when presenting

auditory scenes to listeners. Thus, we require a way to utilize infor-

mation from outside of the optimum band, where higher spherical

harmonic orders might not be accurately decomposed, but lower

orders are.

Much of the literature on ambisonics deals with the reproduc-

tion of simulated soundfields, where the exact DOA of every sound

event is known, and the restrictions on the highest spherical har-

monic order come only from the number of channels in the loud-

speaker array.

This paper examines a method for determining mixed-order

ambisonic decoding schemes determined by the constraints of the

two systems, as well as a way to evaluate the perceptual accuracy

of these schemes using binaural models.

2. SPHERICAL HARMONICS

The homogeneous wave equation is given in its general form by:

∇2p=1

∂2p

∂t2.(1)

If expressed in spherical coordinates, solutions can be obtained

through a separation of variables, yielding sets of functions for the

radial, azimuthal, and elevation components [6]. Solutions to the

azimuthal component are given by either sine and cosine terms or

complex exponentials, while solutions to the elevation component

are given by the associated Legendre functions (Pm

n(x)) in the co-

sine of the elevation angle. Combining these two components (and

adding a normalization term) yields the (complex-valued) expres-

sion for spherical harmonics:

n(θ, φ) = s2n+ 1

4π

(n−m)!

(n+m)!Pm

n(cos θ)eimφ.(2)

(The real-valued expression uses sine and cosine terms for the az-

imuthal component.)

The spherical harmonics form an orthonormal basis on the

sphere:

Z2π

0Zπ

Ym0

n0(θ, φ)∗Ym

n(θ, φ) sin θ dθ dφ =δnn0δmm0.(3)

3. SPHERICAL MICROPHONE ARRAY PROCESSING

3.1. Spherical Harmonic Decomposition

When a plane wave impinges upon a rigid sphere, the sphere will

radiate a spherical wave whose intensity will vary as a function

169

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

0.1 0.5 1 5 10

−40

−35

−30

−25

−20

−15

−10

−5

Amplitude (dB)

n=0

n=1

n=2

n=3

n=4

n=5

n=6

Figure 1: Modal amplitude for orders zero through six, as a func-

tion of ka, a quantity that relates the wavelength of the incoming

plane wave to the radius of the spherical array.

of the incident wave’s angle of incidence and wavelength (in re-

lation to the radius of the sphere). It is shown in [6] and [7] that

by solving the wave equation with the appropriate boundary con-

ditions, we arrive at an expression for the pressure at a point on

a rigid sphere of radius a(denoted by its angular position (θ, φ))

due to a plane wave incident from (θi, φi)with amplitude P0and

wavenumber k= 2πf/c:

p(θ, φ, ka) = 4πP0

∞

n=0

inbn(ka)

m=−n

n(θ, φ)Ym

n(θi, φi)∗,

(4)

with

bn(ka) = jn(ka)−j0

n(ka)

h(1)0

n(ka)h(1)

n(ka),(5)

a quantity referred to as the modal amplitude, which is shown for

several orders in Fig. 1. The slope of each curve is 3 dB per octave

multiplied by the order.

Now, if we apply a weighting factor Wto each point on the

sphere:

Wm0

n0(θ, φ, ka) = Ym0

n0(θ, φ)∗

4πin0bn0(ka),(6)

and then integrate over the entire sphere, we can use the orthonor-

mality of the spherical harmonics (Eq. 3) to yield the following

result:

Z2π

0Zπ

Wm0

n0(θ, φ, ka)p(θ, φ, ka) sin θ dθ dφ =

P0Ym0

n0(θi, φi)∗.(7)

Thus, we can determine the spherical harmonic components of the

incident plane wave. However, evaluating the integral in Eq. 7 re-

quires a continuous spherical transducer, and in practice, we must

sample the pressure at Qdiscrete points on the sphere (denoted by

their angular positions (θq, φq)), leading to the following summa-

tion:

q=1

Wm0

n0(θq, φq, ka)p(θq, φq, ka)Cm

n(θq, φq)≈

P0Ym0

n0(θi, φi)∗,(8)

where Cm

nare the quadrature coefficients. Using a nearly uniform

sampling scheme, we can estimate the spherical harmonic compo-

nents up to order Nsuch that Q≥(N+ 1)2.

1 2 3 4 5

31.5

125

250

500

Sphere Radius (cm)

Frequency (Hz)

WNG = −20 dB

1 2 3 4 5

500

16k

Sphere Radius (cm)

Frequency (Hz)

Aliasing Frequency

7th Order

6th Order

5th Order

4th Order

3rd Order

2nd Order

1st Order

Figure 2: Left side: frequency at which WNG = -20 dB as a func-

tion of sphere radius for a 64-channel array at different spherical

harmonic orders. Right side: frequency at which aliasing errors

occur as a function of sphere radius for different spherical har-

monic orders.

3.2. Error Sensitivity

Spherical microphone arrays lend themselves well to beamform-

ing, a set of techniques for multi-channel sensors that allow for

spatial filtering [2, 8, 9, 10]. These techniques involve solving for

a set of weighting factors that are applied to each channel on the

array, allowing the array to “look” in a particular direction at a par-

ticular frequency. The robustness of the beamformer is given by a

quantity known as the white noise gain (WNG):

WNG(θ0, φ0, θq, φq, ka) = 10 log10 |dTW|2

WHW,(9)

where dTis a column vector of the microphone pressure values

due to a plane wave impinging on the sphere from some direction

(θi, φi), and Wthe sensor weights to look in that direction. WNG

represents the array’s sensitivity to noise and microphone position-

ing errors, with negative values representing an amplification and

positive values representing an attenuation of (spatially uncorre-

lated) white noise. A spherical array is most sensitive to these

types of errors at lower frequencies, where higher order spherical

harmonic components are low in level and must be amplified con-

siderably. The lefthand portion of Fig. 2 shows the frequencies

at which WNG = -20 dB for a 64-channel array of varying radius.

This value can be used as a guideline to determine the lowest fre-

quency at which a certain order of spherical harmonic components

can be accurately measured by a given array.

3.3. Aliasing

At higher frequencies, the aliasing of higher order spherical har-

monic components into lower orders becomes an issue. We can

see from Eq. 4 that a plane wave is not spherical harmonic order-

limited and from Eq. 5 that higher-order components become more

prominent at higher frequencies. Thus, aliasing errors affect the ar-

ray for frequencies such that ka > N, where ais the radius of the

sphere and Nis the highest spherical harmonic order that can be

measured by the array [11]. This frequency is shown in the right-

hand portion of Fig. 2 for a range of sphere radii and spherical

harmonic orders, with arrays of smaller radii and higher order (i.e.

170

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

with a greater number of channels) offering higher thresholds for

aliasing.

4. AMBISONICS

4.1. Basic Decoding

Ambisonics is a system that uses multiple loudspeakers to synthe-

size a soundfield, where the gains of the loudspeakers are deter-

mined based on the spherical harmonic expansion of the soundfield

[12, 3, 4, 5].

Let us start with a plane wave of wavenumber kiand incident

from the direction (θi, φi), defined by its spherical harmonic coef-

ficients as:

p=eikir= 4π

∞

n=0

injn(kr)

m=−n

n(θ, φ)Ym

n(θi, φi)∗.

(10)

If we want to synthesize this plane wave with Lplane wave sources

located at (θl, φl)for 1≤l≤L, each with an amplitude of wl,

then our expression for the synthesized field ˆpis:

ˆp= 4π

∞

n=0

injn(kr)

m=−n

n(θ, φ)

l=1

wlYm

n(θl, φl)∗.

(11)

Then we solve for each nand min order to have Eq. 11 be equal

to Eq. 10:

l=1

wlYm

n(θl, φl)∗=Ym

n(θi, φi)∗.(12)

These amplitudes wlcan be solved for up to order N, subject to

the requirement that L≥(N+1)2for 3-D arrays and L≥2N+1

for horizontal arrays. This is known as basic decoding.

4.2. Wavefield Error Analysis

One method of evaluating the quality of the reconstruction is by

calculating the normalized radial error between the synthesized

sound field ˆpand the sound field being reconstructed, p, given by:

¯(kr) = R2π

0Rπ

0|p−ˆp|2sin θ dθ dφ

R2π

0Rπ

0|p|2sin θ dθ dφ .(13)

This quantity is a function of the distance from the center of the ar-

ray as related to the wavelength one is examining, expressed as kr,

and is shown for 1st through 7th order (moving from left to right)

in the top portion of Fig. 3. A given value of kr will be a larger dis-

tance from the center of the array for a lower frequency (i.e. longer

wavelength) than for a higher frequency. Thus, the sweet spot will

be larger for lower frequencies than for higher frequencies.

One way to evaluate the performance of ambisonic systems in

terms of human perception is to examine the frequency at which

the radius of the sweet spot (as determined by some threshold in

the normalized radial error) becomes smaller than the radius of

the average human head (given in [4] as 8.9 cm). As the order of

ambisonic reproduction increases, so will this frequency, as shown

in the bottom portion of Fig. 3, for several different thresholds of

normalized radial error. Thus, as higher spherical components are

added to the reconstruction, higher frequencies are reconstructed

accurately at the two ears.

0 2 4 6 8 10 12

ï30

ï20

ï10

Distance from Center (kr)

Normalized Radial Error (dB)

1234567

500

Ambisonic Reconstruction Order

Threshold Frequency (Hz)

ï10 dB

ï15 dB

ï20 dB

Figure 3: The top shows normalized radial error (in dB) as a func-

tion of distance from the center of the array (in units of kr), from

1st to 7th order basic decoding (moving left to right), after Ward

and Abhayapala[13] and Poletti[4]. The bottom shows the fre-

quencies at which the -10, -15, and -20 dB error thresholds occur

at a radius of 8.9 cm, the size of a typical human head.

4.3. Max-rEDecoding

For a given order of ambisonic reproduction, there will be some

frequency above which the area of the sweet spot will be smaller

than the size of a typical human head. Thus, audio content above

this frequency will not be simulated accurately at the listener’s

ears. In [14], Gerzon proposed that high frequency localization

can be predicted by the direction of the energy vector at the center

of the array, ˆrE, as given in:

rEˆrE=PL

l=1 w2

lˆul

l=1 w2

,(14)

where wlis the gain of the lth loudspeaker radiating sound from a

position denoted by the vector ˆulfrom the loudspeaker’s position

to the center of the array. The magnitude of the vector, rE, repre-

sents the concentration of source energy in the desired direction,

and thus the accuracy of high-frequency localization from that di-

rection. In order to maximize this value, one can apply correcting

gains (g0, g1, ..., gN)to each order of spherical harmonics (i.e. to

the right side of Eq. 12.) Methods for calculating the correcting

gains are given in [3]. This is known as max-rEdecoding.

4.4. Binaural Cue Analysis

The localization accuracy of different decoding schemes can be

examined using a model of the auditory system used previously in

[15] to evaluate the localization of various stereo recording tech-

niques. The auditory periphery is simulated with Head-Related

Transfer Functions (HRTFs). The behavior of the basilar mem-

brane and the hair cells is simulated with a gammatone filter bank

with 72 bands and half-wave rectification. ITD analysis is per-

formed using an interaural cross-correlation in each frequency band.

ILD analysis is performed using an array of excitation/inhibition

(EI) cells. This model allows for the creation of maps that corre-

171

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

late ITD and ILD values (expressed in milliseconds and decibels,

respectively) with azimuthal directions.

This allows for another way to evaluate the accuracy of am-

bisonic reproduction - by calculating errors in the ITD and ILD

cues rather than errors in the reproduced wavefield, as detailed ear-

lier. First, the spherical harmonic signals are decoded to a virtual

24-channel horizontal loudspeaker array (with equiangular spac-

ing, putting each channel 15 degrees apart in azimuth from its

neighbors.) The signals reaching each ear are obtained by con-

volving the loudspeaker feeds with the appropriate HRTFs and

summing over all loudspeakers. Plane waves can then be simu-

lated from a variety of directions, and the ITD and ILD cues can

be compared to the natural condition.

From these maps we can then look at average binaural cue er-

ror as a function of frequency by averaging over azimuthal angles

and also over a number of different HRTF catalogs, as measured in

[16]. This is shown in Fig. 4 for 1st through 7th order ambisonic

rendering. The dotted lines indicate the average error across all

frequency bands. The results are in line with what we would ex-

pect from an analysis of the wavefields:

1. Cue errors are less at lower frequencies

2. The higher the order, the higher the frequency at which sig-

nificant cue errors occur

3. The higher the order, the lower the average cue error across

all frequency bands

5. BAND-SPLITTING FILTERS

As discussed in the previous section, it is possible to achieve more

perceptually accurate decoding by using basic decoding in lower

frequencies and max-rEdecoding in higher frequencies. This ne-

cessitates a crossover network to transition between the different

decoding schemes. One popular crossover is the Linkwitz-Riley

(LR) crossover [17]. A second-order LR crossover is formed from

cascading two first-order Butterworth filters, a fourth-order LR

crossover from two second-order Butterworth filters, etc. The ben-

efit of the LR crossover is that the low- and high-pass portions

are phase-matched, and the magnitude of each portion is -6 dB at

the crossover frequency, leading to a unity gain for the sum. The

rolloff is equal to 6 dB per octave multiplied by the order.

Depending on the characteristics of the particular systems be-

ing used, one might want to decode the signals in three or more

bands, requiring multiple crossovers. Putting two crossovers too

close to one another in frequency results in a summed signal whose

magnitude deviates noticeably from unity gain. These problems

can be avoided by placing crossovers far enough apart in frequency

so that the magnitudes of the low-pass and high-pass components

are equal at a maximum value of -20 dB.

6. PROCESSING MEASURED SOUNDFIELDS FOR

PLAYBACK

In this section, four hypothetical spherical microphone arrays are

considered: two 16-channel and two 64-channel rigid spherical ar-

rays, each set with radii of 2.5 and 5 cm. These arrays utilize a

nearly-uniform sampling scheme, with the positions of the sen-

sors given in [18]. Therefore, the 16-channel arrays are capable

of decomposing the spherical harmonics up to third order, the 64-

channel arrays up to seventh order.

Fig. 5 shows the White Noise Gain thresholds (the lower line

corresponding to a WNG value of -30 dB, the upper to -20 dB) and

aliasing frequencies, plotted together with the thresholds for tran-

sitioning from basic to max-rEdecoding. (Note that the decoding

thresholds are based on the size of a typical human head, and thus

are not affected by the properties of the microphone array.) Over-

laid on these plots are the decoding schemes chosen based on the

principles outlined previously, namely:

1. Do not utilize spherical harmonic components below a WNG

of -20 to -30 dB

2. Do not decode above the spherical harmonic aliasing fre-

quency

3. Use max-rEdecoding at frequencies above which the nor-

malized radial error reaches -10 to -20 dB

4. Separate multiple crossovers by at least 20 dB.

As before, where we compared different orders of ambison-

ics, we can evaluate the accuracy of the ITD and ILD cues of these

various decoding schemes (of “mixed" order) across multiple fre-

quency bands, as shown in Fig. 6 (with the plots of 1st and 5th

order ambisonics shown for reference). The average errors across

all frequency bands shown in Table 1.

Table 1: Average ITD and ILD cue error and aliasing frequency

for each spherical microphone array.

Array

Avg. ITD

error (ms)

Avg. ILD

error (dB)

Aliasing

Frequency (Hz)

a=2.5 cm, Q=16 0.24 4.3 6551

a=5 cm, Q=16 0.21 4.1 3275

a=2.5 cm, Q=64 0.20 4.1 15285

a=5 cm, Q=64 0.17 3.8 7643

There are two main points that we can gather from Fig. 6. The

first is that increasing the number of channels for a spherical mi-

crophone array of a given radius is a “win-win” scenario: higher

spherical harmonic orders become available, the WNG threshold

frequencies for lower orders are pushed lower in frequency, and

the aliasing frequency moves higher, yielding a wider frequency

range for reconstruction. Thus, we see improvements in both lo-

calization accuracy and bandwidth. Of course, increasing the num-

ber of channels will increase the cost and complexity of building

the array. In addition, going from a 3rd order to a 7th order micro-

phone array does not yield the same gains in localization accuracy

as moving from 3rd order to 7th order ambisonic reproduction of

a simulated soundfield, as the highest order components are some-

times available for only 1 or 2 octaves before encountering the

aliasing frequency.

Increasing the spherical array radius, however, involves a trade-

off. On the positive side, the WNG thresholds move lower, mak-

ing higher order spherical harmonic components available at lower

frequencies. However, the aliasing frequency also moves lower,

meaning that we are trading localization accuracy for bandwidth.

7. CONCLUSION

The common basis in spherical harmonics makes ambisonics a nat-

ural fit for reproducing soundfields measured with spherical mi-

crophone arrays. Incorporating higher order spherical harmonic

172

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

31.5 63 125 250 500 1k 2k 4k 8k

0.1

0.2

0.3

0.4

0.5

0.6

Frequency (Hz)

ITD Error (ms)

ITD Errors by Frequency Band

1st

2nd

3rd

4th

5th

6th

7th

31.5 63 125 250 500 1k 2k 4k 8k

Frequency (Hz)

ILD Error (dB)

ILD Errors by Frequency Band

Figure 4: ITD and ILD error by frequency band for 1st through 7th order ambisonics, with average error over all frequency bands shown

by the dotted lines.

components offers the opportunity for more precise localization,

but at the same time introduces complexities at both the recording

stage and the playback stage that need to be dealt with. The goal of

this paper is to illuminate the sources of those issues and develop

a framework to resolve them, particularly with respect to auditory

perception.

8. REFERENCES

[1] Boaz Rafaely, “Plane-wave decomposition of the sound field

on a sphere by spherical convolution,” Journal of the Acous-

tical Society of America, vol. 116, no. 4, pp. 2149–2157, Oc-

tober 2004.

[2] Zhiyun Li and Ramani Duraiswami, “Flexible and opti-

mal design of spherical microphone arrays for beamform-

ing,” IEEE Transactions on Audio, Speech, and Language

Processing, vol. 15, no. 2, pp. 702–714, February 2007.

[3] Jérôme Daniel, Jean-Bernard Rault, and Jean-Dominique Po-

lack, “Ambisonic encoding of other audio formats for mul-

tiple listening conditions,” in Proc. of the 105th Conven-

tion of the Audio Eng. Soc., San Francisco, California, USA,

September 26–29, 1998.

[4] M. A. Poletti, “Three-dimensional surround sound systems

based on spherical harmonics,” Journal of the Audio Engi-

neering Society, vol. 53, no. 11, pp. 1004–1025, November

2005.

[5] Aaron J. Heller, Eric M. Benjamin, and Richard Lee, “A

toolkit for the design of ambisonic decoders,” in Linux Audio

Conference, CCRMA, Stanford University, CA, USA, April

12-15 2012.

[6] Earl G. Williams, Fourier Acoustics: Sound Radiation and

Nearfield Acoustical Holography, Academic Press, 1999.

[7] Philip M. Morse and K. Uno Ingard, Theoretical Acoustics,

McGraw-Hill, Inc., 1968.

[8] Jens Meyer and Gary W. Elko, “A highly scalable spherical

microphone array based on an orthonormal decomposition of

the soundfield,” in IEEE International Conference on Acous-

tics, Speech, and Signal Processing, Orlando, Florida, USA,

May 13–17, 2002.

[9] Jens Meyer, Gary W. Elko, and Tony Agnello, “Spherical

microphone array for spatial sound recording,” in Proc. of

the 115th Convention of the Audio Eng. Soc., New York, New

York, USA, October 10–13, 2003.

[10] Jens Meyer and Gary W. Elko, Audio Signal Processing for

Next-Generation Multimedia Communication Systems, chap-

ter 3, “Spherical Microphone Arrays for 3D Sound Record-

ing", pp. 67–89, Kluwer Academic Publishers, Hingham,

Massachusetts, USA, 2004.

[11] Boaz Rafaely, Barak Weiss, and Eitan Bachmat, “Spatial

aliasing in spherical microphone arrays,” IEEE Transactions

on Signal Processing, vol. 55, no. 3, pp. 1003–1010, March

2007.

[12] Michael A. Gerzon, “Periphony: With-height sound repro-

duction,” Journal of the Audio Engineering Society, vol. 21,

no. 1, pp. 2–10, January/February 1973.

[13] Darren B. Ward and Thushara D. Abhayapala, “Reproduc-

tion of a plane-wave sound field using an array of loudspeak-

ers,” IEEE Transactions on Speech and Audio Processing,

vol. 9, no. 6, pp. 697–707, September 2001.

[14] Michael A. Gerzon, “General metatheory of auditory local-

isation,” in Proc. of the 92nd Convention of the Audio Eng.

Soc., Vienna, Austria, March 24-27 1992.

[15] Jonas Braasch, “A binaural model to predict position and ex-

tension of spatial images created with standard sound record-

ing techniques,” in Proc. of the 119th Convention of the Au-

dio Eng. Soc., New York, NY, USA, October 7–10, 2005.

[16] Jonas Braasch and Klaus Hartung, “Localization in the pres-

ence of a disctracter and reverberation in the frontal horizon-

tal plane. i. psychoacoustical data,” Acta Acustica united with

Acustica, vol. 88, no. 6, pp. 942–955, November/December

2002.

[17] Siegfried H. Linkwitz, “Active crossover networks for non-

coincident drivers,” Journal of the Audio Engineering Soci-

ety, vol. 24, no. 1, pp. 2–8, January/February 1976.

[18] J. Fliege and U. Maier, “The distribution of points on the

sphere and corresponding cubature formulae,” IMA Journal

of Numerical Analysis, vol. 19, no. 2, pp. 317–334, 1999.

173

Proc. of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany, 3-5 April 2014

250

16k

Frequency (Hz)

a = 2.5 cm, Q = 16

123

250

16k

Order

Frequency (Hz)

a = 5 cm, Q = 16

a = 2.5 cm, Q = 64

1 2 3 4 5 6 7

Order

a = 5 cm, Q = 64

WNG Thresholds

Decoding Thresholds

Aliasing Frequency

1st Order Basic

2nd Order Max-rE

1st Order Basic

2nd Order Basic

3rd Order Max-rE

1st Order Basic

2nd Order Basic

4th Order Max-rE

1st Order Basic

2nd Order Basic

5th Order Max-rE

Figure 5: WNG thresholds and aliasing frequencies for 4 spherical arrays, plotted together with basic/max-rEambisonic decoding thresh-

olds, overlaid with decoding schemes.

31.5 63 125 250 500 1k 2k 4k 8k

0.1

0.2

0.3

0.4

0.5

0.6

ITD Errors by Frequency Band

Frequency (Hz)

ITD Error (samples)

a = 2.5 cm, Q = 16

a = 5 cm, Q = 16

a = 2.5 cm, Q = 64

a = 5 cm, Q = 64

31.5 63 125 250 500 1k 2k 4k 8k

ILD Errors by Frequency Band

Frequency (Hz)

ILD Error (dB)

Figure 6: ITD and ILD errors for the 4 different decoding schemes, with the errors for 1st and 5th order ambisonic decoding shown for

reference.

174