Ambisonics is a holophonic soundfield sampling and synthesis technique.
Jérôme Daniel, in his landmark paper introducing NFC-HOA, describes Ambisonics as "a very versatile approach for the spatial encoding and rendering of sound fields," and lists the following advantages of the technique:1
Stepping onto our path towards enlightenment, we'll begin by considering Ambisonics in the context of pair-wise panorama laws. We'll observe how the angular component of Ambisonics is similar to, but an optimized form of, a panning technique with which we're already familiar.
We'll then consider the meaning of Ambisonic order, the spatial resolution of the technique. We'll see how order relates to:
Our discussion then closes with a brief review of the Near-Field Controlled Ambisonic Soundfield Model. This is perhaps Daniel's most important contribution to the art: it moves the radius of the basic wave from infinity (classic, Gerzonic Ambisonics) to the mid-field.
We build a visualisation using a collection of virtual loudspeakers (secondary sources) and a virtual microphone (soundfield sampler). We then review three different travelling waves, observing the resulting encoding coefficients and returned encoded signals.
A panorama law, aka panning law, is a rule detailing how a loudspeaker array synthesizes a spatial sound image. This rule may act by creating amplitude, phase and time differences between loudspeakers to synthesize the desired phantom image. In practice, not all of these aspects are always touched, and different panning laws may emphasize one aspect over another.
In the discussion here we'll compare pair-wise panning laws with those returned by Ambisonics. We'll also restrict the Ambisonic laws to basic panning, i.e., sources to be panned and target loudspeakers are at the reference radius.
We'll review radial aspects later.
Let's begin with the two channel stereophonic sine-cosine panning law,2 as this is the panning law used by SuperCollider's Pan2 UGen. From the help, we see this is described as a "Two channel equal power panner". In other words, the panorama effect is a result of amplitude scaling alone: the input signal is scaled in an equal power distribution between the two loudspeakers.
If we look at the source code, we can see the function used is sine.
Let's make a plot to visualize...
What we see is that we have a rule to govern how much signal is passed to the left and right to synthesize a phantom image.
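The rule itself is compact enough to state directly. Here's a quick numeric sketch in Python; this is an illustrative re-implementation of the sine-cosine law, not Pan2's actual source:

```python
import math

def pan2(pos):
    """Equal power stereo pan. pos in [-1, 1]; -1 = hard left, 1 = hard right."""
    theta = (pos + 1) / 2 * (math.pi / 2)   # map pan position to [0, pi/2]
    left = math.cos(theta)
    right = math.sin(theta)
    return left, right

# At center, both channels sit at sqrt(2)/2, i.e. -3 dB,
# and left^2 + right^2 stays at unity for every position: equal power.
l, r = pan2(0.0)
```

Note the defining property: amplitude gains trade off along a quarter circle, so the summed power is constant wherever the image is panned.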
Given the default arguments, and setting numChans to four:
will return a pair-wise equal power quadraphonic panning rule.
Let's go ahead and test this panner with DC and plot the results. We're starting at the left speaker and panning counter-clockwise all the way around:
What we see here is the amplitude scaling rule for all four speakers in order to pan a sound in a counter-clockwise rotation around the array. We can see that no more than two loudspeakers are active at once.
Also, note that the rule can be described as a collection of windows in space or spatial windows.
Keep this plot open, as we're going to compare this rule with Ambisonics.
Here we'll start with two of SuperCollider's FOA built-ins, PanB2 and DecodeB2, to build a quadraphonic panner.3 The first UGen is a basic 2D encoder, and the second is a controlled opposites, aka cardioid, 2D decoder. Following an Ambisonic encoder with an Ambisonic decoder returns a panning law:
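We can sketch this encode-then-decode pattern numerically, too. The Python below assumes a simple first order 2D convention (W = 1, X = cos, Y = sin) and a cardioid (controlled opposites) virtual microphone per loudspeaker; it's an illustration of the idea, not the PanB2/DecodeB2 internals:

```python
import math

def foa_pan_law(source_az, speaker_azs):
    """First order 2D encode followed by a cardioid (controlled opposites)
    decode. Returns one gain per loudspeaker -- i.e., a panning law."""
    n = len(speaker_azs)
    w, x, y = 1.0, math.cos(source_az), math.sin(source_az)   # basic 2D encoding
    # each loudspeaker samples the soundfield with a cardioid virtual microphone
    return [(w + x * math.cos(az) + y * math.sin(az)) / n for az in speaker_azs]

quad = [math.pi / 4, 3 * math.pi / 4, 5 * math.pi / 4, 7 * math.pi / 4]  # square
gains = foa_pan_law(math.pi / 4, quad)
# the cardioid law never drives a loudspeaker with inverted polarity,
# and the summed amplitude is constant for every source direction
```

Unlike the pair-wise rule, every loudspeaker here is (potentially) active at once; the decoder's virtual microphone pattern is what shapes the window.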
In time domain signal processing, sharp window shapes are associated with frequency domain aliasing.4
In the spatial domain, sharp windows are associated with spatial domain aliasing.
The original architects of classic first order Ambisonics were deeply concerned about the spatial domain aliasing found in the quad recordings of the Age of Quadraphonic Sound. One of their goals was to reduce or remove the spatial distortions found in these recordings.
Their solution was to offer a choice of three different panning laws for finishing off the rule. These choices are equivalent to PanAz's width parameter; but instead of being an ad hoc choice, the different laws for Ambisonics are defined against optimization criteria.
The ATK uses the parameter name beam shape within the HOA toolset.5
Three standard spatial windows are offered:
| keyword | beam shape | localisation vector | virtual microphone |
|---|---|---|---|
| \basic | strict soundfield | maximum velocity rV | hyper-cardioid |
| \energy | energy optimised | maximum energy rE | super-cardioid |
| \controlled | controlled opposites | minimum diametric energy | cardioid |
In the codeblock immediately below you'll notice that the HOA toolset code for making an Ambisonic equivalent panner for quad is much more verbose. In return, we gain much greater control.
We'll use the ATK's projection decoder, HoaMatrixDecoder: *newProjection, to create the quad decoder. newProjection is a very simple, but powerful decoder. It quickly calculates the matrices required for decoders where space has been sampled equally. To design a 2D decoder, we just supply the vertices of a regular polygon.6
Go ahead and try each of the three window choices.
With the basic and energy windows, we see the scaling function drops below zero in places. If plotted in polar form, we'd see the familiar tails of first order hyper-cardioid and super-cardioid microphones.
Look closely to find where these tails appear in the windows. Of particular interest: by dropping below zero, they are inverted in polarity. Their peaks appear opposite the peaks in the facing loudspeaker's window. We can say: where one loudspeaker pushes, the opposite pulls.
(Feel free to close the open plots.)
For convenience, we'll use an array where the first loudspeaker is at front center, and we'll start the test from directly behind, so that the plot returns the first window centered. As before, the panning angle will rotate counter-clockwise.
This plot really gives a clear sense that panning laws are spatial windows. We see each window offset in space. (Keep this plot open.)
Now let's do the same analysis, but just keep the window for the first loudspeaker:
(And, keep this plot open, too!)
Go ahead and try each of the three window choices.
(After inspection, feel free to close these.)
And, another plot, keeping just the front center loudspeaker:
(After inspection, feel free to close these.)
Let's do one more plot, where we compare the window shape of pair-wise octaphonic with HOA3 strict soundfield:
What we're seeing here is that in the main lobe of the two windows, the octaphonic pair-wise law is similar to the HOA3 strict soundfield law. That's interesting, in that it indicates that pair-wise octaphonic panning gives something in the neighborhood of Ambisonics!7
(go ahead and quit the server)
(and close the open plot windows, except for the last one comparing pair-wise and basic HOA3)
This isn't completely obvious, and may seem counterintuitive, but an expert in windows for filtering will see the two plots as related: the HOA3 law looks like a smoothed version of the pair-wise law.
Let's do a little experiment.
When we compare the sine window with a windowed sinc, we see some remarkable similarities with our previous plot:
A windowed sinc is a lowpass filter. Frequency domain anti-aliasing filters are often designed by starting with a windowed sinc.
For more insight, let's review the frequency response of these two:
What we are seeing here is that the windowed sinc is a fairly well behaved lowpass filter with a flat top and a smooth roll off. This isn't the case with the sine window.
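The claim is easy to check numerically. Below is a small Python sketch using a Blackman-windowed sinc, one common design choice, not tied to any particular toolkit:

```python
import cmath, math

def windowed_sinc(num_taps, cutoff):
    """Lowpass FIR: sinc at normalized cutoff (cycles/sample), Blackman windowed."""
    m = num_taps - 1
    taps = []
    for i in range(num_taps):
        t = i - m / 2
        s = 2 * cutoff if t == 0 else math.sin(2 * math.pi * cutoff * t) / (math.pi * t)
        w = 0.42 - 0.5 * math.cos(2 * math.pi * i / m) + 0.08 * math.cos(4 * math.pi * i / m)
        taps.append(s * w)
    return taps

def response_mag(taps, freq):
    """Magnitude of the filter's DTFT at normalized frequency freq (cycles/sample)."""
    return abs(sum(h * cmath.exp(-2j * math.pi * freq * i) for i, h in enumerate(taps)))

h = windowed_sinc(65, 0.1)
# the passband (DC) stands tall; frequencies well past the cutoff are
# attenuated by many tens of dB -- a well behaved lowpass
```

The sine window, evaluated the same way, shows neither the flat top nor the deep stopband.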
Because we can, let's directly view the frequency response of the HOA3 strict soundfield panning law.
What we're seeing is that the HOA3 basic (strict) panning law has a well behaved lowpass response in the frequency domain when viewed as a time domain window.
In the spatial domain, the Ambisonic panning law acts as a spatial lowpass filter. Its role is that of a spatial anti-aliasing filter, aka a spatial Nyquist filter.
Let's see how this works in practice by going back to quad, comparing a pair-wise quad law with an HOA3 quad law:
Remarkably, when we go back to quad from HOA3, we see that the panning law window has opened up again!
This opening up is spatial smoothing, aka lowpass filtering in the spatial domain.
If we bother to do a check, we'll find that the quad law for HOA3 (when using the projection decoder) is the same as the one for HOA1.
This is a result of the Ambisonic laws applying a spatial anti-aliasing filter.
Also, by inspecting the window frequency response, we can see the spatial cutoff is higher for the octaphonic array. The octaphonic array has a higher spatial sampling rate. For HOA3 with the quadraphonic array, the spatial anti-aliasing filter rejects spatial detail that would otherwise alias.
In contrast, the pair-wise laws are very leaky. They have higher cutoffs, but significantly more spatial aliasing.
(feel free to close any open plots)
Maintaining isotropy is one of the more important concerns in the design of Ambisonic panning laws.
Let's directly compare the panning laws of pair-wise sine-cosine quad with those of HOA basic quad.
The example code below makes a single window for each law. The directional amplitude and power response of the two arrays are then simulated. The plots returned illustrate these two measures for both arrays.
Here's what we see when we inspect these plots:
The HOA quad law is isotropic for both of these measures.
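We can verify the amplitude measure by hand, using simplified stand-ins for the two laws: a pair-wise sine-cosine pan and a first order cardioid (controlled opposites) decode. Both are illustrative Python re-implementations, not ATK code:

```python
import math

def pairwise_amp(pos):
    """Summed loudspeaker gain of a pair-wise equal power quad law
    at pan position pos (radians)."""
    frac = (pos % (math.pi / 2)) / (math.pi / 2)    # position within active pair
    return math.cos(frac * math.pi / 2) + math.sin(frac * math.pi / 2)

def hoa_amp(pos, speaker_azs):
    """Summed loudspeaker gain of a first order cardioid
    (controlled opposites) law."""
    return sum((1 + math.cos(pos - az)) / len(speaker_azs) for az in speaker_azs)

quad = [0, math.pi / 2, math.pi, 3 * math.pi / 2]
# pair-wise: 1.0 at a loudspeaker, about 1.414 between them -- anisotropic
# cardioid decode: 1.0 in every direction -- isotropic
```

The pair-wise law's amplitude sum bulges between loudspeakers, while the Ambisonic law holds the sum constant around the circle.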
From the ATK Glossary:
Ambisonic order indicates the Associated Legendre degree to which the detail of an Ambisonic soundfield is known.
There are a number of ways to consider the meaning of Ambisonic order. As Ambisonics is a holophonic technique, we'll begin by considering the effective radius of soundfield resynthesis. We'll consider practical aspects of spatial sampling in the spherical and angular domains. And, then end with a brief discussion of localisation measures.
The ATK includes a class, HoaOrder, which can offer formalized understandings of these various aspects of an Ambisonic soundfield. We'll use this lens in much of the discussion that follows.
When we recall the OUTRS tetrahedral recording experiment, the origins of Ambisonics as a soundfield sampling technique become clear. The soundfield is sampled at a single point with a measurement array. We know the soundfield at this point exactly.9
Surprisingly, we also know the soundfield further away from the sampling point, in a frequency dependent way. This is the effective radius:
Let's plot the effective radius against Ambisonic order:
Ambisonic order is on the x-axis and effective radius in meters is on the y-axis. We're measuring at 700 Hz (or 1000 Hz, if you choose). This plot illustrates: as Ambisonic order increases, the region of exact soundfield reproduction also increases.
In particular, at fifth order, we can expect a region of nearly radius = 0.4 meter to be exactly reconstructed for frequencies at and below 700 Hz.
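The rule behind this plot can be sketched with the common approximation that reconstruction holds while kr ≤ N, with k the wavenumber. This is a rule-of-thumb sketch in Python, not the ATK's own implementation:

```python
import math

def effective_radius(order, freq, speed_of_sound=343.0):
    """Approximate radius (m) of exact soundfield reconstruction,
    via the rule of thumb kr <= N."""
    k = 2 * math.pi * freq / speed_of_sound     # wavenumber
    return order / k

r = effective_radius(5, 700)
# fifth order at 700 Hz lands close to 0.4 m, as the plot suggests;
# lower orders reconstruct a smaller region at the same frequency
```

The closed form makes the plot's trend explicit: effective radius grows linearly with order and shrinks inversely with frequency.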
Let's try another plot:
As with our previous plot, Ambisonic order is on the x-axis. The y-axis is frequency, but on a log scale of decimal octaves. For instance:
This plot illustrates: as Ambisonic order increases, the cutoff frequency of exact soundfield reproduction also increases.
In particular, at third order, we can expect a region of radius = 0.25 meter to be exactly reconstructed below 5.3333 decimal octaves:
Knowing the effective radius and effective frequency helps us decide which Ambisonic panning law to use. If the target for playback is a large audience, choosing the strict soundfield law is not necessarily ideal. The energy optimised or controlled opposites laws are better choices.
Frequency dependent laws
Classic FOA employs the psycho-acoustic shelf filter10 to select the strict law at low frequencies and the energy law at highs. The ATK's HOA toolset includes a filter kernel designer to do the job.11 Frequency dependent laws have traditionally been advised for studio and near-field listening. For example:
A single listener can expect a third order soundfield to be reproduced exactly, up to 1820 Hz. Above this point, the energy optimised law is the better choice, as the soundfield isn't exactly reconstructed.
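This crossover figure falls out of the common kr ≤ N rule of thumb, solved for frequency. The listening radius below (r = 0.09 m, roughly head-sized) and c = 343 m/s are assumptions made for this sketch:

```python
import math

def effective_frequency(order, radius, speed_of_sound=343.0):
    """Approximate highest frequency (Hz) of exact reconstruction at a
    given radius, from the rule of thumb kr <= N."""
    return order * speed_of_sound / (2 * math.pi * radius)

f = effective_frequency(3, 0.09)   # third order, head-sized listening radius
# lands at roughly 1820 Hz, matching the figure quoted above
```

Above this frequency, the soundfield at the listener's head is no longer exactly reconstructed, which is why the energy optimised law takes over.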
From the ATK Glossary:
Open the following pages:
The first of these illustrates Spherical Harmonics (SH) up to degree 5; these are the SH for fifth order. We can understand these bubble shapes as illustrating the 3D polar response patterns of each SH. If we like, we can think of these as virtual microphones.
The second illustrates up to degree 4, so these are for fourth order. (We convert a fifth order soundfield to fourth by discarding the SH of degree 5.) These are illustrated as heat maps. Only one side of the "tree" is shown. The symmetries of the sectoral and tesseral SH are shown via the rotating SH.
More from the ATK Glossary:
The spherical harmonics are the basis functions against which we measure the shape of a soundfield.
A zero-th order soundfield is a soundfield without any shape; it has energy only in degree zero.
It becomes immediately clear that Ambisonic order can be directly understood as a kind of spherical domain spatial sampling rate. The higher the order, the more spherical harmonics.
Let's explore some details. We'll begin by considering:
In 3D, aka Periphonic
How resolved, in terms of the number of harmonics, is each of these?
We see that as order increases, so does the number of SH in the spherical domain. We can think of Ambisonic order as directly indicating a spatial sampling rate in the spherical domain.
For translations of soundfields to the angular domain, the ATK uses spherical t-designs. We can find the minimum size design required for each order by observing the returned value:
3D Soundfield Spatial Sampling Rates
The table below compares the number of coefficients required for the spherical and angular domains:12
| order | spherical SR | angular SR |
|---|---|---|
One way we can read the table immediately above is to understand that spherical harmonics are a fairly efficient way to represent a soundfield. For fifth order, we need only 36 harmonics, but in the angular domain, 24 more spatial samples are required for the job.
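The spherical column follows a closed form: a 3D soundfield of order N is described by (N + 1)² spherical harmonics. A quick Python check (the angular counts depend on the particular t-design chosen, so only the spherical side is computed here):

```python
def num_sh_3d(order):
    """Number of spherical harmonics (3D spherical domain coefficients)
    for a given Ambisonic order: (N + 1)^2."""
    return (order + 1) ** 2

counts = [num_sh_3d(n) for n in range(6)]
# orders 0 through 5 -> 1, 4, 9, 16, 25, 36 coefficients
```

So fifth order indeed needs 36 harmonics, as read from the table.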
In 2D, aka Pantophonic
How resolved, in terms of the number of harmonics, is each of these?
The sectoral harmonics, aka modes, encode the 2D soundfield. You can see we need significantly fewer harmonics here.
2D Soundfield Spatial Sampling Rates
The usual practice is to consider the angular sampling rate for 2D to be one greater than that of the spherical, as doing so returns more stable image synthesis.13
| order | spherical SR | angular SR |
|---|---|---|
The rule for 2D arrays is:
As we saw above with spatial Nyquist filters, an actual loudspeaker array has a spatial Nyquist frequency. For instance, a quad decoder will only be able to synthesize a first order Ambisonic soundfield. This becomes apparent when we evaluate the rule of thumb immediately above.
For a regular polygon, 2D, we can re-write the rule as:14
The same principle is true for 3D loudspeaker arrays.15 If we are designing an isotropic (equal in space) decoder, the degree of resolution is limited by the number of loudspeakers available. For instance, a cube can only be first order:
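These rules invert neatly: given a loudspeaker count, we can solve for the highest order an isotropic decoder supports. A Python sketch, assuming the 2N + 2 rule for regular 2D rings and (N + 1)² ≤ L for 3D arrays:

```python
import math

def max_order_2d(num_speakers):
    """Highest Ambisonic order for a regular 2D ring, via the 2N + 2 rule."""
    return (num_speakers - 2) // 2

def max_order_3d(num_speakers):
    """Highest Ambisonic order for a 3D array,
    requiring (N + 1)^2 <= num_speakers."""
    return math.isqrt(num_speakers) - 1

# quad ring -> first order; octagon ring -> third order; cube -> first order
```

The octagon result matches what we saw earlier: pair-wise octaphonic panning sits in the neighborhood of HOA3.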
Another way we can understand Ambisonic order, and panning law choices (beam shapes) is to consider the localisation measures Ambisonics is designed to optimize:
The strict soundfield option maximizes rV, while the energy optimised option maximizes rE. For off center listeners, rE is usually preferred.
Let's try a plot:
What we see here is that for a third order 2D array, the energy localisation measure for a synthesized Ambisonic image is more than 90% that of a real sound. We expect this energy optimized 2D array to be well defined in terms of energy.
For the controlled opposites law, we require fifth order to get above the 90% threshold.
Classic, aka Gerzonic, Ambisonics has always included the Near-Field Effect (NFE) within its theoretical framework. This inclusion, however, hasn't tended to be especially visible to users on the encoding side of the panning laws. As a result, many users are only familiar with basic encoding, where the encoding coefficients are real.
In classic Ambisonics, basic encoding is planewave encoding.
Daniel's Near-Field Compensated Higher Order Ambisonics (NFC-HOA)16 introduces the Near-Field Effect (NFE) reference radius into the Ambisonic framework to formalize what we might call the Near-Field Controlled Ambisonic Soundfield Model (NFC-ASM).
In practice, we can view this model as a collection of virtual loudspeakers at the reference radius with a virtual microphone at the center.
In theory, this isn't quite the whole story. Recall from our discussion of Panorama Laws that we should view the loudspeakers as a collection of spatial window functions, or basis functions, with look directions. Similarly we should view the microphone as another collection of spatial basis functions, the spherical harmonics. The number of each of these is governed by the principles outlined above.
The soundfield can be represented in both angular and spherical forms.
We'll start with constructing a visualisation of the model. Then we'll consider encoding three different travelling waves. We'll finish up with synthesizing the associated waveforms, directly from the calculated encoding coefficients.
In designing the encoding coefficients for these different travelling waves, you'll see that the encoding law is split between angular and radial encoding. Radial encoding is what allows us to move to either side of the reference radius, and is where our near-field control is found.
We'll start building our model by evenly distributing a number of points over the surface of a sphere. As discussed above, we'll find a spherical t-design which has an angular spatial sampling rate high enough to meet the spherical sampling rate of a selected order:
Given this spherical design, we'll now explicitly collect Spherical coordinate instances, setting the radius of these to the reference radius.
Let's now use PointView to view this array of virtual loudspeakers at the reference radius:
Go ahead and touch the GUI with your mouse or pointer to re-orient the display.
Now, let's add a virtual soundfield microphone:
This is it!
We can imagine the NFC-ASM to be a collection of virtual loudspeakers evenly distributed across the surface of a sphere. The radius of the sphere is the reference radius. At the origin of the sphere is a virtual soundfield microphone.
When we're done inspecting:
The radial part of Ambisonic encoding (the start of the panning law) is frequency dependent, so for this demonstration we'll need to specify a frequency:
Let's now specify a near-field source, encoded at half the reference radius. We'll use a shorthand of naming a travelling wave within the reference radius as a near-field source.
We can see this source is within the virtual loudspeaker array.
Now let's design the encoding coefficients. You'll see we design the angular and radial coefficients separately, and then bring them together for the final encoding law:
The designed coefficients are Complex. We have both real and imaginary parts for each coefficient!
When we inspect the magnitude and phase of the encoding coefficients, we're reviewing the magnitude and phase changes that are required to synthesize Ambisonic encoding of a sinusoid at the frequency we specified above:
Let's plot these values:
Let's now specify a far-field source. Like above, we'll use a shorthand of naming a travelling wave beyond the reference radius as a far-field source.
This source is at one and a half times the reference radius (more like far-ish, actually.17):
Now we can see both the near-field and the far-field source.
And, the far-field encoding coefficients:
We can inspect:
When we compare the magnitude plots of the near and far-field travelling waves, we notice the two are substantially different. In particular, we see the near-field source has high gains in high harmonics, while in the far-field source we see the gains rolling off.
We can also notice that, on comparison, the phases are shifted in opposite rotations. E.g., positive phases for near-field are negative for far-field.
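We can reproduce this magnitude and phase behaviour outside the ATK. The sketch below uses one common formulation of the radial law: a ratio of spherical Hankel functions, with source and virtual loudspeakers both modelled as point sources, normalized so that degree zero is shared. The reference radius, source radii, and frequency here are illustrative assumptions, not values taken from the ATK:

```python
import cmath, math
from math import factorial

def sph_hankel2(n, x):
    """Spherical Hankel function of the second kind, closed form, real x > 0."""
    s = sum((1j**m * factorial(n + m)) / (factorial(m) * factorial(n - m) * (2 * x)**m)
            for m in range(n + 1))
    return ((-1j)**(n + 1) * cmath.exp(1j * x) / x * s).conjugate()

def radial_coeffs(order, freq, radius, ref_radius, c=343.0):
    """Per-degree radial encoding weights for a point source at `radius`,
    referenced to loudspeakers at `ref_radius`; normalized so degree 0 is 1."""
    k = 2 * math.pi * freq / c
    raw = [sph_hankel2(n, k * radius) / sph_hankel2(n, k * ref_radius)
           for n in range(order + 1)]
    return [g / raw[0] for g in raw]

ref = 1.5                                   # reference radius (m), an assumption
near = radial_coeffs(3, 50, 0.75, ref)      # source inside the array
far = radial_coeffs(3, 50, 2.25, ref)       # source outside the array
# near-field: magnitudes grow with degree; far-field: they roll off;
# and the phases rotate in opposite directions
```

As expected, the near-field weights grow with degree while the far-field weights roll off, and the two phase responses rotate in opposite senses.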
Let's compare the angular and radial coefficients for this pair:
So, yes, this test confirms that our two travelling waves have the same angular encoding. They have the same look direction.
Recall from our discussion above: basic panning, aka basic encoding, encodes a source at the reference radius.
Let's specify this:
Now we can see near-field, far-field and basic sources.
Now let's synthesize the coefficients of the basic source:
Notice, for the basic travelling wave, the phase of the encoding coefficients is either 0 or 180 degrees.
This corresponds to the coefficients for basic encoding having no imaginary components:
Plot, and compare with our other plots:
One thing we can see is that as a source moves away from the reference radius, this change is encoded in both magnitude and phase changes.
Let's directly test the encoding coefficients:
So, yes, the angular coefficients are the same. The differences are in the radial coefficients.
As we work with Ambisonic signals, we'll become accustomed to reviewing encoded waveforms. Let's now take the opportunity to synthesize and review a single cycle of our three sources.
For ease of viewing, we'll truncate our coefficients from HOA3 to HOA1:
Synthesize and plot:
When we cycle through these three plots, it becomes apparent that the first channel, degree zero, remains the same for all three travelling waves.
We see that the space of the sound is to be found in the higher degrees, and is encoded in both magnitude and phase.
Zotter, F., Frank, M., & Sontacchi, A. (2010). The Virtual T-Design Ambisonics-Rig Using VBAP. EAA Euroregio Ljubljana 2010.
Zotter, Franz, and Frank, Matthias. Ambisonics. Springer, 2019.
"We can use the smallest set of 2N + 2... as optimal 2D layout."
Zotter, Franz, and Frank, Matthias. Ambisonics. Springer, 2019. (p. 60)