Recording & Rendering


Sound Field Control for Rendering Stereo - 1

Creating a believable Auditory Scene using two loudspeakers in a reverberant room

An Auditory Scene is created in the listener’s mind from cues in the direct sound streams coming from the loudspeakers and from reflected, reverberated and resonant sounds in the room.
For believability the loudspeakers must not be localizable in the aural scene and attention to the listening room must have receded beyond the aural horizon.
The loudspeakers must have a polar response, such that on-axis and off-axis frequency responses are identical and may only differ in level. Omni, dipole, cardioid or other frequency independent radiation patterns are required.
First order room reflections must be delayed by >6 ms compared to the direct sound reaching the listener. Floor reflection should be minimized by the loudspeaker’s directivity.
The loudspeaker and listener equilateral triangle is preferably set up symmetrical to room boundaries or large reflecting surfaces.
The high frequency acoustic center of each loudspeaker should be at ear height.
The direct-to-reverberant sound level ratio for sounds above the Schroeder frequency should be greater than –6 dB at the listening position. An omnidirectional loudspeaker typically requires a close listening distance and/or a dead room. A dipole loudspeaker allows for a 1.7 times larger distance under identical room conditions, or for a 3 times longer reverberation time. It also provides some control over the first lateral reflection and coupling to room modes.
Under these conditions the direct sound from the speakers dominates the auditory scene. Spatial properties of source location, spread and distance are derived from cues embedded in the mix of the recording and cues from the recording venue. Reflections from the listening room are delayed and are copies of the direct sound at a lower level. The room reverberated sound has the same timbre as the direct sound. A listener automatically withdraws attention from the room and falls for the aural illusion, which is delivered by cues in the direct sound streams from the two loudspeakers and is produced in the listener’s mind.

Siegfried Linkwitz
30 June 2013

Sound Field Control for Rendering Stereo - 2

The following was written for the AES 52nd International Conference, "Sound Field Control, Engineering and Perception", Guildford, UK, September 2013. The manuscript was accepted and the presentation scheduled, but I cannot attend the Conference due to conflicting family events. I withdrew the manuscript and now publish it reformatted on my website.

ABSTRACT

Stereo program material reproduced over two loudspeakers creates an aural scene in the listener’s mind. The phantom scene usually appears to originate at and behind the line between the loudspeakers. The perception is derived from cues in the direct sound streams from each loudspeaker and from room reflected and reverberated sound streams. Ideally, the loudspeakers and the listening room are not heard as such. It has been found that reflected streams and reverberation must carry the same spectral content as the direct streams and that first reflections must be delayed by proper placement of the speakers. The loudspeakers must be designed for frequency independent directional behavior. Practical approximations are shown for a monopole and a dipole constant directivity loudspeaker. Limitations to radiation pattern and sound output capability due to driver and baffle dimensions are discussed.

INTRODUCTION

Visual inspection shows that the vast majority of loudspeakers currently sold for use in domestic stereo or home theatre set-ups are designed with only minor attention to their off-axis radiation behavior. The target is usually a flat on-axis frequency response. In some cases the response is specified within certain dB limits throughout a rectangular listening window of ±30° horizontal and ±10° vertical width. In even fewer cases the power response and directivity index are specified or controlled. A change in directivity from 0 dB below 200 Hz to over 10 dB above 10 kHz is normal. Such loudspeakers are omni-directional at low frequencies and become increasingly forward beaming at higher frequencies.

On-axis and off-axis radiation causes room reflections, excites room modes and determines the strength and timbre of the reverberant sound field. A listener’s perception of the phantom acoustic scene between the loudspeakers depends upon the ratio of direct sound streams coming from the loudspeakers at ±30°, to the room modified indirect sound streams arriving at a multitude of angles of incidence and all summing at the ears. In a highly damped room with RT₆₀ < 250 ms, as in a recording studio, the reverberated sound field is weak and becomes perceptually insignificant under close-field monitoring conditions. Common domestic size living and listening spaces have reverberation times of >400 ms. The optimum listening distance can become inconveniently short. The typical box loudspeaker radiates too much power at low frequencies, feeding room modes out of proportion to the power in the reverberant field at higher frequencies.

Optimal conditions for phantom source perception are obtained when a) the room reverberated sound streams have the same spectral content as the direct streams, when b) the first reflections are delayed by >6 ms, and when c) the ratio of direct to reverberated stream levels is greater than –6 dB. Under those conditions the listener’s brain can withdraw attention from room and loudspeakers and focus on the cues in the direct sound. A pair of loudspeakers with frequency independent radiation pattern, placed >1 m from large reflecting surfaces, set up in an equilateral triangle with the listener and symmetrical with the room boundaries, can provide a remarkable stereo phantom scene with fine spatial detail and believability.
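The 6 ms criterion can be turned into a path-length check. The following is a minimal sketch, not part of the original manuscript. It assumes a speed of sound of 343 m/s, a single flat side wall at x = 0, a mirror-image model of the reflection, and a hypothetical 2.4 m equilateral triangle with the speaker 1.2 m from the wall. All names and dimensions are illustrative.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 °C


def extra_path_for_delay(delay_ms):
    """Extra path length (m) a reflection must travel to arrive delay_ms late."""
    return SPEED_OF_SOUND * delay_ms / 1000.0


def side_wall_reflection_delay_ms(speaker, listener):
    """Delay (ms) of the first reflection from a wall at x = 0.

    speaker and listener are (x, y) positions in metres. The reflection is
    modelled by mirroring the speaker across the wall (image source).
    """
    xs, ys = speaker
    xl, yl = listener
    direct = math.hypot(xl - xs, yl - ys)
    reflected = math.hypot(xl + xs, yl - ys)  # image source sits at (-xs, ys)
    return (reflected - direct) / SPEED_OF_SOUND * 1000.0


# Hypothetical layout: equilateral triangle with 2.4 m sides,
# near speaker 1.2 m from the side wall.
side = 2.4
speaker = (1.2, 0.0)
listener = (speaker[0] + side / 2, side * math.sin(math.radians(60)))

min_extra_path = extra_path_for_delay(6.0)
example_delay = side_wall_reflection_delay_ms(speaker, listener)

print(f"Extra path needed for 6 ms: {min_extra_path:.2f} m")
print(f"Side-wall reflection delay in the example layout: {example_delay:.2f} ms")
```

A 6 ms delay corresponds to roughly 2 m of additional path, which is consistent with keeping the loudspeakers more than 1 m from large reflecting surfaces. In the assumed layout the side-wall reflection arrives slightly earlier than 6 ms, which suggests that more wall distance would be needed under these assumptions.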

The requirements for optimum rendering of stereo have not been explicitly stated in the literature, nor fully investigated, but many pointers have been provided over the last six decades. For example see references [1-20].

1. CONSTANT DIRECTIVITY LOUDSPEAKERS

Basic models for loudspeakers with frequency independent radiation pattern are the monopole, cardioid and dipole. The monopole interacts maximally with the room. Any practical monopole is constructed from sub-enclosures, which decrease in size with increasing frequency, in order to remain acoustically small, while providing the necessary volume displacement for loudness. A cardioid loudspeaker requires an acoustic flow resistor for its operation unless it is realized by combining a monopole and a dipole radiator. To build a broadband and linear acoustic flow resistor for low frequency reproduction can be difficult. Also, internal cavity and panel resonances can cause spurious radiation similar to a monopole.

A cardioid would not illuminate the wall behind it with direct sound, but has the same overall sound output into the room as a dipole with the same on-axis SPL and has therefore an identical Direct-to-Reverberant ratio at the listening position. Compared to a monopole the D/R is 4.8 dB higher. This also means that the reverberation time of the room could be three times higher for the same D/R ratio as from a monopole. It is a significant advantage for the dipole or cardioid radiator. In a typical domestic room with RT₆₀ = 500 ms their reverberation distance is the same as for a monopole in a highly damped room with RT₆₀ = 167 ms. Here RT₆₀ refers to the frequency range above the Schroeder Frequency of around 150 Hz [2].
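The figures in this paragraph follow from the directivity factor of an ideal dipole or cardioid, which is 3 relative to a monopole. The sketch below is illustrative only. It assumes the common Sabine-based approximation for the reverberation distance, r = 0.057·sqrt(Q·V/RT₆₀), and a hypothetical room volume of 60 m³. The room volume cancels out of every ratio.

```python
import math

# Directivity factors of ideal first-order radiators (monopole = 1).
DIRECTIVITY_FACTOR = {"monopole": 1.0, "dipole": 3.0, "cardioid": 3.0}


def reverberation_distance(q, volume_m3, rt60_s):
    """Distance (m) at which direct and reverberant levels are equal.

    Uses the approximation r = 0.057 * sqrt(Q * V / RT60), an assumption
    of this sketch rather than a formula taken from the manuscript.
    """
    return 0.057 * math.sqrt(q * volume_m3 / rt60_s)


room_volume = 60.0  # m^3, hypothetical domestic room

# D/R advantage of a dipole over a monopole at the same on-axis SPL.
dr_gain_db = 10.0 * math.log10(
    DIRECTIVITY_FACTOR["dipole"] / DIRECTIVITY_FACTOR["monopole"]
)

# Listening distance advantage in the same room.
distance_ratio = (
    reverberation_distance(DIRECTIVITY_FACTOR["dipole"], room_volume, 0.5)
    / reverberation_distance(DIRECTIVITY_FACTOR["monopole"], room_volume, 0.5)
)

# RT60 a monopole would need to match a dipole in a 500 ms room.
equivalent_rt60_ms = 500.0 / DIRECTIVITY_FACTOR["dipole"]

print(f"D/R gain: {dr_gain_db:.1f} dB")
print(f"Distance ratio: {distance_ratio:.2f}")
print(f"Equivalent monopole RT60: {equivalent_rt60_ms:.0f} ms")
```

Under these assumptions the script reproduces the 4.8 dB, the factor of about 1.7 in distance, and the 167 ms quoted in the text.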

A dipole radiator is the most practical solution for a constant directivity loudspeaker, provided that its baffle dimensions are acoustically small and the radiating elements are tightly positioned together in a vertical line. Dipole loudspeakers in the form of large planar electrostatic or magnetic radiator panels tend to suffer from multi-beam radiation due to acoustically large radiating surfaces and from insufficient output volume at low frequencies. Size and output problems can be overcome by using conventional piston drivers with appropriate cone diameter and large excursion as radiating elements.

Design and construction of constant directivity loudspeakers has its difficulties, and tradeoffs must be made as with all loudspeakers. Radiation in the vertical direction is usually restricted in favour of controlled horizontal dispersion. In the following we present design issues for a small experimental monopole radiator, a higher output monopole and a state-of-the-art dipole.

1.1 Monopole - CD Proto #1

The loudspeaker is formed by a 75 mm diameter driver in a 100 mm diameter coupler at the top of an ABS pipe, which is sealed at its bottom (Figure 1). The correct amount of absorbing material on the inside of the pipe minimizes reflected signals being transmitted through the cone and provides sufficient internal volume to keep the driver’s mechanical resonance low. The high stiffness of the pipe suppresses spurious sound radiation from its surface. The frequency response in the horizontal plane is independent of azimuth angle due to axial symmetry of the radiator. The frequency response in the vertical plane (Figure 2) is independent of elevation angle below 700 Hz, but varies considerably at higher frequencies due to diffraction and the inherent frequency response of the driver.

Figure 1: Loudspeaker setup in an equilateral triangle
with the listener’s head.

Figure 2: Frequency response in the vertical plane.
D/λ = 1 at 3.4 kHz.

The acoustic center for vertical radiation is positioned 40 mm above the top plane of the driver (Figure 3). The microphone must be rotated around this point for the measured low frequency response to become independent of elevation angle [21]. The vertical response follows to some extent the general pattern given by a point source at the end of a cylinder (Figure 4, top). This type of radiator is not a true monopole, like an acoustically small pulsating sphere, but it follows its response in the horizontal plane and to a lesser degree in the vertical plane.

Figure 3: Template for microphone positioning to
measure the frequency response in the vertical plane

Figure 4: Ratio of the pressure (for the normal axis point)
on a rigid obstacle to the pressure in the incident sound wave [22].

A pair of loudspeakers like this provides excellent spatial rendering even without equalization of the colored response. Imaging is precise and the sweet spot is very wide. The speakers must be listened to from a close distance to preserve detail in the phantom scene and because their maximum output volume is sufficient for voice but not for music program material [23].

1.2 Monopole - CD Proto #2

The limited output volume of Proto #1 is overcome by using two drivers to cover the frequency range, an upward firing woofer/midrange unit of 115 mm cone diameter and a forward firing 38 mm tweeter (Figure 5). The tweeter is mounted forward of the woofer axis to minimize diffraction. An electrical delay moves the tweeter’s acoustic center back to the woofer axis so that the driver signals add correctly in the ±1 octave overlap region of the 1 kHz LR4 crossover. The frequency response in the horizontal plane (Figure 6), halfway between woofer and tweeter, measured at 0.3 m from the woofer axis, is independent of angle from 0° to 180° below 500 Hz.
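The reason the acoustic centers need to coincide can be seen from the ideal LR4 transfer functions: each branch is 6 dB down at the crossover frequency and the two add in phase to a flat magnitude. The sketch below evaluates the textbook fourth-order Linkwitz-Riley responses (two cascaded second-order Butterworth sections) at 1 kHz. It models ideal filters only and ignores the drivers, the delay circuit and any path-length differences.

```python
import math


def _butterworth2_denominator(f, fc):
    """Denominator s^2 + sqrt(2) s + 1 of a 2nd-order Butterworth, s = j f/fc."""
    s = 1j * f / fc
    return s * s + math.sqrt(2.0) * s + 1.0


def lr4_lowpass(f, fc):
    """Ideal LR4 low-pass: squared 2nd-order Butterworth low-pass."""
    d = _butterworth2_denominator(f, fc)
    return 1.0 / (d * d)


def lr4_highpass(f, fc):
    """Ideal LR4 high-pass: squared 2nd-order Butterworth high-pass."""
    s = 1j * f / fc
    d = _butterworth2_denominator(f, fc)
    return s ** 4 / (d * d)


def to_db(h):
    """Magnitude of a complex response in dB."""
    return 20.0 * math.log10(abs(h))


crossover_hz = 1000.0

for f in (500.0, 1000.0, 2000.0):  # the +/-1 octave overlap region
    low = lr4_lowpass(f, crossover_hz)
    high = lr4_highpass(f, crossover_hz)
    print(
        f"{f:6.0f} Hz  LP {to_db(low):7.2f} dB  "
        f"HP {to_db(high):7.2f} dB  sum {to_db(low + high):6.2f} dB"
    )
```

The flat sum holds only when both outputs reach the listener with the same delay, which is why an offset between tweeter and woofer has to be compensated.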

Figure 5: Woofer and tweeter layout for wide horizontal
and vertical dispersion.

Figure 6: Frequency response in the horizontal plane.
The on-axis response has been equalized.

This also applies to the vertical plane. The diffractive diameter of the mounted woofer is D/λ = 0.44 at 1 kHz and that of the tweeter is D/λ = 0.13, and thus wide dispersion can be expected (Figure 4, top). The horizontal on-axis response is equalized and rolls off with increasing frequency and off-axis angle due to the tweeter becoming directional. The roll-off is essentially monotonic up to 90°. At 120° and above a strong interference notch shows up due to path length differences between the microphone and the acoustic centers of woofer and tweeter and due to the phase shift of the electrical delay circuit. A measurement at greater distance from the drivers is likely to give somewhat different results.
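The ratio of diameter to wavelength used here and in Figure 2 is a simple measure of acoustic size. As a rough sketch, assuming a speed of sound of 343 m/s, the ratio can be computed for the 100 mm coupler of Proto #1, and the quoted ratios for Proto #2 can be converted back into effective diameters. Those back-calculated diameters are implied values, not dimensions stated in the text.

```python
SPEED_OF_SOUND = 343.0  # m/s, assumed


def diameter_in_wavelengths(diameter_m, frequency_hz):
    """Diameter expressed as a fraction of the wavelength at frequency_hz."""
    return diameter_m * frequency_hz / SPEED_OF_SOUND


def diameter_for_ratio(ratio, frequency_hz):
    """Diameter (m) that corresponds to a given diameter/wavelength ratio."""
    return ratio * SPEED_OF_SOUND / frequency_hz


# Proto #1: 100 mm coupler at 3.4 kHz (Figure 2 quotes a ratio of 1).
proto1_ratio = diameter_in_wavelengths(0.100, 3400.0)

# Proto #2: effective diffractive diameters implied by the quoted ratios at 1 kHz.
woofer_diameter = diameter_for_ratio(0.44, 1000.0)
tweeter_diameter = diameter_for_ratio(0.13, 1000.0)

print(f"Proto #1 coupler: {proto1_ratio:.2f} wavelengths at 3.4 kHz")
print(f"Proto #2 woofer: {woofer_diameter * 1000:.0f} mm effective diameter")
print(f"Proto #2 tweeter: {tweeter_diameter * 1000:.0f} mm effective diameter")
```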

The frequency response in the vertical plane varies insignificantly for angles up to 25° as encountered by a person standing at 2.4 m distance from the speakers (Figure 7). Moving sideways by 30° the response drops with increasing frequency as in Figure 6.

Figure 7: Frequency response in the vertical plane compared to the on-axis response at 0° and 30° off-axis in the horizontal plane. The response at +25° is for a person standing at 1.8 m distance from the speakers and at 1.8 m height. The curves are offset by 15 dB.

The speaker behaves like a pulsating sphere below 500 Hz and gradually turns into a forward firing radiator with wide vertical and horizontal dispersion as frequency increases. The speaker has proven itself as a neutral radiator in a reverberant space, rendering precise phantom image detail and dynamics when listened to at less than twice the reverberation distance for a monopole in the specific space [24].

1.3 Dipole - CD Proto #3

Experience has indicated that a full range and acoustically small dipole loudspeaker must be designed as a 4-way radiator. Dimensions for the woofer are easily kept small but this requires very high volume displacements from the drivers. For example, the woofer section can produce a sound pressure level of 94 dB at 30 Hz and 1 m distance when placed on a reflective ground plane. This requires a peak-to-peak volume displacement of 2000 cm³ and front-to-back distance of the open baffle of D_d = 400 mm. [25]

The same 94 dB SPL is generated at 120 Hz by a much lower 31.25 cm³ p-p volume displacement, if D_d = 400 mm. But maintaining dipole behavior over the midrange from 120 Hz to 7 kHz becomes difficult, because front-to-back distance D_d, driver size and baffle dimensions become first comparable to and then larger than the radiated wavelength. Two differently sized drivers and a uniquely shaped baffle were needed to cover such a wide frequency range.
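The two displacement figures are consistent with the cube-law scaling of an acoustically small dipole: for a fixed SPL, distance and front-to-back path D_d, the required volume displacement falls with the third power of frequency. The sketch below assumes that scaling, which holds only while D_d is small compared to the wavelength. The function name is illustrative.

```python
def dipole_displacement_for_same_spl(v_ref_cm3, f_ref_hz, f_hz):
    """Scale the peak-to-peak volume displacement of a small dipole.

    Assumes constant SPL, distance and front-to-back path length, and
    D_d much smaller than the wavelength, so that the required
    displacement varies as 1/f^3.
    """
    return v_ref_cm3 * (f_ref_hz / f_hz) ** 3


# Reference point from the text: 2000 cm^3 p-p for 94 dB at 30 Hz.
v_at_120_hz = dipole_displacement_for_same_spl(2000.0, 30.0, 120.0)
print(f"Displacement needed at 120 Hz: {v_at_120_hz:.2f} cm^3 p-p")
```

Two octaves higher, the requirement drops by a factor of 64, which is why the woofer section dominates the displacement budget.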

1.3.1 Dipole baffle

Figure 8: Dipole baffle from the rear.

The baffle is shown without any drivers mounted and from the rear (Figure 8) [26].

The woofer baffle has openings for two 210 mm piston diameter, long-throw drivers, at a 45° angle to each other. One faces the rear of the baffle with its cone, the other with its magnet, for even order distortion reduction. A bridge above the woofer baffle supports the top baffle for the lower midrange driver (170 mm), the upper midrange driver (75 mm) and two identical tweeters (25 mm). The lower tweeter fires forwards, the upper tweeter fires towards the rear in opposite polarity to generate dipolar radiation.

1.3.2 Dipole woofer

Figure 9: Frequency response in the center of the opening
plane of the V-frame baffle.

The woofer’s frequency response (Figure 9) is measured with the microphone tip in the center of the opening plane of the woofer’s V-frame baffle.

The baffle sits on the ground to account for half-space operation. The response exhibits a peak at 250 Hz, which is the result of a λ/4 cavity-length resonance caused by the acoustic wave impedance mismatch at the baffle opening. The peak is easily equalized. If measured at a large distance, where front and rear radiation from the baffle combine, the woofer’s response would decrease towards low frequencies at a 6 dB/octave rate. Thus, to obtain a dipole response identical to the one measured in the opening plane, it is necessary to add equalization that boosts lower frequencies at a 6 dB/octave rate. Additional equalization can turn the dipole response flat to a target corner. The woofer is crossed over at 120 Hz to a lower midrange driver and operates only in a range where it is acoustically small [27].
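Both effects in this paragraph can be estimated with simple models. The sketch below assumes a speed of sound of 343 m/s, an ideal quarter-wave resonator, and an idealized dipole made of two opposite-polarity point sources whose on-axis path difference equals D_d = 400 mm. The effective cavity length it prints is only what the 250 Hz peak would imply under that idealization, not a measured dimension.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed


def quarter_wave_length(frequency_hz):
    """Effective length (m) of an ideal quarter-wave resonator at frequency_hz."""
    return SPEED_OF_SOUND / (4.0 * frequency_hz)


def dipole_on_axis_db(frequency_hz, path_m):
    """On-axis far-field level of an ideal dipole relative to one of its sources.

    Two opposite-polarity point sources with path difference path_m give
    a magnitude of 2 * sin(pi * f * path / c).
    """
    magnitude = 2.0 * abs(math.sin(math.pi * frequency_hz * path_m / SPEED_OF_SOUND))
    return 20.0 * math.log10(magnitude)


implied_cavity_length = quarter_wave_length(250.0)
slope_db_per_octave = dipole_on_axis_db(60.0, 0.4) - dipole_on_axis_db(30.0, 0.4)

print(f"Effective cavity length implied by a 250 Hz peak: {implied_cavity_length:.3f} m")
print(f"Dipole roll-off between 60 Hz and 30 Hz: {slope_db_per_octave:.2f} dB")
```

The level difference across the octave comes out close to 6 dB, which is the slope the equalization has to compensate.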

1.3.3 Dipolar lower and upper midrange

The polar response of the top baffle is measured from 0.5 m distance in the backyard, being elevated and positioned on a manual turntable (Figure 10). The lower midrange driver exhibits close to textbook behavior up to about 1 kHz (Figure 11). The baffle shape was determined empirically. Only frontal radiation is measured. Rear radiation reaches the listener via room reflections and reverberation, which makes the details of the response much less important. The total power radiated to the rear must be similar to the total power radiated to the front hemisphere.

Figure 10: Polar frequency response
measurement set-up.

Figure 11: Horizontal frequency response of the lower
midrange driver with angular open baffle.

The upper midrange driver with its narrow baffle is well behaved above 1 kHz (Figure 12). The response widens between 0° and 30° around 3.5 kHz but is in general quite usable up to 10 kHz. When the two drivers are combined with a passive quasi-B1 crossover at 1 kHz, i.e. a crossover that is 6 dB down at 1 kHz and provides in-phase addition, then the resulting overall response indicates dipolar behavior over a very wide frequency range (Figure 13). The widening around 3.5 kHz depends upon the vertical position of the measuring axis. Here it is the upper midrange driver axis. In general, it would take many sets of measurements to fully describe the polar radiation details when the vertical distances between drivers and the baffle dimensions are no longer acoustically small.

Figure 12: Horizontal frequency response of the upper
midrange driver on its narrow open baffle.

Figure 13: Combined frequency response of lower and
upper midrange drivers.

1.3.4 Dipolar tweeters

Radiation from front and rear tweeters interacts in space forming a radiation pattern with sound cancellation in the plane of the baffle and a dipole like angular dependency (Figure 14). Below 2 kHz the baffle is acoustically small and a true dipole response results. Dispersion widens with increasing frequency, which is normal for a fixed baffle size. Above 5 kHz the inherent directivity of the dome drivers takes over. Front and rear radiation only interact at large off-axis angles.
An LR4 crossover at 7.5 kHz was chosen to combine the tweeters with lower and upper midranges (Figure 15).

Figure 14: Combined frequency response of front
and rear tweeters.

Figure 15: Combined and equalized frequency response
of lower midrange, upper midrange and tweeter drivers [28].

Figure 16: Vertical frequency response relative to the
upper midrange driver axis.

Response variation with elevation angle relative to the upper midrange driver axis is shown in Figure 16. The +20° curve corresponds to the response for a standing listener at 2.4 m from the speaker. The response shows interference notches at 2 kHz and 7 kHz due to vertical separation of drivers.

Acoustically small dipolar loudspeakers define the state-of-the-art in terms of stereo rendering. For a majority of recordings they fully disappear from the illusionary aural scene in front of the listener, revealing the venue acoustics, microphone placements and natural spatial relationships between virtual sources. The listening room easily disappears from perception allowing for undivided attention to the program material.

2. SUMMARY

Sound field control is essential for optimum stereo rendering, because it determines how the listening room is illuminated with sound by the loudspeakers. The radiation pattern must be frequency independent for the loudspeaker to leave no signature of its own in the room, which would compete with the phantom scene to be created in the listener’s mind. Practical difficulties and trade-offs in designing for constant directivity have been illustrated with three prototypes. Nevertheless, the last two examples can demonstrate what the simple stereo format is capable of in terms of spatial rendering and realism when misleading cues from loudspeaker and room are minimized. One can only hope that the importance of sound field control for home stereo systems becomes widely recognized, that novel loudspeakers will become available and that the generic box loudspeaker sound and room problems are relegated to the past.

3. REFERENCES

[1] H. Harz, H. Koesters, "Ein neuer Gesichtspunkt fuer die Entwicklung von Lautsprechern?", Technische Hausmitteilungen des NWDR, Jahrgang 3, Nr. 12 (1951)
[2] H. Kuttruff, Room Acoustics, John Wiley & Sons (1973)
[3] S. P. Lipshitz, J. Vanderkooy, "Experiments in Direct/Reverberant Ratio Modification", AES 79th Convention, Paper 2301 (1985)
[4] D. Moulton et al, "The Localization of Phantom Images in an Omnidirectional Loudspeaker System", AES 81st Convention, Paper 2371 (1986)
[5] W. Klippel, "Assessing the Subjectively Perceived Loudspeaker Quality on the Basis of Objective Parameters", AES 88th Convention, Paper 2929 (1990)
[6] G. L. Augspurger, "Loudspeakers in Control Rooms and Living Rooms", AES 8th International Conference, The Sound of Audio, pp. 171-178 (1990)
[7] W. M. Hartmann, "Localization of a Source of Sound in a Room", AES 8th International Conference, The Sound of Audio, pp. 27-32 (1990)
[8] W. M. Hartmann, "Auditory Localization in Rooms", AES 12th International Conference, The Perception of Reproduced Sound, pp. 74-94 (1993)
[9] S. Bech, "The Influence of the Room and of Loudspeaker Position on the Timbre of Reproduced Sound in Domestic Rooms", AES 12th International Conference, The Perception of Reproduced Sound, pp. 74-94 (1993)
[10] W. A. Yost, "The Cocktail Party Problem: Forty Years Later", Binaural and Spatial Hearing in Real and Virtual Environments, R. Gilkey and T. Anderson (Eds.), Lawrence Erlbaum Associates, Hillsdale, NJ (1997)
[11] W. M. Hartmann, "Listening in a Room and the Precedence Effect", Binaural and Spatial Hearing in Real and Virtual Environments, R. Gilkey and T. Anderson (Eds.), Lawrence Erlbaum Associates, Hillsdale, NJ (1997)
[12] S. Linkwitz, "Investigation of Sound Quality Differences between Monopolar and Dipolar Woofers in Small Rooms", AES 105th Convention, Paper 4786 (1998)
[13] A. S. Bregman, Auditory Scene Analysis - The Perceptual Organization of Sound, The MIT Press (1999)
[14] S. Linkwitz, "Room Reflections Misunderstood?", AES 123rd Convention, Paper 7162 (2007)
[15] B. Blesser, L.-R. Salter, Spaces Speak, Are You Listening? Experiencing Aural Architecture, MIT Press (2007)
[16] F. E. Toole, Sound Reproduction, Focal Press (2008)
[17] J. Meyer, Acoustics and the Performance of Music, Springer (2009)
[18] B. Rakerd, W. M. Hartmann, "Localization of sound in rooms. V. Binaural coherence and human sensitivity to interaural time differences in noise", J. Acoust. Soc. Am., Vol. 128, No. 5, November (2010)
[19] S. Linkwitz, "Hearing Spatial Detail in Stereo Recordings", 26th Tonmeistertagung (2010), https://www.linkwitzlab.com/publications.htm
[20] T. Mellow, L. Karkkainen, "A dipole loudspeaker with balanced directivity pattern", J. Acoust. Soc. Am., Vol. 128, No. 5, November (2010)
[21] J. Vanderkooy, "The Acoustic Center: A New Concept for Loudspeakers at Low Frequencies", AES 121st Convention, Paper 6912 (2006)
[22] G. G. Muller, R. Black, T. E. Davis, "The Diffraction Produced by Cylindrical and Cubical Obstacles and by Circular and Square Plates", J. Acoust. Soc. Am., Vol. 10, July (1938)
[23] S. Linkwitz, "WATSON – Stereo Enhancement Loudspeakers" (2013), https://www.linkwitzlab.com/Watson/watson.htm
[24] S. Linkwitz, "PLUTO-2.1" (2013), https://www.linkwitzlab.com/Pluto/Pluto-2.1.htm
[25] S. Linkwitz, "Excursion limited SPL" (2013), https://www.linkwitzlab.com/spl_max1.xls
[26] S. Linkwitz, "The LX521 Monitor" (2013), https://www.linkwitzlab.com/LX521/Description.htm
[27] S. Linkwitz, "Models for a dipole loudspeaker, design" (2013), https://www.linkwitzlab.com/models.htm
[28] S. Linkwitz, "Active Filters" (2013), https://www.linkwitzlab.com/filters.htm

*** L. L. Beranek, T. J. Mellow, Acoustics - Sound Fields and Transducers, Elsevier (2012)
