Why Is Room-Based Information Transfer Difficult?

Introduction to Acoustics

In acoustics, the distance at which the sound pressure (measured power) of the direct sound (D) and the reverberant sound (R) are equal is called the “critical distance.”

For most reasonably sized rooms this distance is only two or three feet. At distances beyond the critical distance the power of the reverberation (the sum of all the reflections) is greater than that of the direct sound! That is, the direct-to-reverberant ratio, expressed in dB, is negative.

Figure 1 shows the power of the direct sound (D, in blue) and the total power ([D + R], in black) for a room volume of 88 cubic meters and an RT60 reverberation time of 0.8 seconds (the time required for the power to decay 60 dB after the stimulus stops), as a function of distance from the source (horizontal axis). Note that far enough from the source the total power flattens out to a constant, in what acousticians call “the diffuse field”; at these distances the power in the direct sound is negligible.

Figure 1 - Acoustic Critical Distance
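The critical distance can be checked numerically. A common diffuse-field estimate (derived from Sabine's reverberation theory) is d_c ≈ 0.057 √(V / RT60), with V in cubic meters and RT60 in seconds; the short sketch below applies it to the Figure 1 room (the function name is illustrative):

```python
import math

def critical_distance_m(volume_m3: float, rt60_s: float) -> float:
    """Diffuse-field estimate of the critical distance (Sabine-based):
    d_c ~= 0.057 * sqrt(V / RT60), V in cubic meters, RT60 in seconds."""
    return 0.057 * math.sqrt(volume_m3 / rt60_s)

# The room from Figure 1: 88 cubic meters, RT60 = 0.8 seconds
d_c = critical_distance_m(88.0, 0.8)
print(f"critical distance ~= {d_c:.2f} m ({d_c / 0.3048:.1f} ft)")
```

For this room the estimate comes out to roughly 0.6 meters (about two feet), consistent with the "two or three feet" rule of thumb above.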

If we consider the non-direct-sound power to be a self-noise component (not strictly true for speech, as early-reflection energy increases speech intelligibility), then at most distances from the source humans must decode speech at a negative signal-to-noise ratio (SNR). Our auditory system routinely decodes in such “negative SNR” settings, for example the so-called “cocktail party” situation, where room noise (typically other conversations) is louder than the person we wish to hear. It is well established that humans can decode speech down to about -15 dB SNR (see the literature on the Speech Intelligibility Index). This ability rests on two facts: 1) speech signals are highly redundant (the signal is encoded with special redundancy), and 2) the human auditory system is built to exploit that redundancy, allowing it to decode at negative SNR.

Successful information transfer in reverberant rooms is no different from speech: the best systems use “redundant encoding” of the transmitted signal (item 1 above) and decoding tuned to the characteristics of that signal (item 2 above).

In contrast to the human ability to decode at negative SNR, virtually all non-spread spectrum transmission systems require positive SNR to decode properly. This is why room-based information transmission is difficult using traditional communications techniques such as tones and the usual forms of modulation; the “information-carrying signal” (the direct signal or a more dominant reflection) must have power greater than the non-information-carrying components (noise or other reflections). An SNR of 12 dB or more is typically required for most traditional non-spread spectrum communications schemes. This fact alone puts these non-spread spectrum techniques at a deficit of over 20 dB relative to human auditory function!

It is also impossible to “hide” the information-carrying signals when using non-spread spectrum techniques, due to the need for positive SNR (hiding is a requirement for some steganographic applications). The information-carrying signal is easy to find with spectral tools (e.g., spectrograms). We shall see below that spread-spectrum techniques allow us to “bury” the information-carrying signal under existing signals and thus make it invisible to casual observation. Indeed, this “signal hiding” attribute is why many military systems use spread spectrum for covert communications.

Room Characteristics: Reverberation Creates A Multi-Path Environment

Given a transmitter (typically a speaker in the “sending system”) and a receiver (typically a microphone at a “user endpoint”) one can measure a room impulse response (RIR) that characterizes the room transfer function from the transmitter to the receiver. Whenever you change the location of either transmitter or receiver, you get a different RIR.

Figure 2 shows the magnitude response of a typical RIR between a transmitter and receiver in a conference room. It is clearly seen that there are many frequencies at which a signal is nulled (cancelled by more than 30 dB) by its reflections. Moreover, there are variations in magnitude of 10 dB or more between any two arbitrarily chosen frequencies. This makes the design of a system using one or a few tones difficult: one or more of the tones may not survive, and those that do often have vastly different magnitudes (as well as phases/delays).

Figure 2 - Magnitude Response (dB) of Conference Room RIR
[Aside: Most magnitude responses are smoothed (e.g., 1/3 octave) so that only an envelope of the response is seen. In many cases only this envelope is desired for acoustic engineering purposes - but for our use here we desire to see the details.]
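The deep nulls of Figure 2 arise from simple interference. Even a toy two-path model (direct sound plus one strong reflection, with purely illustrative delay and amplitude values) produces nulls deeper than 30 dB and large swings between nearby frequencies:

```python
import numpy as np

c = 343.0                      # speed of sound, m/s
extra_path = 0.50              # reflection travels 0.5 m farther than the direct path
tau = extra_path / c           # relative delay of the reflection, seconds
a = 0.98                       # reflection almost as strong as the direct sound

f = np.linspace(20, 8_000, 20_000)
# Transfer function of direct path plus one reflection: H(f) = 1 + a*exp(-j*2*pi*f*tau)
H = 1 + a * np.exp(-2j * np.pi * f * tau)
mag_db = 20 * np.log10(np.abs(H))

print(f"deepest null: {mag_db.min():.1f} dB at {f[np.argmin(mag_db)]:.0f} Hz")
print(f"peak-to-null swing: {mag_db.max() - mag_db.min():.1f} dB")
```

The nulls fall at odd multiples of c/(2·Δd), where Δd is the path-length difference. A real RIR contains many reflections, so the measured pattern is far more irregular than this sketch, but the cancellation mechanism is the same.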

To further complicate matters, many speakers (as well as microphone pickups) are directional. Figure 3 shows the magnitude response of a speaker as a function of how “off axis” the microphone is from the “0 degree” on-axis speaker response in free space (not in a reverberant room). Thus, the magnitude response is made even more uncertain and variable, depending on the directionality of the sound source (and of the microphone pickup as well). These variabilities present formidable constraints on system design.

Figure 3 - Free Space Speaker Magnitude Response Off Axis

Moving Endpoints: A Multi-Path Environment with "Fading"

Moving one of the endpoints results in a different RIR with similar characteristics, but one in which the nulls move to different frequencies. This effect increases with frequency: moving the receiving endpoint by as little as a centimeter shifts the nulls of higher-frequency components significantly. This movement causes great difficulty for tone-based designs, as small endpoint movements can drastically change the magnitudes/phases of the tones (this effect is called “fading” in communications).
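The null movement can be quantified with an illustrative two-path model: the nulls sit at f_k = (2k+1)·c/(2·Δd), so a small change in the path-length difference Δd shifts the high-frequency nulls far more than the low-frequency ones (all numbers below are hypothetical geometry, not a measured room):

```python
c = 343.0   # speed of sound, m/s

def null_freqs(extra_path_m, f_max=8_000.0):
    """Null frequencies of a two-path (direct + one reflection) channel:
    f_k = (2k+1) * c / (2 * delta_d), up to f_max."""
    spacing = c / (2 * extra_path_m)
    freqs = []
    f = spacing
    while f <= f_max:
        freqs.append(f)
        f += 2 * spacing   # nulls occur at odd multiples of the spacing
    return freqs

before = null_freqs(0.50)   # original geometry: 0.5 m path difference
after = null_freqs(0.51)    # endpoint moved so the difference grows by ~1 cm
# Compare the highest null below 8 kHz before and after the move
print(f"highest null moved {abs(before[-1] - after[-1]):.0f} Hz")
```

In this sketch a roughly one-centimeter geometry change moves the nulls near 8 kHz by on the order of 150 Hz - easily enough to move a fixed tone into (or out of) a null, which is exactly the fading problem described above.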

Room Reverberation + Moving Endpoints = Difficult Communications Design

We now see that room reverberation creates a severe multi-path acoustic environment and slowly moving endpoints worsen the situation via fading. These effects together conspire to make the use of tone-based techniques even more difficult. [Difficult, but not impossible. One can use a multiplicity of tones and use special encoding (e.g., an error-correcting code) such that only a subset of them need to survive to convey information. AcousticComms can help with such a design even though a spread-spectrum based design is superior in many dimensions.]

Spread Spectrum Best for a Fading and Severe Multi-Path Acoustic Room Environment

There is not room here for a thesis on spread spectrum design, but a cornerstone of the theory is the concept of “spreading” and the tradeoff between this spreading and the so-called “spread spectrum gain” (also called processing gain). The result is that some portion of the signal can be lost (i.e., signal energy in frequency nulls) and, as long as enough survives (i.e., energy not in nulls), the signal can still be successfully decoded. The theory is well developed in the communications literature, but heretofore has not been applied to information transfer in room acoustic environments.

If we assume a reverberant room environment (and potentially a slowly moving endpoint as above), the channel will have nulls/valleys that are unpredictable in location and can move. Designing a system with fixed tones will be difficult. However, designing a spread spectrum system in which the information carrying signal is spread – but some of that signal power can be “lost” in the nulls wherever they happen to be – is of great benefit. If we design the spread spectrum gain to account for the worst-case signal power loss, the received signal can be decoded without error! Indeed, with enough gain we can even place the information-carrying signal under the ambient energy (other sounds or general noise) and still decode without error! A non-spread spectrum system simply cannot do this due to the need for positive SNR decoding.
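The spreading/despreading idea can be sketched as a toy baseband direct-sequence simulation (illustrative parameters, not a full acoustic modem): each bit multiplies a long pseudo-noise code, and the correlating receiver recovers the bits even when the per-chip SNR is -10 dB, because the ~10·log10(N) processing gain lifts the decision statistic well above the noise.

```python
import numpy as np

rng = np.random.default_rng(0)

N = 1023                               # chips per bit (spreading factor)
pn = rng.choice([-1.0, 1.0], size=N)   # pseudo-noise spreading code
bits = rng.choice([-1.0, 1.0], size=20)

# Spread: each data bit modulates the entire PN sequence
tx = np.concatenate([b * pn for b in bits])

# Channel: additive noise at roughly -10 dB SNR (noise power 10x signal power)
noise = rng.normal(scale=np.sqrt(10.0), size=tx.size)
rx = tx + noise

# Despread: correlate each N-chip block against the PN code and take the sign;
# the correlation coherently sums the signal while the noise adds incoherently
decoded = np.sign(rx.reshape(-1, N) @ pn)
print("bit errors:", int(np.sum(decoded != bits)))
```

A real acoustic system must also handle the RIR (nulls, delay spread) and synchronization, but this additive-noise sketch shows the essential mechanism: decoding below 0 dB SNR, which no narrowband tone scheme can do.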

In short, the use of spread spectrum for the reverberant room environment is ideal in that you don’t need to know a priori where the nulls are precisely – you only need to know that some transmitted signal energy will be lost due to them. And if you design your system well, you can overcome that loss! In contrast, non-spread spectrum techniques need to test/measure the acoustic environment and then use heuristics or coding and/or other techniques (e.g., equalization) to adapt the system to the environment. As we have seen above, such a system is very brittle - moving the transmitter or receiver just a little bit can ruin the transmission because the channel changes dynamically.

Finally, we can use the spread spectrum gain to lower the transmit power via the well-known “spreading vs gain” tradeoff - every dB of spread spectrum gain can be used to lower transmit power levels. In contrast, non-spread spectrum designs must have transmitted power levels such that the received signal power is higher than the ambient noise (to achieve positive SNR at decoder) at all possible decode locations in the room. For large room volumes the spread spectrum design can be set 20 dB or more lower than the non-spread spectrum system – a very significant reduction in the necessary transmit power.
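The "spreading vs gain" arithmetic is straightforward: the processing gain is 10·log10 of the spreading factor, and any portion of that budget not reserved for worst-case null losses can be spent on lowering the transmit level (the 6 dB loss allowance below is purely illustrative):

```python
import math

def processing_gain_db(chips_per_bit: int) -> float:
    """Spread-spectrum processing gain in dB for a given spreading factor."""
    return 10 * math.log10(chips_per_bit)

# Example budget: a 1023-chip code yields ~30 dB of gain. Reserve some for
# worst-case energy lost in nulls; the remainder can lower the transmit level.
gain = processing_gain_db(1023)
worst_case_null_loss_db = 6.0   # illustrative design allowance, not a measured value
tx_power_reduction_db = gain - worst_case_null_loss_db
print(f"gain = {gain:.1f} dB, usable transmit-power reduction = {tx_power_reduction_db:.1f} dB")
```

Under these assumed numbers the spread system could transmit roughly 24 dB quieter than a positive-SNR tone system and still decode, consistent with the "20 dB or more" figure above.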

Summary

Despite the fact that humans communicate in reverberant rooms such as conference rooms and classrooms, the design of a robust information transmission system for general use in these rooms is difficult for the reasons outlined above. One can create simple, non-spread spectrum designs via heuristics and/or other techniques that work most of the time - but they may not work robustly because of the technical challenges presented by reverberant rooms, acoustic I/O (speakers and microphones) and unanticipated noise sources.

Spread spectrum transmission designs can overcome virtually all of the difficulty of non-spread spectrum transmission – at lower transmit power, with more robustness and with the potential of hiding the signal (with enough spread spectrum gain).

We note here that spread spectrum-based systems “lock onto” the dominant signal - be it the direct signal or a dominant reflection - and as long as enough signal power survives, the information can be decoded. This is important for military use of spread spectrum, as the dominant signal is almost always a reflection. Figure 4 shows an example of such a case where the transmitting endpoint is the Telepresence System and the receiving microphone has a beam pickup pattern that faces the rear wall. Note that this situation is not uncommon for a PC at that location, as its microphones are usually pointed away from the source. Here we see that the dominant energy is a reflection off the back wall and that the direct path actually arrives about 12 milliseconds before the dominant reflection. It is the dominant reflection that is decoded at the receiver, as it has more received power than the direct signal.

Figure 4 - Room Impulse Response Showing Dominant Reflection

Additionally, existing spread spectrum theory also provides bounds on the possible tradeoffs (spread vs gain) as well as theoretical capacity bounds (for a given signal spreading bandwidth) for decoding at negative SNR.

Excepting the Cisco Systems proximity pairing systems previously mentioned (US Patent 10,003,377 is the base patent), the industry has thus far not applied spread spectrum technology to the challenge of information transmission in reverberant rooms. There are many more systems and applications that can benefit from spread spectrum design for information transmission in reverberant rooms.

How AcousticComms Can Help

Different applications and systems can exploit this acoustic Direct Sequence Spread Spectrum technology.

AcousticComms can help with both spread spectrum and non-spread spectrum designs that accommodate different constraints and applications.


Michael A. Ramalho
Last Modified on :  May 16, 2025
Page Owner : Michael A. Ramalho, Ph.D.