Product · Audio Architecture

Signature layered audio design

A technical breakdown of every frequency layer and the role it plays in the session.

Every VōxSōma session is built from carefully tuned audio layers. Each one is designed around a specific frequency principle, then balanced with the others so your voice sits naturally inside the whole. This page describes how each layer works and why it is there. None of this is required reading to use the product — but if you want to understand the architecture, here it is.

Evening Wind-Down — Record Your Voice · 36 min · Flagship

Five layers, four phases.

The Evening Wind-Down — Record Your Voice is a 36-minute audio experience that progresses across four phases — Vagus tone, Alpha settle, Theta receptive window, Delta drop. Five precision sound layers are blended throughout, with your seven recorded affirmations woven through the central theta window. Stereo headphones are strongly recommended — phone speakers will not deliver the spatial elements.

Layer 1 · Breathing pad

4-note harmonic chord

A sustained four-note chord (C · E · G · C-octave) with subtle organic detuning. Provides the warm bed underneath everything else.
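As a rough illustration of how a detuned C · E · G · C chord could be expressed as frequencies, here is a minimal sketch. The octave (C3 up to C4), the 440 Hz reference pitch, and the detune amounts in cents are assumptions made for the example; the page does not state them.

```python
A4_HZ = 440.0  # assumed reference pitch


def note_hz(semitones_from_a4: int, detune_cents: float = 0.0) -> float:
    """Equal-tempered frequency, shifted by a detune amount in cents."""
    return A4_HZ * 2.0 ** ((semitones_from_a4 + detune_cents / 100.0) / 12.0)


# Assumed voicing: C3, E3, G3, C4, written as semitone offsets from A4.
CHORD_SEMITONES = [-21, -17, -14, -9]
# Hypothetical detune values in cents, small enough to sound "organic".
DETUNE_CENTS = [-4.0, 3.0, -2.0, 5.0]

pad_hz = [note_hz(s, c) for s, c in zip(CHORD_SEMITONES, DETUNE_CENTS)]
```

Each entry in `pad_hz` lands a few cents away from its nominal pitch, which is what produces the slow shimmer of a detuned pad.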

Layer 2 · Stereo dual-tone progression

8 → 10 → 12 Hz across 20 min

Three pairs of stereo tones (180/220/260 Hz carriers) where the left and right ears receive frequencies that differ by 8–12 Hz. The difference shifts gradually across the session — slower at the start, gently faster toward the end.
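The progression above can be sketched as a function of elapsed time. This is only an illustration: the linear ramp, the choice to put the plain carrier in the left ear and the offset tone in the right, and the names `beat_hz` and `ear_frequencies` are all assumptions, not the product's actual implementation.

```python
CARRIERS_HZ = (180.0, 220.0, 260.0)  # carriers named on this page
RAMP_SECONDS = 20 * 60               # the 20-minute span from the layer summary


def beat_hz(t_seconds: float) -> float:
    """Left/right difference: assumed linear ramp from 8 Hz to 12 Hz."""
    progress = min(max(t_seconds / RAMP_SECONDS, 0.0), 1.0)
    return 8.0 + 4.0 * progress


def ear_frequencies(t_seconds: float) -> list[tuple[float, float]]:
    """(left, right) frequency for each carrier pair at time t."""
    diff = beat_hz(t_seconds)
    return [(carrier, carrier + diff) for carrier in CARRIERS_HZ]
```

With a linear ramp, the halfway point sits at 10 Hz, which matches the 8 → 10 → 12 Hz summary.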

Layer 3 · Schumann grounding

7.83 Hz modulator on 110 Hz

A 110 Hz tone modulated at 7.83 Hz — a frequency sometimes referred to in audio practice as the "Schumann frequency." Sits low in the mix, providing a steady architectural foundation.
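One common way to modulate a tone is amplitude modulation, where a slow envelope scales a faster carrier. The sketch below assumes that interpretation, along with a hypothetical modulation depth of 0.5; the page does not say which kind of modulation or what depth is used.

```python
import math

CARRIER_HZ = 110.0
MOD_HZ = 7.83


def envelope(t: float, depth: float = 0.5) -> float:
    """Gain that dips from 1.0 down to (1 - depth) once per modulation cycle."""
    return 1.0 - depth * 0.5 * (1.0 - math.cos(2.0 * math.pi * MOD_HZ * t))


def schumann_sample(t: float, depth: float = 0.5) -> float:
    """One sample of the 110 Hz carrier scaled by the 7.83 Hz envelope."""
    return envelope(t, depth) * math.sin(2.0 * math.pi * CARRIER_HZ * t)
```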

Layer 4 · Sub-body

70 Hz with 0.1 Hz breath rhythm

A low 70 Hz tone with a slow 0.1 Hz amplitude envelope (one breath every ten seconds). Felt more than heard, particularly on speakers with strong low-frequency response.

Layer 5 · Glue

Brown noise, sidechained

A textured low-frequency noise field that ties the harmonic and rhythmic layers together. Modulated at 0.05 Hz so it breathes with the session arc.
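Brown noise is commonly approximated by integrating white noise. The sketch below uses a leaky integrator and then applies a slow 0.05 Hz swell. The leak factor, step size, swell depth, and sample rate are illustrative assumptions, and the sidechain behaviour mentioned in the layer summary is not modelled here.

```python
import math
import random


def brown_noise(n: int, sample_rate: int = 8000, seed: int = 0,
                leak: float = 0.999, step: float = 0.02) -> list[float]:
    """Leaky-integrated white noise with a 0.05 Hz amplitude swell."""
    rng = random.Random(seed)  # seeded so the output is repeatable
    level = 0.0
    out = []
    for i in range(n):
        # Integrate white noise; the leak keeps the walk from drifting away.
        level = leak * level + step * rng.uniform(-1.0, 1.0)
        level = max(-1.0, min(1.0, level))  # hard clamp as a safety bound
        # Swell between 0.5 and 1.0 gain, one full cycle every 20 seconds.
        swell = 0.75 + 0.25 * math.sin(2.0 * math.pi * 0.05 * i / sample_rate)
        out.append(level * swell)
    return out
```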

Layer 6 · Nature canvas

Bird ambient + soft wind

Approximately 50 short synthesized bird events scattered across the 20 minutes, plus a continuous low-band wind ambient. Distance and panning are randomized so no two listens feel identical.

Layer 7 · Activation harmonics

659 / 784 / 988 Hz · last 3 min

Three brighter harmonic tones (E5 · G5 · B5 in musical terms) that gradually fade in during the final three minutes. Marks the transition from inward focus toward an alert, awake state.
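The three frequencies can be checked against standard equal temperament (A4 = 440 Hz), and the fade can be sketched as a gain curve. The linear fade shape and the `fade_in_gain` helper are assumptions; the session length is passed in as a parameter rather than fixed.

```python
def note_hz(semitones_from_a4: int) -> float:
    """Equal-tempered frequency relative to A4 = 440 Hz."""
    return 440.0 * 2.0 ** (semitones_from_a4 / 12.0)


# Semitone offsets above A4 for the three notes named above.
ACTIVATION = {"E5": 7, "G5": 10, "B5": 14}
rounded_hz = {name: round(note_hz(s)) for name, s in ACTIVATION.items()}


def fade_in_gain(t_s: float, session_s: float, fade_s: float = 180.0) -> float:
    """Assumed linear fade from 0 to 1 over the final `fade_s` seconds."""
    start = session_s - fade_s
    return min(max((t_s - start) / fade_s, 0.0), 1.0)
```

Rounded to the nearest hertz, the computed values match the 659 / 784 / 988 Hz figures listed for this layer.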

Your seven recorded affirmations sit on top of this architecture as a separate voice channel. The engine layers duck gently beneath it (by about 3 dB) during the theta receptive window, then release as the session descends toward delta. Your voice is processed through the VōxSōma SVP™ pipeline locally on your device and never leaves your browser.
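For a sense of scale, a duck of roughly 3 dB corresponds to multiplying the background amplitude by about 0.71. The sketch below shows that conversion; `bed_gain` and its on/off switching are a simplification for illustration, since a real ducker would normally ramp the gain smoothly.

```python
def db_to_gain(db: float) -> float:
    """Convert a level change in decibels to a linear amplitude factor."""
    return 10.0 ** (db / 20.0)


def bed_gain(voice_active: bool, duck_db: float = -3.0) -> float:
    """Gain applied to the background layers (hypothetical helper)."""
    return db_to_gain(duck_db) if voice_active else 1.0
```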

Wellness audio tool · not a medical device · not intended to diagnose, treat, cure or prevent any disease or condition. Many practitioners describe a sense of calm alertness during and after the session. Individual experiences vary.

Theta

Theta layer · 6 Hz stereo difference tone

150 Hz left · 156 Hz right

A pair of tones: 150 Hz in the left ear, 156 Hz in the right. The 6 Hz difference between them creates a stereo difference tone in the theta range. Stereo headphones are required for this effect.

Delta

Delta layer · 2 Hz stereo difference tone

80 Hz left · 82 Hz right

A deeper pair of tones — 80 Hz left, 82 Hz right. The 2 Hz difference produces a stereo difference tone in the delta range. Used as a continuous foundation across the session.

Schumann

Schumann layer · 7.83 Hz stereo difference tone

125 Hz left · 132.83 Hz right

Tones at 125 Hz and 132.83 Hz, producing a 7.83 Hz stereo difference tone. This frequency sits near the boundary between the theta and alpha ranges and is sometimes referred to in audio practice as the "Schumann frequency."
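The three stereo pairs above share one pattern: each ear receives a plain sine tone, and the named frequency is simply the gap between the two. A minimal sketch follows, in which the `PAIRS_HZ` table and function names are illustrative.

```python
import math

# (left, right) frequencies in Hz, as listed for each layer on this page.
PAIRS_HZ = {
    "theta": (150.0, 156.0),
    "delta": (80.0, 82.0),
    "schumann": (125.0, 132.83),
}


def difference_hz(layer: str) -> float:
    """Gap between the right-ear and left-ear frequencies."""
    left, right = PAIRS_HZ[layer]
    return right - left


def stereo_frame(layer: str, t: float) -> tuple[float, float]:
    """One (left, right) sample pair at time t, full-scale sine tones."""
    left, right = PAIRS_HZ[layer]
    return (math.sin(2.0 * math.pi * left * t),
            math.sin(2.0 * math.pi * right * t))
```

Neither channel contains the difference frequency on its own, which is why these layers depend on each ear receiving its own channel.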

Isochronic

Isochronic pulse · 6 Hz on 174 Hz carrier

No headphones required

A 174 Hz tone pulsing at 6 Hz. Unlike stereo difference tones, isochronic pulses are perceptible without headphones, providing a parallel rhythmic element to the theta layer.
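An isochronic pulse can be sketched as a carrier switched on and off at the pulse rate. The hard on/off gate and 50% duty cycle below are assumptions for illustration; a production signal would typically smooth the gate edges to avoid clicks.

```python
import math

CARRIER_HZ = 174.0
PULSE_HZ = 6.0


def gate(t: float, duty: float = 0.5) -> float:
    """1.0 during the 'on' part of each pulse cycle, 0.0 otherwise."""
    phase = (t * PULSE_HZ) % 1.0
    return 1.0 if phase < duty else 0.0


def isochronic_sample(t: float, duty: float = 0.5) -> float:
    """One mono sample: the 174 Hz carrier gated at 6 Hz."""
    return gate(t, duty) * math.sin(2.0 * math.pi * CARRIER_HZ * t)
```

Because the pulsing lives in a single channel, the same signal can be sent to both speakers without losing the rhythm.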

Voice · optional

Your voice · seven personal intentions

Recorded on your device · never uploaded

Your seven personal intentions, recorded in your own voice. We balance the volume across the session, add a subtle room ambience, and place each affirmation at structured intervals throughout the listening experience. You can also ask someone you trust to record for you.
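Placing affirmations at structured intervals could be as simple as dividing a time window into equal slots. The sketch below assumes even spacing with each affirmation centred in its slot; the window bounds and the `affirmation_starts` name are hypothetical, since the page does not give the actual timing rule.

```python
def affirmation_starts(window_start_s: float, window_end_s: float,
                       count: int = 7) -> list[float]:
    """Evenly spaced anchor times, one per affirmation, centred in each slot."""
    slot = (window_end_s - window_start_s) / count
    return [window_start_s + slot * (i + 0.5) for i in range(count)]
```

For example, `affirmation_starts(0, 700)` yields anchors at 50, 150, and so on up to 650 seconds.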

Listening notes. Stereo headphones are required for L1, L2, and L3 (the theta, delta, and Schumann stereo tone layers). L4 (isochronic pulse) is perceptible on speakers as well. L5 (your voice) is optional: you can use the session purely instrumentally if you have not recorded affirmations yet.

How the layers combine

The five layers are not played at equal volume. The voice (L5) is foregrounded — placed in front of the listener — while the stereo and isochronic layers (L1–L4) sit behind it as a steady architectural foundation. We use deterministic Web Audio processing in your browser to balance the levels and place each affirmation at structured intervals across the session length.
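Unequal layer volumes can be illustrated with a fixed gain per layer applied before summing. The decibel values below are placeholders chosen only to show the voice sitting above the tone layers; they are not the product's actual levels.

```python
# Hypothetical per-layer levels in dB relative to the voice.
LEVELS_DB = {
    "voice": 0.0,
    "theta": -12.0,
    "delta": -14.0,
    "schumann": -14.0,
    "isochronic": -16.0,
}


def db_to_gain(db: float) -> float:
    """Convert decibels to a linear amplitude factor."""
    return 10.0 ** (db / 20.0)


def mix(samples: dict[str, float]) -> float:
    """Sum one sample from each named layer, scaled by its fixed gain."""
    return sum(db_to_gain(LEVELS_DB[name]) * value
               for name, value in samples.items())
```

Because the gains are fixed constants, the same inputs always produce the same mix.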

No layer makes any medical or therapeutic claim. The frequencies are well-known in audio practice and are used as a structural design element, not a medical treatment. If you have a history of epilepsy, seizures, or are sensitive to entrained-rhythm audio, consult your doctor before listening. See Safety for full guidelines.

Does spatial (8D) audio actually help you relax?

Spatial or "8D" audio uses panning and gentle reverb to make sound seem to move around your head, which many listeners find pleasantly immersive on headphones. The evidence that it changes relaxation physiologically is mixed — VōxSōma uses spatial layering because it makes the session feel enveloping and easy to stay with, not because it claims a clinical effect.

Why this matters

Most affirmation audio products use a single voice on top of generic background music. VōxSōma layers your own voice with calibrated frequency tones to create a session that is consistent across many listenings, and uniquely yours each time. This is the technical design choice we made — not a clinical claim.