Skip to the content.

Download Latest Windows Release

Source Code

screenshot

The Audio Spatialisation for Headphones Toolset is a set of tools for headphone correction and binaural synthesis of spatial audio systems on headphones.

Features


Windows Installation

  1. Download the latest release and run the installer or download the portable package and unzip to the location of your choice.
  2. When running the executable, if a Microsoft Defender Smartscreen warning is presented, click ‘More info’ and ‘Run anyway’. This warning may show on some releases due to the code signing certificate of the executable building reputation.
  3. (optional) If using Equalizer APO for audio processing, download Equalizer APO and follow the Equalizer APO installation tutorial.
  4. (optional) If using HeSuVi for audio processing, download HeSuVi and follow the installation and usage steps outlined in the HeSuVi Wiki.

Usage

Run the application as administrator to launch the GUI. It may take a few seconds for it to launch.

GUI Overview

The app contains the following tabs:

Headphone Correction

In the ‘Quick Configuration’ tab, this tool is used to apply headphone correction in Equalizer APO. In the ‘Filter & Dataset export’ tab, this is used to export a set of correction filter files for a specific headphone which can then be loaded into audio processing software to apply headphone correction.

hpcf steps

  1. Select a headphone database containing correction filters. The ‘ASH Filters’ option is the main database which has a structured Brand/Headphone/Sample hierarchy and the ‘Compilation’ option contains additional filters derived from various measurement datasets and has a Type/Headphone/Dataset hierarchy.
  2. Select a headphone brand or type to filter down on the headphone list.
  3. Select a specific headphone.
  4. Select a specific sample or dataset. Note that in the ‘Filter & Dataset export’ tab, all samples/datasets will be exported for the selected headphone.
  5. (‘Filter & Dataset export’ tab only) Select which files to include in the export.
    • FIR Filters: Minimum phase FIRs in WAV format for convolution. 1 channel at specified sample rate and bit depth. This is filter type is required for the app to auto-configure ‘config.txt’ in Equalizer APO.
    • Stereo FIR Filters: Minimum phase FIRs in WAV format for convolution. 2 channels at specified sample rate and bit depth.
    • Graphic EQ Filters (127 bands): Graphic EQ configurations with 127 bands. Compatible with Equalizer APO and Wavelet
    • Graphic EQ Filters (31 bands): Graphic EQ configurations with 31 bands. Compatible with 31 band graphic equalizers including Equalizer APO
    • HeSuVi Filters: Graphic EQ configurations with 127 bands. Compatible with HeSuVi. Saved in HeSuVi\eq folder
  6. Select a sample rate for the WAV files. Available options are 44.1kHz, 48kHz, and 96kHz. Note: The sample rate of the generated fitlers should match the sample rate of the sound device.
  7. Select a bit depth for the WAV files. Available options are 24 bits per sample and 32 bits per sample.
  8. Click the ‘Apply Selection’ button to apply the selected filter in Equalizer APO or click the ‘Process’ button to export the selected filters to the output directory. By default this location will be C:\Program Files\EqualizerAPO\config\ASH-Outputs but can be changed using the change folder option.

Binaural Room Simulation over Headphones

Quick Configuration

In the ‘Quick Configuration’ tab, this tool is used to apply customised binaural room simulations over headphones in equalizer APO.

brir steps

  1. Select Acoustic Space from a range of environments including audio labs, conference rooms, control rooms, seminar rooms, studios, and more. This will determine the listening environment of the simulation.
  2. Select Gain for Direct Sound in dB. Select a value between -10dB and 10dB. Higher values will result in lower perceived distance. Lower values result in higher perceived distance
  3. Select Room Target from a range of options including flat, ASH target, Harman target, and more. This will influence the overall balance of low and high frequencies. Flat is recommended if using headphone correction from other sources such as AutoEq. Variations of below targets with flat mid and high frequencies are also provided.
  4. Select Headphone Compensation from below options. The selected option should match the listener’s headphone type. High strength is selected by default. Reduce to low strength or None if sound localisation or timbre is compromised.
    • In-Ear Headphones, high strength
    • In-Ear Headphones, low strength
    • Over-Ear/On-Ear Headphones, high strength
    • Over-Ear/On-Ear Headphones, low strength
    • None
  5. Select Listener Type from below options.
    • Dummy Head / Head & Torso Simulator
    • Human Listener
    • User SOFA Input
    • Favourites
  6. Select Dataset from available options. A number of public HRTF datasets will be listed here if ‘Listener Type’ is set to ‘Dummy Head / Head & Torso Simulator’ or ‘Human Listener’.
  7. Select Listener from available options. Some options will require an internet connection as the source dataset is not included and will be automatically downloaded from the web. If ‘Listener Type’ is set to ‘User SOFA Input’, user SOFA files will be automatically listed here. User SOFA files must be placed in the user data folder which is usually located at C:\Program Files (x86)\ASH Toolset\_internal\data\user\SOFA. Additional SOFA files can be found at the SOFA conventions repository.
  8. (optional) Select Low-frequency Integration Crossover Frequency between 20Hz and 150Hz. Auto Select mode will select an optimal frequency for the selected acoustic space. This can be used to tune the integration of the cleaner Low-frequency response and original room response. Higher values may result in a smoother bass response.
  9. Select Low-frequency Response for Extension from available options. A comparison of responses are available in the supporting information section below.
  10. (optional) Enable Additional EQ. Compensate Headphone Roll-off option will compensate the typical reduction in bass response below 100Hz in many headphones. This filter will result in a gain of approximately 4dB at 20Hz. Forward-Backward Filtering option will eliminate delay introduced by the filters, however can introduce edge artefacts in some cases.
  11. Select Sample Rate for the WAV files. Available options are 44.1kHz, 48kHz, and 96kHz. Note: The sample rate of the generated WAV files should match the sample rate of the sound device.
  12. Select Bit Depth for the WAV files. Available options are 24 bits per sample and 32 bits per sample.
  13. Click the ‘Apply Parameters’ button to apply the binaural simulation in Equalizer APO.

Filter & Dataset export

In the ‘Filter & Dataset export’ tab, this is used to export a customised binaural dataset containing binaural impulse responses in WAV or SOFA format which can then be loaded into audio processing software to apply room simulation.

brir steps

  1. Select Acoustic Space from a range of environments including audio labs, conference rooms, control rooms, seminar rooms, studios, and more. This will determine the listening environment of the simulation.
  2. Select Gain for Direct Sound in dB. Select a value between -10dB and 10dB. Higher values will result in lower perceived distance. Lower values result in higher perceived distance
  3. Select Room Target from a range of options including flat, ASH target, Harman target, and more. This will influence the overall balance of low and high frequencies. Flat is recommended if using headphone correction from other sources such as AutoEq. Variations of below targets with flat mid and high frequencies are also provided.
  4. Select Headphone Compensation from below options. The selected option should match the listener’s headphone type. High strength is selected by default. Reduce to low strength or None if sound localisation or timbre is compromised.
    • In-Ear Headphones, high strength
    • In-Ear Headphones, low strength
    • Over-Ear/On-Ear Headphones, high strength
    • Over-Ear/On-Ear Headphones, low strength
    • None
  5. Select Spatial Resolution from below options. Increasing resolution will increase number of source directions available but will also increase processing time and dataset size. Low’ is recommended unless additional directions or SOFA export is required.
    • Low: Elevation angles ranging from -30 to 30 degrees in 15 degree steps. Azimuth angles ranging from 0 to 360 degrees in varying steps.
    • Medium: Elevation angles ranging from -45 to 45 degrees in 15 degree steps. Azimuth angles ranging from 0 to 360 degrees in varying steps.
    • High: Elevation angles ranging from -50 to 50 degrees (WAV export) or -60 to 60 degrees (SOFA export) in 5 degree steps. Azimuth angles ranging from 0 to 360 degrees in 5 degree steps.
  6. Select Listener Type from below options.
    • Dummy Head / Head & Torso Simulator
    • Human Listener
    • User SOFA Input
    • Favourites
  7. Select Dataset from available options. A number of public HRTF datasets will be listed here if ‘Listener Type’ is set to ‘Dummy Head / Head & Torso Simulator’ or ‘Human Listener’.
  8. Select Listener from available options. Some options will require an internet connection as the source dataset is not included and will be automatically downloaded from the web. If ‘Listener Type’ is set to ‘User SOFA Input’, user SOFA files will be automatically listed here. User SOFA files must be placed in the user data folder which is usually located at C:\Program Files (x86)\ASH Toolset\_internal\data\user\SOFA. Additional SOFA files can be found at the SOFA conventions repository.
  9. (optional) Select Low-frequency Integration Crossover Frequency between 20Hz and 150Hz. Auto Select mode will select an optimal frequency for the selected acoustic space. This can be used to tune the integration of the cleaner Low-frequency response and original room response. Higher values may result in a smoother bass response.
  10. Select Low-frequency Response for Extension from available options. A comparison of responses are available in the supporting information section below.
  11. (optional) Enable Additional EQ. Compensate Headphone Roll-off option will compensate the typical reduction in bass response below 100Hz in many headphones. This filter will result in a gain of approximately 4dB at 20Hz. Forward-Backward Filtering option will eliminate delay introduced by the filters, however can introduce edge artefacts in some cases.
  12. Select Sample Rate for the WAV or SOFA files. Available options are 44.1kHz, 48kHz, and 96kHz. Note: The sample rate of the generated WAV files should match the sample rate of the sound device.
  13. Select Bit Depth for the WAV files. Available options are 24 bits per sample and 32 bits per sample.
  14. Select which files to include in the export.
    • Direction specific WAV BRIRs: Binaural Room Impulse Responses (BRIRs) in WAV format for convolution. One file for each source direction and 2 channels per file at specified sample rate and bit depth. This is file type is required for the app to auto-configure ‘config.txt’ in Equalizer APO.
    • True Stereo WAV BRIRs: True Stereo BRIR in WAV format for convolution. One file with 4 channels representing L and R speakers (LL LR RL RR) at specified sample rate and bit depth.
    • HeSuVi WAV BRIRs: BRIRs in HeSuVi compatible WAV format. 14 channels, 24 or 32 bit depth, 44.1Khz and 48Khz. The directions of the channels can be configured in the ‘HeSuVi & Multichannel Configuration’ tab on the right.
    • 16 Channel WAV BRIRs: BRIRs in FFMPEG compatible WAV format. 16 channels at specified sample rate and bit depth. Channel mapping can be configured in the ‘HeSuVi & Multichannel Configuration’ tab on the right.
    • SOFA File: BRIR dataset file in SOFA (Spatially Oriented Format for Acoustics) format. The SOFA convention can be selected in the Misc. Settings section under ‘Additional Tools & Settings’ tab.
  15. Click the ‘Process’ button to export the binaural dataset to the output directory.

Using the Correction Filters and Binaural Simulations

The outputs can be used to create spatial surround sound on headphones by convolving an audio stream with a set of binaural impulse responses and a headphone correction filter. This requires IR Convolution software that supports stereo or true stereo processing such as Equalizer APO

Apply Filters and Simulations in Equalizer APO

V3.0.0 onwards of the toolset includes a ‘Quick Configuration’ tab which will auto-configure ‘config.txt’ to apply selected filters and binaural simulations in Equalizer APO. Ensure ‘Enable Headphone Correction’ and/or ‘Enable Binaural Room Simulation’ are ticked for the changes to apply. The audio channels can be configured in the ‘Channel Configuration’ tab on the right side of the app.

Equalizer APO example

7.1 Surround Virtualisation

If your audio device does not support a 7.1 surround channel configuration, a virtual audio device such as VB-Audio Virtual Cable or Voicemeeter can be used for audio processing in place of your regular audio playback device. Equalizer APO can be installed on the virtual audio device which can be configured for 7.1 audio, and the output of the virtual audio device can be sent to your regular audio playback device.

Apply Filters and Simulations in HeSuVi

As an alternative to above method in Equalizer APO, the generated filters can be applied using HeSuVi.

HeSuVi HPCF example HeSuVi BRIR example

  1. If using HeSuVi for the first time, download HeSuVi and follow the installation and usage steps outlined in the HeSuVi Wiki.
  2. Open or restart HeSuVi
  3. The binaural room simulations can be selected in the Virtualisation tab. The simulation can be found under the Common HRIRs section at the top of the list.
  4. The headphone correction filters can be selected in the Equalizer tab. The filters can be found under _HpCFs at the bottom of the list.

Acoustic Space Import

This tab can be used to generate new acoustic spaces from reverberant impulse response (IR) files such as room impulse responses (RIRs). Imported acoustic spaces can be used in the quick configuration and filter & dataset export tabs.

Set IR Folder

Enter Metadata (Optional)

IR Processing Options

Spatial Sampling

Pitch Shifting (for dataset expansion)

The below parameters are only used to expand the dataset with new simulated source directions in cases where few IRs are supplied

Start Processing


Room Target Generator

This tab can be used to create custom room target curves using low-shelf and high-shelf filters. These targets can be used later in quick configuration and dataset export.

Enter Target Name (Optional)

Configure Low-Shelf Filter

Configure High-Shelf Filter

Generate the Room Target


File Naming and Structure

Outputs (excluding HeSuVi files) are saved within the ASH-Outputs child folder under the output directory. This will be in the Equalizer APO config directory (e.g. C:\Program Files\EqualizerAPO\config\ASH-Outputs) by default. HeSuVi files will be saved within the HeSuVi folder (e.g. C:\Program Files\EqualizerAPO\config\HeSuVi) by default. If required, the output directory can be changed using the directory selector. The EqualizerAPO\config directory should be selected if using Equalizer APO to ensure the filters and configurations can be read by Equalizer APO.

Binaural Room Impulse Responses

Headphone Correction Filters


Supporting Information

Supported SOFA Conventions

Convention Version SOFAConventionsVersion Read Write
GeneralFIR 2.1 1 Yes No
GeneralFIR 1 1 Yes Yes
SimpleFreeFieldHRIR 2.1 1 Yes No
SimpleFreeFieldHRIR 1 1 Yes Yes
SimpleFreeFieldHRIR 0.6 0.4 Yes No
SimpleFreeFieldHRIR 0.5 0.3 Yes No
GeneralFIRE 1 1 Yes No
GeneralFIRE 0.6 0.1 Yes No
GeneralFIR-E 2.1 2 Yes No
FreeFieldHRIR 2.1 1 Yes No

Acoustic Spaces

Name Estimated RT60 (ms) Name Estimated RT60 (ms) Name Estimated RT60 (ms)
Atrium A 2894 Courtyard 1496 Outdoors A 1935
Atrium B 1282 Foyer 1246 Outdoors B 1183
Audio Lab A 305 Hall A 1418 Outdoors C 1199
Audio Lab B 413 Hall B 949 Recording Room 284
Audio Lab C 508 Hall C 1052 Recording Studio A 723
Audio Lab D 193 Hall D 1069 Recording Studio B 739
Audio Lab E 442 Hall E 1414 Seminar Room A 839
Audio Lab F 631 Kiln 777 Seminar Room B 710
Audio Lab G 360 Large Treated Room 624 Seminar Room C 705
Audio Lab H 528 Lecture Hall A 901 Small Room A 500
Audio Lab I 539 Lecture Hall B 685 Small Room B 467
Auditorium A 1455 Lecture Room A 704 Small Room C 476
Auditorium B 346 Lecture Room B 728 Small Theatre 920
Auditorium C 1213 Lecture Room C 794 Smoking Room 660
Broadcast Studio A 1183 Listening Room A 221 Stairway 876
Broadcast Studio B 1241 Listening Room B 379 Studio A 398
Broadcast Studio C 914 Listening Room C 562 Studio B 351
Chamber A 1408 Listening Room D 312 Tatami Room 513
Chamber B 730 Listening Room E 824 Tennis Court A 1432
Classroom 1147 Listening Room F 771 Tennis Court B 1273
Concert Hall A 1599 Listening Room G 233 Tennis Court C 1345
Concert Hall B 1386 Living Room 967 Theatre 884
Concert Hall C 1526 Lobby A 735 Treated Room 178
Concert Hall D 1461 Lobby B 859 Tunnel 1404
Conference Room A 467 Meeting Room 458 Yoga Studio 1325
Conference Room B 541 Office A 408    
Control Room 260 Office B 496    

Low-frequency Responses

Name Estimated rt60 Comments Frequency Range Tolerance Measurement Dataset Source Type Listener Type
Low-frequency Response A - Audio Lab, smooth 320 Primary response used in v3.1.2 3Hz-150Hz 20Hz-120Hz +/-0.8dB Derived from multiple RIR Datasets Mixed Binaural
Low-frequency Response B - Audio Lab, smooth 340 Newly added in v3.2.0 0Hz-150Hz 20Hz-120Hz +/-1.0dB Derived from multiple RIR Datasets Mixed Binaural
Low-frequency Response C - Listening Room, natural 350 Newly added in v3.2.0 5Hz-150Hz 20Hz-120Hz +/-1.4dB ASH-Listening-Set Subwoofer Binaural
Low-frequency Response D - Studio, fast & smooth 240 Newly added in v3.2.0 0Hz-150Hz 20Hz-120Hz +/-1.0dB Derived from multiple RIR Datasets Mixed Binaural
Low-frequency Response E - Audio Lab, resonant 350 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-3.0dB Derived from multiple RIR Datasets Mixed Binaural
Low-frequency Response F - Studio Live Room, natural 350 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-2.0dB BBC Maida Vale Impulse Response Dataset Loudspeaker Binaural
Low-frequency Response G - Listening Room, slow 350 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-3.0dB Derived from multiple RIR Datasets Mixed Binaural
Low-frequency Response H - Listening Room, boomy 600 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-4.0dB The ISOBEL Sound Field Dataset Subwoofer Binaural
Low-frequency Response I - Recording Studio, slow 700 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-2.0dB RSoANU: RIR Dataset Loudspeaker Binaural
Low-frequency Response J - Listening Room, boomy 350 Newly added in v3.4.0 0Hz-150Hz 20Hz-120Hz +/-3.0dB The ISOBEL Sound Field Dataset Subwoofer Binaural
Low-frequency Response K - Studio, boomy 400 Newly added in v3.6.0 0Hz-150Hz 20Hz-120Hz +/-3.0dB Derived from multiple RIR Datasets Mixed Binaural

Changelog

Version 3.6.3:

Version 3.6.2:

Version 3.6.1:

Version 3.6.0:

Version 3.5.0:

Version 3.4.0:

Version 3.3.1:

Version 3.3.0:

Version 3.2.0:

Version 3.1.2:

Version 3.1.1:

Version 3.1.0:

Version 3.0.4:

Version 3.0.3:

Version 3.0.2:

Version 3.0.1:

Version 3.0.0:

Version 2.4.0:

Version 2.3.0:

Version 2.2.0:

Version 2.1.0:

Version 2.0.0:

Version 1.2.0:

Version 1.1.0:

Version 1.0.0:


License

ASH-Toolset is distributed under the terms of the GNU Affero General Public License v3.0 (AGPL-3.0). A copy of this license is provided in the file LICENSE.

Contact

Shanon Pearce - srpearce55@gmail.com

Project Link: https://github.com/ShanonPearce/ASH-Toolset

Acknowledgments

This project makes use of various publicly available HRIR, RIR, and BRIR datasets. Refer to the sheets provided in the github docs folder for information on the raw datasets used in this project. Further information on the HRTF datasets can be found at the SOFA conventions repository.