Ever wondered why your mix is quieter when encoded with the AC-3 (Dolby Digital) codec? Ever wondered why Atmos tracks often sound so quiet compared to other audio tracks? Want to know why the external Dolby Atmos renderer can be a quality control issue? For answers to these questions and more, please take a few minutes to read the following.
As I was recently confronted with this topic again, I thought I would take this opportunity to give you some pointers on an important subject.
It concerns Dolby’s dialogue level (dialnorm). As far as I can tell, knowledge of this subject is not very widespread. This may be due to the fact that many people have never had to deal with it, as the dialogue level only appears when encoding consumer tracks with an encoder (DEE, DME, Minnetonka SurCode etc.).
It is very important that anyone creating mixes knows a few important facts about this topic. It is important whether you are doing music or post production.
Dolby’s dialogue level was originally introduced as a way of preventing volume fluctuations when switching between different audio tracks encoded with a Dolby codec. It was a good idea. And it still is. But in my experience, Dolby’s Dialogue Level causes problems in practice. This is especially true in conjunction with Dolby Atmos.
I don’t want to go into too much detail here about how the dialogue level works. If you want more information, you can read Dolby’s official PDF document “A Guide to Dolby Metadata”.
So just this much: Dialogue level is a metadata parameter. Dolby itself writes that the dialogue level is perhaps the most important metadata parameter of all.
The dialogue level setting represents the average loudness of dialogue in a presentation. When received at the consumer’s decoder, this parameter setting determines the level shift in the decoder that sets, or normalizes, the average audio output of the decoder to a preset level. The scale used in the dialogue level setting is from -1 to -31 dB in 1 dB steps. Contrary to what you might assume at first, a setting of “-31” represents no level shift in the consumer’s decoder, and “-1” represents the maximum level shift. The formula is:
31 + (dialogue level value) = Shift applied. Example: 31 + (-21) = 10 dB.
As already mentioned, you set the dialogue level during encoding. On many encoders the dialnorm is preset to “-27 dB”. (Why Dolby chose this value is the subject of some speculation. The -27dB is probably because Dolby claims that it has found that this is the average level for dialogue in an average film.) This is the reason why very, very many Dolby audio tracks on DVD, Blu-ray, UHD-BD or streaming have a dialogue level of “-27 dB”.
With the new decoders (DME, DEE), this preset value no longer exists. Instead, the encoder performs a loudness analysis during encoding. Depending on the result of this analysis, the “appropriate” value for the dialogue level is selected.
It is this analysis that often leads to major problems, especially with Atmos.
With an Atmos master file (ADM BWF) as the source, the encoder’s analysis is based on the 5.1 presentation. This results in very low dialogue levels. (Mostly around -23 to -21 dB. But I have also had values of -17 and -13 dB.) So a TrueHD with Atmos track or an E-AC-3 JOC track will sound “X” quieter than the original. This is the reason why the Dolby Atmos track on a Blu-ray or UHD BD is almost always the quietest audio track in comparison.
The solution to this problem is very simple:
We set the “Custom dialnorm” or “Dialnorm” in the encoder to “-31 dB”.
This disables this “feature”. If we are using the Encoding Engine (DEE), we have to adapt the corresponding script by adding the following line in the right place:
<custom_dialnorm>-31</custom_dialnorm>
Also important: With the newer encoders, you can select whether the content is music. If you want to (or have to) use the dialogue level function, you should make sure that you really select “Music” for “Content”. Otherwise in many cases the original will get a very low Dialogue Level value during encoding.
Another important note regarding the external Dolby Atmos renderer:
Many people use the renderer’s MP4 export function to export a file for quality control. Some also share this MP4 file with their clients. The external Atmos renderer creates an E-AC-3 JOC track during export. And the renderer also writes the dialnorm metadata. But unlike the DME or the DEE, the external Atmos renderer does not(!) allow you to set your own value for the dialogue level!
You should always keep this in mind, as in most cases the exported MP4 file will be (much) quieter than the original Atmos mix.