When adding audio one note at a time, you will need to add or edit a note (if this is unfamiliar to you, check out the Anki manual), and you will interact with the following buttons in the Anki editor:
Speaker Button is for adding audio to the note.
Play Button is for listening to audio before adding it.
Gear Button is for configuring audio settings.
The first time you press the Speaker Button, you will be asked to choose between Easy Mode and Advanced Mode. I recommend you choose Easy Mode, since it's the easiest way to get started. You can change this any time in the Settings (Gear Button). In Easy Mode, the Speaker Button and Play Button do the same thing.
If you'd like more flexible options and automation, check out Advanced Mode.
After choosing Easy Mode, you will see the Add Audio screen. There are different ways you can control the text used for the audio:
You must choose a Service and a Voice before audio can be generated. It's easier to select a Language first, which will filter down the available voices. If you have access to all premium voices, Azure is a safe choice. Otherwise, you can use Google Translate. Note that other services such as Forvo and Oxford are dictionary services and will only generate audio for single words.
Click the Preview Audio button. If you are satisfied, you can click Add Audio, and the audio file will be saved into your note. By default, it will be added to the same field that the text came from. You can change this by clicking More Settings... and selecting a different target field.
You should see the audio in your note. The next time you bring up the Add Audio dialog, your settings should be remembered.
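For reference, Anki stores audio attached to a field as a sound tag of the form `[sound:filename.mp3]` inside the field text. The sketch below only illustrates what a field might look like after audio is added to the same field the text came from. The helper name `append_sound_tag` and the filename are hypothetical, and the exact filename and spacing HyperTTS uses may differ.

```python
def append_sound_tag(field_text: str, filename: str) -> str:
    """Return field_text with an Anki-style sound tag appended.

    Hypothetical helper for illustration only; it is not part of
    HyperTTS or Anki.
    """
    tag = f"[sound:{filename}]"
    if not field_text.strip():
        # Empty field: the tag is the only content.
        return tag
    # Assumption: the tag follows the existing text, separated by a space.
    return f"{field_text.rstrip()} {tag}"


# Hypothetical filename; real generated filenames will look different.
print(append_sound_tag("bonjour", "hypertts-example.mp3"))
```

If you open the field in the editor's HTML view, a tag of this shape is what you should expect to find next to your text.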
If you are repeatedly adding audio, it'll be helpful to configure keyboard shortcuts to bring up the HyperTTS Add Audio screen. You can do so from the Tools menu on the Anki main screen, under HyperTTS: Preferences.
If you need more flexible options and automation, check out Advanced Mode.
April 6, 2025