Voice Technology

Singing Voice Conversion SVC Glossary

Music production is a complex and ever evolving field which is becoming more and more accessible due to the advent of AI technology.

Voice.ai

April 24, 2023
6 minutes read

Thanks to AI many previously complex processes which might have taken years to master are now made accessible to anyone. Everything from beat creation, to lyric generation to voice effects and even mastering can be assisted by AI. Even if you are using AI to create music it helps to have a general understanding of the main principles of music creation and the key terminology that is used.

Singing Voice Conversion Glossary

In this glossary, we will explore some of the most common audio effects used in music production, including delay, reverb, compression, EQ, chorus, flanger, phaser, distortion, filter, stereo widening, and limiter.

What is Pitch correction?

Pitch correction is a technique used to adjust the pitch of a singing voice to make it sound more in tune. This can be done manually or with the use of software such as Auto-Tune or Melodyne. Pitch correction can be subtle or extreme depending on the desired effect.

What is Autotune?

Autotune is a specific brand of pitch correction software that has become synonymous with the technique. It is often used to create the robotic, exaggerated effect heard in modern pop music.

What is Melodyne?

Melodyne is another popular pitch correction software that allows for more detailed manipulation of individual notes and timing. It can also be used to create harmonies and other vocal effects.

What is Vocaloid?

Vocaloid is a voice synthesis technology developed by Yamaha that allows for the creation of synthesized singing voices. It has been used to create virtual singers for various applications such as music production, advertising, and video games.

What is Vocal tuning?

Vocal tuning refers to the process of adjusting the pitch and timing of a singing voice to make it sound more polished and professional. It is often used in music production to correct mistakes made during recording or to enhance the overall sound of a performance.

What is Harmonization?

Harmonization is the process of adding additional vocal parts to a melody to create harmony. This can be done manually or with the use of software. Harmonization can add depth and richness to a vocal performance.

What is Formant shifting?

Formant shifting is a technique used to change the timbre or tone of a singing voice by adjusting the frequencies of the formants (resonant frequencies) of the vocal tract. This can be used to make a voice sound more masculine or feminine, or to create other unique vocal effects.

What is Modulation?

Modulation refers to the process of changing the key or tonality of a singing voice. This can be done manually or with the use of software. Modulation can be used to create different moods or to fit a vocal performance better with the overall key of a song.

What is Resampling?

Resampling is the process of changing the speed or tempo of a recorded vocal performance. This can be used to match the tempo of a song or to create unique vocal effects.

What is Voice morphing?

Voice morphing is a technique used to transform a singing voice into a different voice or character. This can be done manually or with the use of software. Voice morphing can be used for creative purposes or to create a specific vocal effect.

What is Voice separation?

Voice separation is the process of isolating individual vocal parts from a mixed recording, such as separating lead and backing vocals. This can be useful for remixing or for making adjustments to specific vocal parts.

What is Vocal enhancement?

Vocal enhancement refers to the process of improving the overall sound quality of a recorded vocal performance. This can include pitch correction, EQ, compression, and other techniques. Vocal enhancement can help to make a performance sound more polished and professional.

What is Voice matching?

Voice matching is the process of creating a vocal performance that sounds similar to another singer. This can be used to mimic the sound of a particular artist or to create a consistent vocal sound across multiple recordings.

What is De-essing?

De-essing is the process of reducing or removing harsh sibilant sounds (such as “s” or “sh” sounds) in a recorded vocal performance. This can be done manually or with the use of software. De-essing can help to create a more pleasant and polished-sounding performance.

What is EQ (equalization)?

EQ, or equalization, is a process used to adjust the balance of different frequencies in a recorded vocal performance. This can be used to create a more balanced or unique sound, or to remove unwanted frequencies from a recording.

What is Compression?

Compression is a process used to reduce the dynamic range of a recorded vocal performance. This can help to make the performance sound more consistent and polished.

What is Reverb?

Reverb is a technique used to add artificial or natural-sounding ambience to a recorded vocal performance. This can be used to create a more immersive and spacious sound.

What is Delay?

Delay is a technique used to create an echo or repeat of a recorded vocal performance. This can be used to create a more spacious or ambient sound, or to create a specific vocal effect.

What is Chorus?

Chorus is a technique used to create multiple copies of a recorded vocal performance and layer them on top of each other. This can be used to create a thicker and more dynamic vocal sound.

What is Flanger?

Flanger is a technique used to create a sweeping, phase-shifting effect on a recorded vocal performance. This can be used to create a unique and trippy vocal effect.

What is Auto-Tune?

Auto-Tune is a popular pitch correction software used in music production to correct or adjust a singer’s pitch. Auto-Tune can help to create a more polished and in-tune vocal performance. However, its use can also be controversial, as some argue that it can create an artificial or “robotic” sound.

What is Vocal fry?

Vocal fry is a vocal technique that involves producing a low-pitched, creaky sound. This effect is often used in pop music to create a unique and edgy vocal sound. However, overuse of vocal fry can strain the vocal cords and cause damage.

What is Vibrato?

Vibrato is a natural fluctuation in pitch that occurs when a singer holds a note. Vibrato can add expressiveness and depth to a vocal performance. Some singers use vibrato as a stylistic choice, while others try to minimize it or eliminate it entirely.

What is Double-tracking?

Double-tracking is the process of recording a vocal performance twice and layering the two recordings on top of each other. This can be used to create a thicker and more dynamic vocal sound, or to create a sense of stereo width.

What is Phaser?

A phaser is a modulation effect that creates a sweeping, swirling sound by splitting an audio signal into two parts, then delaying and modulating one part before recombining it with the other. Phasers can be used to create a variety of psychedelic or spacey sounds, and are commonly used in guitar and synth effects.

What is Distortion?

Distortion is an effect that adds harmonic overtones and creates a gritty or rough sound to an audio signal. Distortion can be achieved through the use of analog or digital distortion pedals, or through overdriving a preamp or amplifier. Distortion is commonly used in rock and metal music to create a heavy, distorted guitar sound.

What is Filter?

A filter is an effect that removes or attenuates certain frequencies from an audio signal. Filters can be used to create a variety of effects, including a “wah-wah” sound (using a bandpass filter), a telephone or radio sound (using a low-pass filter), or a bright, airy sound (using a high-pass filter).

What is Stereo widening?

Stereo widening is a technique used to create a wider stereo image by adding stereo information to a mono audio signal. This can be achieved through the use of stereo widening plugins, which can add stereo information through techniques such as phase shifting, delay, or reverb. Stereo widening can help to create a more immersive and spacious sound.

What is Limiter?

A limiter is a type of dynamic range compression used to prevent audio signals from exceeding a certain level, usually in order to avoid distortion or to create a more consistent volume level. Limiters can be used to control the overall loudness of a mix, or to protect speakers from damage caused by excessive volume levels.

Stay tuned and expect to have the best text-to-speech generator tool with Voice.ai!