Lab 7: Build a Voice Changer/Voice Shield#
Last updated 05/03/25
00. Content #
Mathematics#
N/A
Programming Skills#
flow structures (
try
/finally
,for
/else
)Timing conflicts
Buffers
Embedded Systems#
N/A
0. Required Hardware #
Headphones
Write your name and email below:
Name: me
Email: me @purdue.edu
import numpy as np
import matplotlib.pyplot as plt
1. Introduction #
In a previous lab, we learned about modulating signals and in later one, we learned how to capture and manipulate audio using python. That served as a good primer on how to manipulate audio in Python. We now combine the ideas from the earlier laboratories to create a real-time voice changer. If the time taken for processing data is more than a few milliseconds, we will be forced to either skip sections of audio or have a substantial lag between input and output. In this lab, we will explore ways to minimize latency in producing the modulated output inorder to process the audio in realtime. But before that let us look into audio recordng and playback with Python.
2. Audio Recorder #
We will be once again using Sounddevice for making our voice changer To explore more in detail about Sound and its features, you could refer the API docementation for Sounddevice.
For Recording Audio with Python, we have to do the following:
Open a data stream to get audio data frame from microphone
Iterate over the stream and append each frame to a list of frames.
Stop and close the data stream.
Save the data frames as a .wave file.
Close and terminate the audio stream.
However, fortunate for us, most of these steps are all handled automatically with sounddevice and numpy, making our job easier. And indeed, during last week’s lab, we used a voice record based on sounddevice and soundfiles already. We will be reusing it again this week.
Voice_recorder.py Recall, it records 5s of mono channel audio and saves it as ‘my_recording.wav’.
If you are curious, feel free to take a few second to understand what the code is doing, and how it records the audio.
3. Audio Player #
The idea behind the implementation of an audio player is similar but opposite to that of an audio recorder. Since we have the audio file saved in some format, a naive approach would be to load the audio as a single numpy array and play it using Audio() function.
But this approach is not an optimal solution when dealing with huge files.
So a good approach will be to read smaller chunks at a time from the audio file, so that we do not exhaust our total system memory.
The Pseudo code for the audio recording using python is as follows:
Initialize and open an output audio stream.
Iterate and read frames of data from the audio file.
Write the data frames to the output stream.
Close and terminate the audio stream.
Again, sounddevice and soundfile handles most of these steps for us.
Audio_player.py is implemented following the above pseudo code.
This should look somewhat similar to what you wrote yourself during last week’s lab.
Exercise 1#
Record a 10s audio clip using voice_recorder.py and then play the 10s audio using audio_player.py. Report your observations.
4. Megaphone - a Pythonic Approach #
A megaphone is a portable, cone-shaped device used to amplify a person’s voice. It is a simple device to amplify input sounds in realtime. In this exercise we will be using python to implememt a Megahorn in realtime.
To implement such a device, we follow these steps:
Initialize and open an input and output stream.
Read frames from the input stream.
Write the frames from the input stream to the output stream.
Close and terminate the data streams.
Megaphone.py is implemented following the steps above.
Exercise 2#
Run Megaphone.py and report your observations. How well does this program perform when you are not using headphones? Report your observations.
5. Voice Changer #
Voice changers are one of those gadgets that you might have seen in spy movies or as a part of Halloween costumes.
When you speak, your throat produces a vibration which travels as a wave. Essentially, your voice is a sound wave. A voice changer, changes the shape of this sound wave by altering it. This can be done in multiple ways.
One of the ways to implement a voice changer is to find out the frequency components of your voice and replace those frequency components with another frequency. However this is computationally expensive and will require specialized hardware to run the operation in realtime.
A simple way to do this is to multiply the received audio signal with a wave of known frequency. This will shift the frequency components of the signal altering your voice.
To implement such a device, we will do the following:
Initialize and open an input and output stream.
Read frames from the input stream.
Multiply each such audio frames with a wave of known frequency.
Write these modified audio frames to the output stream.
Close and terminate the data streams.
Real_time_voice_changer.py is implemented using this approach.
Exercise 3#
Run Real_time_voice_changer.py and report your observations. Identify the shape and frequency of the wave used in altering the input.
Exercise 4#
Change the shape of the waveform used in Real_time_voice_changer.py. Report your code and observations.
Exercise 5#
How does Real_time_voice_changer.py perform when you:
increase the frequency of the input altering wave?
decrease the frequency of the input altering wave?
Reflection #
Do not skip this section! Lab will be graded only on completion of this section.
1. What parts of the lab, if any, do you feel you did well?
2. What are some things you learned today?
3. Are there any topics that could use more clarification?
4. Do you have any suggestions on parts of the lab to improve?