Lecture 12: Virtual Environments & Virtual Machines, Containerization & Docker
Key Word(s): Virtual Machines, Virtual Environments
Slides
Tutorial - Mega Pipeline App
🎙️ → 📝 → 🗒️ → [🔊🇫🇷] → 🔊
In this tutorial the entire class will work together to build a Mega Pipeline App that does the following:
- Allows a user to Record audio using a mic
- The audio file is then transcribed using the Google Cloud Speech-to-Text API
- The text is used as a prompt to a pre-trained GPT2 model to Generate Text (100 words)
- The generated text is synthesized to audio using Google Cloud Text-to-Speech API
- The generated text is also translated to French using googletrans
- The translated text is then synthesized to audio using Google Cloud Text-to-Speech API
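The data flow in the steps above can be sketched in a few lines of Python. This is only an illustration of how the stages connect, not the app itself: every function here is a hypothetical placeholder that tags its input instead of calling Google Cloud, GPT2, or googletrans, and the function names are assumptions chosen to echo the step descriptions.

```python
from typing import Dict


# Hypothetical placeholder stages: each one only wraps its input in a label
# so the order of operations is visible. No external service is called.
def transcribe_audio(audio: str) -> str:
    return f"transcript({audio})"


def generate_text(prompt: str) -> str:
    return f"generated({prompt})"


def translate_text(text: str) -> str:
    return f"french({text})"


def synthesize_audio(text: str) -> str:
    return f"audio({text})"


def run_pipeline(recording: str) -> Dict[str, str]:
    """Chain the stages in the order listed above."""
    prompt = transcribe_audio(recording)
    paragraph = generate_text(prompt)
    # The generated text fans out into two branches:
    # one is synthesized directly, the other is translated first.
    english_audio = synthesize_audio(paragraph)
    french_text = translate_text(paragraph)
    french_audio = synthesize_audio(french_text)
    return {
        "prompt": prompt,
        "paragraph": paragraph,
        "english_audio": english_audio,
        "french_text": french_text,
        "french_audio": french_audio,
    }


if __name__ == "__main__":
    for name, value in run_pipeline("recording.mp3").items():
        print(f"{name}: {value}")
```

Note that the generated text is the only point where the pipeline branches; every other stage has exactly one input and one output.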
The class will work in teams:
📝Team A transcribe_audio:
🗒️Team B generate_text:
🔊Team C synthesis_audio_en:
🇫🇷Team D translate_text:
🔊Team E synthesis_audio:
Each team will create a Docker container to build the required functionality. The details on what to build in each container are linked above for each team.
The overall progress of this mega pipeline can be viewed here
GCP Credentials File:
(Download and place inside
GCS Bucket Details:
input_audios - Bucket where we store the input audio files
text_prompts - Bucket where we store the text prompts that were transcribed from the input audio
text_paragraphs - Bucket where we store the generated text from GPT2
text_translated - Bucket where we store the translated text
text_audios - Bucket where we store the audio synthesized from the generated paragraphs of text
output_audios - Bucket where we store the final French audio files
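One way a container could use these buckets is to read from one bucket, write to the next, and skip files it has already processed. The sketch below is an assumption rather than the course's implementation: the pairing of each team with a source and destination bucket is inferred from the descriptions above, `pending` is a hypothetical helper, and matching files by name without the extension is an assumed naming convention. It works on plain lists of file names, so no Google Cloud client is needed to try it.

```python
from pathlib import PurePosixPath
from typing import Dict, Iterable, List, Tuple

# Assumed wiring of each team's container to (source bucket, destination bucket),
# inferred from the bucket descriptions; the real assignments may differ.
STAGE_BUCKETS: Dict[str, Tuple[str, str]] = {
    "transcribe_audio": ("input_audios", "text_prompts"),
    "generate_text": ("text_prompts", "text_paragraphs"),
    "synthesis_audio_en": ("text_paragraphs", "text_audios"),
    "translate_text": ("text_paragraphs", "text_translated"),
    "synthesis_audio": ("text_translated", "output_audios"),
}


def pending(source_names: Iterable[str], done_names: Iterable[str]) -> List[str]:
    """Return source files that have no counterpart in the destination bucket.

    Files are matched by name without extension (an assumed convention),
    so "input-01.mp3" counts as done once "input-01.txt" exists.
    """
    done_stems = {PurePosixPath(name).stem for name in done_names}
    return sorted(
        name for name in source_names
        if PurePosixPath(name).stem not in done_stems
    )


if __name__ == "__main__":
    source, destination = STAGE_BUCKETS["transcribe_audio"]
    # Hypothetical bucket listings, hard-coded for illustration.
    in_source = ["input-01.mp3", "input-02.mp3", "input-03.mp3"]
    in_destination = ["input-01.txt"]
    print(f"{source} -> {destination}:", pending(in_source, in_destination))
```

In a real container the two lists would come from listing the objects in the source and destination buckets before each processing pass.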