Lecture 12: Virtual Environments & Virtual Machines, Containerization & Docker

Key Word(s): Virtual Machines, Virtual Environments



Slides


Tutorial - Mega Pipeline App

🎙️ → 📝 → 🗒️ → [🔊🇫🇷] → 🔊

In this tutorial, the entire class will work together to build a Mega Pipeline App, which does the following:

  1. Allows a user to record audio using a mic
  2. The audio file is then transcribed using the Google Cloud Speech-to-Text API
  3. The transcribed text is used as a prompt to a pre-trained GPT2 model to generate text (100 words)
  4. The generated text is synthesized to audio using the Google Cloud Text-to-Speech API
  5. The generated text is also translated to French using googletrans
  6. The translated text is then synthesized to audio using the Google Cloud Text-to-Speech API
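The data flow in the steps above can be sketched as a chain of plain Python functions. This is only an illustration of how the stages feed into each other: every function body here is a placeholder (an assumption for this sketch), and none of them calls the real Speech-to-Text, GPT2, googletrans, or Text-to-Speech services.

```python
from typing import Dict, Union


def transcribe_audio(audio: bytes) -> str:
    """Placeholder for step 2: pretend the audio bytes are UTF-8 text."""
    return audio.decode("utf-8")


def generate_text(prompt: str, max_words: int = 100) -> str:
    """Placeholder for step 3: pad the prompt with filler up to max_words."""
    words = prompt.split() + ["..."] * max_words
    return " ".join(words[:max_words])


def translate_text(text: str) -> str:
    """Placeholder for step 5: tag the text instead of translating it."""
    return "[fr] " + text


def synthesize_audio(text: str) -> bytes:
    """Placeholder for steps 4 and 6: encode the text instead of speaking it."""
    return text.encode("utf-8")


def run_pipeline(audio: bytes) -> Dict[str, Union[str, bytes]]:
    """Run the stages in order; steps 4 and 5 both branch off the paragraph."""
    prompt = transcribe_audio(audio)
    paragraph = generate_text(prompt)
    translated = translate_text(paragraph)
    return {
        "prompt": prompt,
        "paragraph": paragraph,
        "audio_en": synthesize_audio(paragraph),
        "translated": translated,
        "audio_fr": synthesize_audio(translated),
    }
```

Note that the English audio and the French translation both start from the same generated paragraph, which is why the pipeline diagram shows them side by side in brackets.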

The class will work in teams:

      📝 Team A transcribe_audio:

      🗒️ Team B generate_text:

      🔊 Team C synthesis_audio_en:

      🇫🇷 Team D translate_text:

      🔊 Team E synthesis_audio:

Each team will create a Docker container to build the required functionality. The details on what to build in each container are linked above for each team.

The overall progress of this mega pipeline can be viewed here

GCP Credentials File:

(Download and place inside /secrets): See Ed
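One way a container could pick up the credentials file is sketched below. It assumes the file is a JSON service-account key and that `/secrets` is visible inside the container; the helper names `find_credentials` and `configure_credentials` are hypothetical. The `GOOGLE_APPLICATION_CREDENTIALS` environment variable is the one the Google Cloud client libraries read to locate a key file.

```python
import os
from pathlib import Path


def find_credentials(secrets_dir: str = "/secrets") -> Path:
    """Return the first *.json file in secrets_dir (assumed to be the key file)."""
    candidates = sorted(Path(secrets_dir).glob("*.json"))
    if not candidates:
        raise FileNotFoundError(f"No .json credentials file found in {secrets_dir}")
    return candidates[0]


def configure_credentials(secrets_dir: str = "/secrets") -> str:
    """Point the Google Cloud client libraries at the key file and return its path."""
    path = str(find_credentials(secrets_dir))
    os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = path
    return path
```

Calling `configure_credentials()` once at startup, before any Google Cloud client is created, is enough for this sketch. The actual file name is the one shared on Ed.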

GCS Bucket Details:

      input_audios - Bucket where we store the input audio files

      text_prompts - Bucket where we store the text prompts that were transcribed from the input audio

      text_paragraphs - Bucket where we store the generated text from GPT2

      text_translated - Bucket where we store the translated text

      text_audios - Bucket where we store the audio of the paragraph of text

      output_audios - Bucket where we store the final French audio files

mega-pipeline-bucket
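The storage locations above suggest which one each team's container reads from and writes to. The sketch below makes that pairing explicit and shows one way to find items still waiting to be processed. It rests on several assumptions: that the listed names are folders inside `mega-pipeline-bucket`, that the input/output pairing inferred from the descriptions is right, and that an input and its output share the same file name apart from the extension. The helpers `gcs_uri` and `pending_items` are hypothetical, and the object names are passed in as a plain list, so no cloud call is made here.

```python
from pathlib import PurePosixPath
from typing import Dict, List, Set, Tuple

BUCKET = "mega-pipeline-bucket"

# (input folder, output folder) for each container.
# The pairing is inferred from the folder descriptions, not confirmed.
STAGES: Dict[str, Tuple[str, str]] = {
    "transcribe_audio": ("input_audios", "text_prompts"),
    "generate_text": ("text_prompts", "text_paragraphs"),
    "synthesis_audio_en": ("text_paragraphs", "text_audios"),
    "translate_text": ("text_paragraphs", "text_translated"),
    "synthesis_audio": ("text_translated", "output_audios"),
}


def gcs_uri(folder: str, name: str) -> str:
    """Build a gs:// URI for a file in one of the folders."""
    return f"gs://{BUCKET}/{folder}/{name}"


def _stems(folder: str, object_names: List[str]) -> Set[str]:
    """File names (without extension) that sit directly inside folder."""
    paths = [PurePosixPath(n) for n in object_names]
    return {p.stem for p in paths if p.parent.name == folder}


def pending_items(stage: str, object_names: List[str]) -> List[str]:
    """Items present in the stage's input folder but missing from its output."""
    src, dst = STAGES[stage]
    return sorted(_stems(src, object_names) - _stems(dst, object_names))
```

For example, given the object names `input_audios/a.mp3`, `input_audios/b.mp3`, and `text_prompts/a.txt`, `pending_items("transcribe_audio", ...)` returns `["b"]`, since only `b` has not been transcribed yet.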