Key Highlights
Open-source models and pipelines for
- Multimodal speech translation: Ambinguity resolution with visual context
- Slide/Image translation: Enable multilingual lecture comprehension
- Summarization, QA and more
Beyond Only One Modality - KIT’s Multimodal Multilingual Lecture Companion
Open-source models and pipelines for
Install using pyParrot framework →
# Download
git clone https://github.com/ai4lt/pyparrot.git
# Enter the repo
cd pyparrot
# Initialize and update submodules
git submodule update --init --recursive
# Install PyParrot
pip install -e .
# Configure BOOM
pyparrot configure boom --config example/boom.yaml
# Build docker container
pyparrot build boom
# Start BOOM
pyparrot start boom
git clone https://github.com/ai4lt/pyparrot.git
cd pyparrot
git submodule update --init --recursive
pip install -e .
pyparrot configure boom --config example/boom.yaml
pyparrot build boom
pyparrot start boom