EACL 2026 System demostration

Beyond Only One Modality - KIT’s Multimodal Multilingual Lecture Companion

MIT

Key Highlights

Open-source models and pipelines for

  • Multimodal speech translation: Ambinguity resolution with visual context
  • Slide/Image translation: Enable multilingual lecture comprehension
  • Summarization, QA and more

Quickstart Guide

Install using pyParrot framework →

  1. Download
    git clone https://github.com/ai4lt/pyparrot.git
  2. Enter the repo
    cd pyparrot
  3. Initialize submodules
    git submodule update --init --recursive
  4. Install PyParrot
    pip install -e .
  5. Configure BOOM
    pyparrot configure boom --config example/boom.yaml
  6. Build docker container
    pyparrot build boom
  7. Start BOOM
    pyparrot start boom