Publications

2024

  1. EMNLP
    Corpus
    MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
    Marco Gaido*Sara Papi*, Luisa Bentivogli, and 6 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  2. EMNLP
    What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study
    Beatrice Savoldi, Sara Papi, Matteo Negri, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (Social Impact Paper Award) , Nov 2024
  3. IWSLT Foundation Model
    SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), Aug 2024
  4. Automatic Subtitling and Subtitle Compression: FBK at the IWSLT 2024 Subtitling track
    Marco Gaido, Sara Papi, Mauro Cettolo, and 4 more authors
    In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), Aug 2024
  5. StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  6. Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
    Marco Gaido, Sara Papi, Matteo Negri, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  7. SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
    Marco Gaido, Sara Papi, Matteo Negri, and 2 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  8. When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP
    Sara Papi, Marco Gaido, Andrea Pilzer, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  9. Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation
    Sara Papi, Peidong Wang, Junkun Chen, and 4 more authors
    In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Aug 2024
  10. How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena
    Marco Gaido, Sara Papi, Matteo Negri, and 1 more author
    Aug 2024

2023

  1. Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments
    Sara Papi, Peidong Wang, Junkun Chen, and 3 more authors
    In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Aug 2023
  2. Direct Speech Translation for Automatic Subtitling
    Sara Papi, Marco Gaido, Alina Karakanta, and 3 more authors
    Transactions of the Association for Computational Linguistics, Nov 2023
  3. Joint Speech Translation and Named Entity Recognition
    Marco Gaido, Sara Papi, Matteo Negri, and 1 more author
    In INTERSPEECH 2023, Aug 2023
  4. Attention as a Guide for Simultaneous Speech Translation
    Sara Papi, Matteo Negri, and Marco Turchi
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2023
  5. When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP
    Sara Papi, Marco Gaido, Andrea Pilzer, and 1 more author
    Aug 2023
    arXiv:2303.16166 [cs]
  6. Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023
    Sara Papi, Marco Gaido, and Matteo Negri
    In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), Aug 2023
  7. AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation
    Sara Papi, Marco Turchi, and Matteo Negri
    In INTERSPEECH 2023, Aug 2023
  8. Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection
    Dennis Fucci, Marco Gaido, Sara Papi, and 3 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023

2022

  1. Does Simultaneous Speech Translation need Simultaneous Models?
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2022, Dec 2022
  2. Efficient yet Competitive Speech Translation: FBK@IWSLT2022
    Marco Gaido, Sara Papi, Dennis Fucci, and 3 more authors
    In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), Dec 2022
  3. Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Proceedings of the Third Workshop on Automatic Simultaneous Translation, Dec 2022
  4. Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora
    Sara Papi, Alina Karakanta, Matteo Negri, and 1 more author
    In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Nov 2022

2021

  1. Speechformer: Reducing Information Loss in Direct Speech Translation
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Nov 2021
  2. Simultaneous Speech Translation for Live Subtitling: from Delay to Display
    Alina Karakanta, Sara Papi, Matteo Negri, and 1 more author
    In Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings (ASLTRW), Aug 2021
  3. Dealing with training and test segmentation mismatch: FBK@IWSLT2021
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021), Aug 2021

2020

  1. Mixtures of Deep Neural Experts for Automated Speech Scoring
    Sara Papi, Edmondo Trentin, Roberto Gretter, and 2 more authors
    In Proc. Interspeech 2020, Aug 2020