Sara Papi

prof_pic.jpg

I am a Postdoctoral Researcher in Computer Science, specifically in Speech Processing, at FBK (Fondazione Bruno Kessler) in the MT Unit. I am interested in Speech Processing in general and its applications, with a particular focus on Speech Translation.

(I love elephants ♥️🐘)

Leave an anonymous feedback here! 😊

News

Mar 28, 2025 🏆 Highly Commended EAMT Best PhD Thesis
Nov 28, 2024 🏆 Honorable Mention for the Best Italian PhD Thesis at AIxIA
Nov 14, 2024 🏆 Social Impact Paper Award at EMNLP 2024!
Aug 14, 2024 🏆 Outstanding Paper and other Achievement at ACL 2024!
Aug 07, 2024 I am presenting 5 papers at ACL 2024! 🎉
Apr 18, 2024 I successfully defended my PhD 🎊
Feb 20, 2024 “How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena” accepted at LREC-COLING 2024 🎊
Dec 16, 2023 My second Microsoft internship paper was accepted at ICASSP 2024 🎊
Nov 28, 2023 My first Microsoft internship paper was accepted at ASRU 2023 🎊

Selected Publications

  1. EMNLP
    Dataset
    MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
    Marco Gaido*Sara Papi*, Luisa Bentivogli, and 6 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  2. EMNLP
    What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study
    Beatrice Savoldi, Sara Papi, Matteo Negri, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (Social Impact Paper Award) , Nov 2024
  3. ACL
    Speech Translation
    StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection
    Sara Papi, Marco Gaido, Matteo Negri, and 1 more author
    In , Aug 2024
  4. ACL
    Speech Translation
    Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
    Marco Gaido, Sara Papi, Matteo Negri, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Outstanding Paper and SAC Award) , Aug 2024
  5. ACL
    Automatic Subtitling
    SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
    Marco Gaido, Sara Papi, Matteo Negri, and 2 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  6. ACL
    When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP
    Sara Papi*, Marco Gaido*, Andrea Pilzer, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  7. TACL Automatic Subtitling
    Direct Speech Translation for Automatic Subtitling
    Sara Papi, Marco Gaido, Alina Karakanta, and 3 more authors
    Transactions of the Association for Computational Linguistics, Nov 2023