Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments

Publication
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)