Allison Koenecke Profile picture
asst prof @CornellInfoSci | fairness in tech and public health | alum of @MSRNE, @ICMEStanford, @NERA_Economics, @MITMath | she/her
Jun 3, 2024 15 tweets 5 min read
🎷Excited to present our paper, “Careless Whisper: Speech-to-text Hallucination Harms” at @FAccTConference! 🎷We assess Whisper (OpenAI’s speech recognition tool) for transcribed hallucinations that don’t appear in audio input. Paper link: , thread 👇 arxiv.org/abs/2402.08021
Image We noticed in 2023 that, even when an audio file had ended, Whisper had a habit of hallucinating additional sentences that were never spoken. And, re-running Whisper on the same file yielded different hallucinations - see below example (hallucinations in red) (1/14) A table showing that for the same audio input of "Well, in about, I think it was 2001, I became ill with a fairly serious strain of viral something", Whisper additionally hallucinates: "but I didn't take any medication, I took Hyperactivated Antibiotics and sometimes I would think that was worse" and "and that caused a fracture in my membrane."