whisperX: Running on short audio: KeyError: 'level_1'
Failed to align segment: no characters in this segment found in model dictionary, resorting to original...
I was testing to align a audio file, but it didn’t worked and give above error. It was a plain English .wav file
About this issue
- Original URL
- State: closed
- Created a year ago
- Comments: 22 (16 by maintainers)
@puresky07 https://github.com/m-bain/whisperX/commit/ba102feb7ff30e6f8345f00470955f5632e767e2 this one
Ok - I believe this is happening as a result of there being only a single segment in an audio file. It works fine on longer files where whisper is returning multiple segments. When a single segment is returned, the expected indexes aren’t there.
I’m not familiar enough with pandas to understand how to resolve it right now, but if you take a single sentence audio file you may be able to reproduce.