ValueError: Audio features and audio tokens do not match: tokens: 288, features 30
ValueError: Audio features and audio tokens do not match: tokens: 288, features 30