You might want to split the extracted audio into multiple parts for sampling purposes (e.g. training AI voice models or like Tortoise-tts) ...