# Extract SSL Unit with BPE This repo contains codes to extract Hubert units and perform BPE on them. The SSL feature is extracted from the 21st layer of `TencentGameMate/chinese-hubert-large` The number of BPE token is set to 6000. ## Usage ```bash python ssl_unit_encoder.py --audio test.wav ```