Hi there! I am the developer behind SFlowTTS. This model is using a 44.1 kHz FSQ vocoder combined with a discrete Flow-Matching architecture. This 2-stage architecture has ~200M parameters. I trained the entire model within 7 days on 2x H100