```
+------------------------------------------+
|                                          |
|          Voice Processing Flow           |
|                                          |
+------------------------------------------+
                      |
                      v
+------------------------------------------+
|                                          |
|  +----------+    +-----------------+     |
|  | User     |    | Voice Capture   |     |
|  | Voice    |--->| Interface       |     |
|  +----------+    +-----------------+     |
|                           |              |
|                           v              |
|                  +-----------------+     |
|  +-----------+   | Voice Quality   |     |
|  | NPR Style |   | Validation      |     |
|  | Transfer  |   +-----------------+     |
|  +-----------+            |              |
|        ^                  v              |
|        |         +-----------------+     |
|        |         | Voice Embedding |     |
|        |         | Extraction      |     |
|        |         +-----------------+     |
|        |                  |              |
|        |                  v              |
|        |         +-----------------+     |
|        +---------| Custom Voice    |     |
|                  | Creation        |     |
|                  +-----------------+     |
|                           |              |
|                           v              |
|                  +-----------------+     |
|                  | Multilingual    |     |
|                  | Adaptation      |     |
|                  +-----------------+     |
|                           |              |
+------------------------------------------+
                      |
                      v
+------------------------------------------+
|                                          |
|         Text-to-Speech Synthesis         |
|                                          |
+------------------------------------------+
```
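
The flow in the diagram can also be written down as a small directed graph, which makes the stage order easy to check programmatically. The sketch below is illustrative only: the stage names are copied from the box labels, while `FLOW` and `main_path` are hypothetical names introduced here. It also assumes that the arrow between "Custom Voice Creation" and "NPR Style Transfer" is a side branch leaving the main path, which is one reading of the diagram.

```python
# Hypothetical adjacency map; stage names come from the diagram's box labels.
# By convention here, the first successor listed is the main path and any
# further successors are side branches (an assumption, not stated in the source).
FLOW = {
    "User Voice": ["Voice Capture Interface"],
    "Voice Capture Interface": ["Voice Quality Validation"],
    "Voice Quality Validation": ["Voice Embedding Extraction"],
    "Voice Embedding Extraction": ["Custom Voice Creation"],
    "Custom Voice Creation": ["Multilingual Adaptation", "NPR Style Transfer"],
    "NPR Style Transfer": [],
    "Multilingual Adaptation": ["Text-to-Speech Synthesis"],
    "Text-to-Speech Synthesis": [],
}


def main_path(flow, start):
    """Follow the first outgoing edge of each stage until a stage has none."""
    path = [start]
    while flow[path[-1]]:
        path.append(flow[path[-1]][0])
    return path


if __name__ == "__main__":
    # Print the main path, one stage per line.
    for stage in main_path(FLOW, "User Voice"):
        print(stage)
```

Keeping the graph as plain data means a stage can be added or reordered in one place without touching the traversal code.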