ChatGPT’s voice capability is “powered by a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech,” Open AI said in the blogpost.
https://edition.cnn.com/2023/09/25/tech/chatgpt-open-ai-humanlike-update/index.html