Very slow for long texts, streaming would be great. Right now, the API waits for the entire audio file to be generated and written to the disk before returning the JSON with the file path. Returning the audio as a direct stream of bytes would make the response feel instantaneous, especially when using it with frontend UIs.