In my app, if I pass a large amount of text to “say sometexthere --file-format=mp4f --output-file test.mp3”, it starts speaking that text aloud. This doesn’t happen with small or medium amounts of text. I thought it wasn’t supposed to speak the text if it’s going to an output file.
What’s the best way to handle a large amount of text-to-speech output going to an audio file?
I have a vague recollection of hearing, during one of the M1 chip talks, that there was a 60-second limit for this on Intel chips but that the M1 chips were unlimited. I don’t have a reference for that, though; it’s just a vague recollection of hearing something to that effect.
And I could be wrong: the limit I heard about may only apply in the other direction, speech to text, which I think is offloaded to the neural cores on the M1 chips.
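
For reference, here is a rough sketch of one workaround I’m considering: split the text into smaller chunks at sentence boundaries, write each chunk to a temporary text file, and hand that file to `say` with `-f` (input file) and `-o` (output file) instead of passing the text on the command line. The helper names (`split_text`, `say_command`, `render_chunks`) and the 2000-character chunk size are my own assumptions, not anything documented, and I haven’t confirmed that chunking actually avoids the speak-aloud behavior.

```python
import re
import subprocess
from pathlib import Path


def split_text(text, max_chars=2000):
    """Split text into chunks of at most max_chars characters.

    Breaks after sentence-ending punctuation where possible; a single
    sentence longer than max_chars is cut at the character limit.
    The 2000-character default is an arbitrary guess, not a known limit.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Hard-split any sentence that cannot fit in one chunk.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def say_command(input_path, output_path):
    """Build the argument list: -f reads text from a file, -o writes audio to a file."""
    return ["say", "-f", str(input_path), "-o", str(output_path)]


def render_chunks(text, out_dir, max_chars=2000):
    """Write each chunk to a text file and run `say` on it (macOS only)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    outputs = []
    for i, chunk in enumerate(split_text(text, max_chars)):
        txt = out_dir / f"part{i:04d}.txt"
        audio = out_dir / f"part{i:04d}.aiff"  # AIFF is the default output format
        txt.write_text(chunk, encoding="utf-8")
        subprocess.run(say_command(txt, audio), check=True)
        outputs.append(audio)
    return outputs
```

Calling `render_chunks(long_text, "tts_parts")` would leave a series of numbered `.aiff` files that still need to be joined into one file afterward. If anyone knows whether this is necessary at all, or whether there is a real limit involved, I’d like to hear it.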