About
Narration Box is a voice synthesis service that offers over 700 AI-enhanced human-like narrators in over 20 languages. It enables users to create voiceovers, narrations, audiobooks, audio pages, podcasts, and more.
- Text-to-speech
- Voiceovers
Features & specifications (USP):
- AI Generated more than 700 hyper local voices in more than 70 languages.
- Can add expressive styles to voices (pause, anger, laugh, etc.).
- Can add prosody (patterns of rhythm and sound).
- Can export in wav format too, in addition to mp3.
User flow
Below is the user flow that focuses on:
User finding the platform and exporting his first project.
Improvements:
- A part on the beginning/home page showing “the most used voices by creators” and “the most used voices by edutech”, as per the user persona. This will help in reducing time of user decision making and confusion.
- A personalized welcome email by the higher authority that breaks down the product use in very brief. Here, user will get information about tips and tricks in the app and use of voices creatively. This will give user more options and reasons of using the product.
- Asking all the required questions is amazing and also feel a tiny bit heavy on user (they have to type certain things too). So, to compensate, we can give them a short onboarding of the workspace and a quick way of registering their issues or suggestions through email or a suggestion box. Here, we can also add link to the most searched queries in the product guide.
- Product guide: a database of how the product works that mostly contains small videos, gifs wherever needed, structured (less paragraphs) texts and other product related explanation.
- We can use LLM models to add the pauses and expressions automatically that will save the user much time to manually add line by line pauses and sometimes expressive styles. This method is tough to implement.