News & Community eLanka

eLanka

Wednesday, 3 Jun 2026
  • Home
  • Read History
  • Articles
    • eLanka Journalists
  • Events
  • Useful links
    • Obituaries
    • Seeking to Contact
    • eLanka Newsletters
    • Weekly Events and Advertisements
    • eLanka Testimonials
    • Sri Lanka Newspapers
    • Sri Lanka TV LIVE
    • Sri Lanka Radio
    • eLanka Recepies
  • Gallery
  • Contact
Newsletter
Sri lankan news
  • eLanka Weddings
  • Property
  • eLanka Shop
  • Business Directory
eLankaeLanka
Font ResizerAa
Search
  • Home
  • Read History
  • Articles
    • eLanka Journalists
  • Events
  • Useful links
    • Obituaries
    • Seeking to Contact
    • eLanka Newsletters
    • Weekly Events and Advertisements
    • eLanka Testimonials
    • Sri Lanka Newspapers
    • Sri Lanka TV LIVE
    • Sri Lanka Radio
    • eLanka Recepies
  • Gallery
  • Contact
Follow US
© 2005 – 2026 eLanka Pty Ltd. All Rights Reserved.
Home » Goodnews Stories Srilankan Expats » Articles » Voice generation using text: A deep-learning method By Aditya Abeysinghe
Aditya AbeysingheArticles

Voice generation using text: A deep-learning method By Aditya Abeysinghe

eLanka admin
Last updated: September 17, 2021 2:28 am
By
eLanka admin
ByeLanka admin
Follow:
Share
3 Min Read
SHARE
Views: 29

Voice generation using text: A deep-learning method

  By Aditya Abeysinghe

Cable Bridge over Kelani River - major engineering feat By Aditya AbeysingheUsing text to generate speech similar to human voice is the main function of a text-to-speech (TTS) system. The process of converting text to speech is known as speech synthesis. Speech recorded is used to generate new speech, based on the input of the TTS. Since 1960s, several TTS systems have been developed for speech synthesis for current systems. However, these systems have several issues which led to the use of deep learning methods to synthesize speech.

Current methods

Two main methods exist for speech synthesis in traditional systems: concatenative and parametric. In concatenation-based synthesis the waveforms in the speech are concatenated to produce a speech stream. This type uses a waveform database to store and retrieve recorded speech. The speech appropriate for each text supplied is selected and joined to the stream to produce the final speech. In parametric speech synthesis, digital signal processing methods synthesize speech. Different parametric types use parameters such as phonetics and noise that are varied with time to create a waveform. Other techniques use deep neural or hidden models to produce waveforms.

The process

The first stage of speech synthesis is to analyze text input to the TTS. This involves text tokenization and removing blank characters. It breaks sentences into tokens and then sends them to the next stage, linguistic analysis. In this stage phoneme, syllable and words are analyzed in a text-to-phoneme conversion. Then the parameter prediction module predicts acoustic feature parameters of the linguistic analysis. Then the speech synthesis module produces the speech waveform.

Voice generation using text A deep-learning method

Issues in synthesis methods

The main issue with non-deep learning methods is that they are not efficient in processing complex text input that involve decision logic. Also, non-deep learning methods divide the input into separate regions and use separate parameters for each region. This results in fragmenting the data input to the TTS which causes improper models created. Therefore, the accuracy of such models to produce speech on test data is low.

Deep-learning methods in synthesis

Deep-learning methods use artificial neural networks for speech production. Therefore, models created using other methods are replaced with these neural networks, during the process between linguistic analysis and parameter generation as described in the process section. In a deep-learning neural network, linguistic features are processed using hidden layers of the network. The network is trained where error at each time is minimized by adjusting inner parameters or weights of the network. Therefore, the second issue as described where the division of input data in other methods is reduced. Also, complex logic can be represented using deep-learning method which reduces the first issue.

Image Courtesy: https://www.voicebase.com/

TAGGED:Aditya Abeysinghetext-to-speechTTSVoice generation
Share This Article
Facebook Whatsapp Whatsapp LinkedIn Email Copy Link Print
Previous Article Beddagana Wetland Park – verdant mosaic of wetlands By Arundathie Abeysinghe
Next Article TALENTED GIFTED WRITER
FacebookLike
YoutubeSubscribe
LinkedInFollow
eLanka Wedding
- Advertisement -
Ad image
Most Read
VACD Australia & Sri Lanka June 2026 Newsletter 01

VACD Australia & Sri Lanka June 2026 Newsletter

High Blood Pressure-A Vital Understanding for Health Management and Dispelling Myths

High Blood Pressure: A Vital Understanding for Health Management and Dispelling Myths-by Harold Gunatillake

Hameedia expands to Galle City Center offering a new premium fashion experience in the southern Sri Lanka 01

Hameedia expands to Galle City Center offering a new premium fashion experience in the southern Sri Lanka

Poson Reflections - By Mahinda Gunewardena

Poson Reflections – By Mahinda Gunewardena

Airport Bus Service, BIA Transport, Makumbura Transport Centre, Sri Lankan Travel

Travel Update: Luxury Bus Service Resumes Between BIA and Makumbura Transport Centre

Related News
The Ministry of Health’s Epidemiology Unit has issued a significant health update regarding a notable increase in suspected meningitis cases across several regions of the island.
Articles

Health Alert: 237 Suspected Meningitis Cases Reported Across Sri Lanka

Nishan Velupillay
Articles

Sri Lankan Pride: Nishan Velupillay Named in Australia’s 2026 FIFA World Cup Squad!

Qld Sri Lankan Newsletter - Dæhæna - June 2026
Articles

Qld Sri Lankan Newsletter – Dæhæna – June 2026

WHY ELDERLY PERSONS FEEL LONELY - By N.S.Venkataraman
Articles N.S.Venkataraman

WHY ELDERLY PERSONS FEEL LONELY ? – By N.S.Venkataraman

ABSC Inc. Hosts Annual Gala Dinner and Launches EKONOMOS Issue 7, 2026
Articles

ABSC Inc. Hosts Annual Gala Dinner and Launches EKONOMOS Issue 7, 2026

  • Quick Links:
  • Articles
  • DESMOND KELLY
  • Dr Harold Gunatillake
  • English Videos
  • Sri Lanka
  • Sinhala Videos
  • eLanka Newsletters
  • Obituaries
  • Sunil Thenabadu
  • Dr. Harold Gunatillake
  • Tamil Videos
  • Sinhala Movies
  • Trevine Rodrigo
  • eLanka Newsletter
  • Photos

eLanka

Your Trusted Source for News & Community Stories: Stay connected with reliable updates, inspiring features, and breaking news. From politics and technology to culture, lifestyle, and events, eLanka brings you stories that matter — keeping you informed, engaged, and connected 24/7.
Kerrie road, Oatlands , NSW 2117 , Australia.
Email : info@eLanka.com.au / rasangivjes@gmail.com.
WhatsApp : +61402905275 / +94775882546
  • About eLanka
  • Terms & Conditions

Disclaimer:
eLanka is committed to sharing positive and community-focused stories. We do not publish or endorse political, religious, or ethnic viewpoints. The content published on eLanka, including articles and newsletters, reflects the opinions and views of the respective authors and not those of eLanka. eLanka accepts no responsibility or liability for the accuracy, completeness, or consequences of any content provided by contributors.

(c) 2005 – 2025 eLanka Pty Ltd. All Rights Reserved.