Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes

ARAÚJO, Fabiola Pantoja Oliveira

Tese

Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes

Voice imitation through the utterance copy mechanism is estimating the value of the input parameters of a speech synthesizer to generate a similar signal with the original voice. This process is distinct from the more traditional text-to-speech, but yet used in many areas, especially, Linguistics an...

ver descrição completa

Autor principal:	ARAÚJO, Fabiola Pantoja Oliveira
Grau:	Tese
Idioma:	por
Publicado em:	Universidade Federal do Pará 2017
Assuntos:	Imitação da voz Sistemas de processamento da fala Algoritmos genéticos Análise-por-síntese Sintetizador por formantes Voice imitation Genetic slgorithm Analysis-by-synthesis Formant synthesizer Speech processing system CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES::SISTEMAS DE TELECOMUNICACOES
Acesso em linha:	http://repositorio.ufpa.br/jspui/handle/2011/7749

id	ir-2011-7749
recordtype	dspace
spelling	ir-2011-77492022-04-08T12:25:51Z Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes ARAÚJO, Fabiola Pantoja Oliveira KLAUTAU JÚNIOR, Aldebaro Barreto da Rocha http://lattes.cnpq.br/1596629769697284 Imitação da voz Sistemas de processamento da fala Algoritmos genéticos Análise-por-síntese Sintetizador por formantes Voice imitation Genetic slgorithm Analysis-by-synthesis Formant synthesizer Speech processing system CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES::SISTEMAS DE TELECOMUNICACOES Voice imitation through the utterance copy mechanism is estimating the value of the input parameters of a speech synthesizer to generate a similar signal with the original voice. This process is distinct from the more traditional text-to-speech, but yet used in many areas, especially, Linguistics and Health System. Imitate the human voice through this mechanism is a difficult inverse problem because the mapping is non-linear and from many to one. For instance, there are different combinations of the synthesizer input parameters values that produce the same synthetic voice signal. Therefore, perform voice imitation manually requires a considerable amount of time. In addition to automatic methods are our interest of study as well, as proposed here. This work presents our system based on Genetic Algorithm (GA) to automatically estimate the value of the input parameters of a speech formant synthesizer using the analysis-by-synthesis process. Results are presented for synthetic (computer-generated) and natural (human-generated) speech in American English, for male and female speakers. These results are compared with the ones obtained with Winsnoori, the only currently available software that performs the same task. The experiments showed that the proposed newGASpeech framework is an effective alternative to the laborious manual process of estimating the input parameters values of a formant synthesizer. Besides it has overcome the quality of the generated voices by the baseline if compared to five objective metrics and a subjective evaluation applied to twenty seven no-expert listeners in the speech area neither the adopted language. CNPq - Conselho Nacional de Desenvolvimento Científico e Tecnológico A imitação da voz através do mecanismo de utterance copy consiste em estimar os parâmetros de entrada de um sintetizador de voz para gerar um sinal parecido com o da voz original. Este processo distingue-se da tradicional conversão texto-fala, porém é usado em muitas áreas, especialmente, em Linguística e na Saúde. Imitar a voz humana através deste mecanismo é um problema inverso difícil, pois este mapeamento é não linear e de muitos para um. Por exemplo, existem diferentes combinações dos valores dos parâmetros de entrada do sintetizador que produzem o mesmo sinal de voz sintética. Sendo assim, realizar manualmente a imitação da voz requer uma quantidade considerável de tempo e métodos automáticos, como o proposto aqui, são de interesse. Este trabalho apresenta um arcabouço baseado em algoritmo genético (AG) para estimar automaticamente os valores dos parâmetros de entrada de um sintetizador de voz por formantes, utilizando o processo de análise-por-síntese. Os resultados apresentados compreendem a imitação de vozes sintéticas (geradas por computador) e naturais (geradas por humanos) em inglês americano, para falantes masculinos e femininos. Estes resultados são comparados com os obtidos através do Winsnoori (baseline), o único software disponível atualmente que executa a mesma tarefa. Os experimentos mostraram que o arcabouço desenvolvido (newGASpeech) é uma alternativa eficaz para o trabalhoso processo manual de estimar os valores dos parâmetros de entrada de um sintetizador por formantes, superando a qualidade das vozes geradas pelo baseline em relação à cinco métricas objetivas utilizadas e à avaliação subjetiva aplicada a vinte e sete ouvintes não especialistas na área de voz e nem no idioma adotado. 2017-02-22T16:23:02Z 2017-02-22T16:23:02Z 2015-12-18 Tese ARAUJO, Fabiola Pantoja Oliveira. Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes. 2015. 107 f. Orientador: Aldebaro Barreto da Rocha Klautau Júnior. Tese (Doutoradoem Engenharia Elétrica) - Instituto de Tecnologia, Universidade Federal do Pará, Instituto de Tecnologia, Belém, 2015. Disponível em: http://repositorio.ufpa.br/jspui/handle/2011/7749. Acesso em:. http://repositorio.ufpa.br/jspui/handle/2011/7749 por Acesso Aberto application/pdf Universidade Federal do Pará Brasil Instituto de Tecnologia UFPA Programa de Pós-Graduação em Engenharia Elétrica
institution	Repositório Institucional - Universidade Federal do Pará
collection	RI-UFPA
language	por
topic	Imitação da voz Sistemas de processamento da fala Algoritmos genéticos Análise-por-síntese Sintetizador por formantes Voice imitation Genetic slgorithm Analysis-by-synthesis Formant synthesizer Speech processing system CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES::SISTEMAS DE TELECOMUNICACOES
spellingShingle	Imitação da voz Sistemas de processamento da fala Algoritmos genéticos Análise-por-síntese Sintetizador por formantes Voice imitation Genetic slgorithm Analysis-by-synthesis Formant synthesizer Speech processing system CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES::SISTEMAS DE TELECOMUNICACOES ARAÚJO, Fabiola Pantoja Oliveira Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
topic_facet	Imitação da voz Sistemas de processamento da fala Algoritmos genéticos Análise-por-síntese Sintetizador por formantes Voice imitation Genetic slgorithm Analysis-by-synthesis Formant synthesizer Speech processing system CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES::SISTEMAS DE TELECOMUNICACOES
description	Voice imitation through the utterance copy mechanism is estimating the value of the input parameters of a speech synthesizer to generate a similar signal with the original voice. This process is distinct from the more traditional text-to-speech, but yet used in many areas, especially, Linguistics and Health System. Imitate the human voice through this mechanism is a difficult inverse problem because the mapping is non-linear and from many to one. For instance, there are different combinations of the synthesizer input parameters values that produce the same synthetic voice signal. Therefore, perform voice imitation manually requires a considerable amount of time. In addition to automatic methods are our interest of study as well, as proposed here. This work presents our system based on Genetic Algorithm (GA) to automatically estimate the value of the input parameters of a speech formant synthesizer using the analysis-by-synthesis process. Results are presented for synthetic (computer-generated) and natural (human-generated) speech in American English, for male and female speakers. These results are compared with the ones obtained with Winsnoori, the only currently available software that performs the same task. The experiments showed that the proposed newGASpeech framework is an effective alternative to the laborious manual process of estimating the input parameters values of a formant synthesizer. Besides it has overcome the quality of the generated voices by the baseline if compared to five objective metrics and a subjective evaluation applied to twenty seven no-expert listeners in the speech area neither the adopted language.
author_additional	KLAUTAU JÚNIOR, Aldebaro Barreto da Rocha
author_additionalStr	KLAUTAU JÚNIOR, Aldebaro Barreto da Rocha
format	Tese
author	ARAÚJO, Fabiola Pantoja Oliveira
title	Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
title_short	Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
title_full	Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
title_fullStr	Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
title_full_unstemmed	Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
title_sort	imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes
publisher	Universidade Federal do Pará
publishDate	2017
url	http://repositorio.ufpa.br/jspui/handle/2011/7749
_version_	1832603648253755392
score	11.755432

Imitação da voz humana através do processo de análise-por-síntese utilizando algoritmo genético e sintetizador de voz por formantes

Registros relacionados