Academic literature on the topic 'Phone duration modeling'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Phone duration modeling.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Phone duration modeling"

1

Yamagishi, Junichi, Hisashi Kawai, and Takao Kobayashi. "Phone duration modeling using gradient tree boosting." Speech Communication 50, no. 5 (2008): 405–15. http://dx.doi.org/10.1016/j.specom.2007.12.003.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Lazaridis, Alexandros, Iosif Mporas, and Todor Ganchev. "Phone Duration Modeling of Affective Speech Using Support Vector Regression." International Journal of Intelligent Systems and Applications 4, no. 8 (2012): 1–9. http://dx.doi.org/10.5815/ijisa.2012.08.01.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Norkevičius, Giedrius, and Gailius Raškinis. "Modeling Phone Duration of Lithuanian by Classification and Regression Trees, using Very Large Speech Corpus." Informatica 19, no. 2 (2008): 271–84. http://dx.doi.org/10.15388/informatica.2008.213.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Lazaridis, Alexandros, Todor Ganchev, Theodoros Kostoulas, Iosif Mporas, and Nikos Fakotakis. "Phone duration modeling: overview of techniques and performance optimization via feature selection in the context of emotional speech." International Journal of Speech Technology 13, no. 3 (2010): 175–88. http://dx.doi.org/10.1007/s10772-010-9077-x.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Laccetti, Andrew L., Rebecca Slack Tidwell, Nipa P. Sheth, Christopher Logothetis, and Michael VanAlstine. "Remote patient monitoring using smart phone derived patient reported outcomes and Fitbit data to enable longitudinal predictive modeling in prostate cancer: Feasibility results and lessons on platform development." Journal of Clinical Oncology 37, no. 15_suppl (2019): e18068-e18068. http://dx.doi.org/10.1200/jco.2019.37.15_suppl.e18068.

Full text
Abstract:
e18068 Background: Wearable activity trackers and frequent interrogation of remote patient reported outcomes (PROs) have the potential to revolutionize oncology clinical trial design, therapeutic sequencing and patient (pt) safety. Established feasibility and novel systems of data processing are necessary to substantiate value for these methods. Methods: Expanding upon our clinical research data platform, Prometheus, we have developed a HIPPA compliant, remote pt monitoring system capable of collecting minute-by-minute step count and heart rate (via Fitbit) in addition to smart phone derived symptom surveys. To pilot this program, a cohort of metastatic castration resistant prostate cancer (mCRPC) pts participated in continuous Fitbit monitoring and thrice weekly NCI-PRO-CTCAE survey for 26 weeks. Pre-specified interim feasibility results were examined with a focus on pt characteristics, treatment and compliance. Qualitative assessment of barriers to pt enrollment and device use was performed. Results: 15 mCRPC pts completed our pilot program: mean age 63.1 years (range 48-79), pre-treatment PSA 33.2 ng/mL (range 0.3 – 499.9), 80.0% stage M1b, ECOG 0 or 1. All pts were treated per protocol NCT02703623: abiraterone/prednisone/apalutamide for 8 weeks followed by continuation of therapy (5 pts), cabazitaxel/carboplatin (6 pts) or addition of ipilimumab (2 pts). 33.3% of pts reported grade 3 clinician interpreted adverse events. 14 pts had data available for compliance analysis. Mean Fitbit compliance was 62.6% (STD 35.5%) with rates trending down over the study duration (week 1 vs 26 = 86.7% vs 33.3%; p= 0.002; R2= 0.716). Mean smart phone derived NCI-PRO-CTCAE survey completion rate was 37.1%. Barriers to pt enrollment included slow Fitbit app download times, incompatible smart phones and the need for extensive device use education/counseling. Barriers to data collection were missed survey text prompts and inconsistent use of Bluetooth. Conclusions: MD Anderson’s novel, home-grown, remote pt monitoring platform, utilizing Fitbit and smart phone derived PROs, is feasible in function and pt use. Automated compliance checks, streamlined enrollment and pt education are critical to future application. This foundational work will facilitate longitudinal signal variation benchmarked against standard monitoring methods, ultimately aiming to improve outcomes and support of discovery through enriched access to pt experience.
APA, Harvard, Vancouver, ISO, and other styles
6

Bourbonnais, Pierre-Léo, and Catherine Morency. "Factors Affecting Interview Duration in Web-Based Travel Surveys." Transportation Research Record: Journal of the Transportation Research Board 2672, no. 42 (2018): 33–44. http://dx.doi.org/10.1177/0361198118790376.

Full text
Abstract:
Historically, travel surveys have been conducted face-to-face, by mail, or by phone. With the increasing share of households having access to the Internet, other survey modes have been deployed. This paper focuses on web surveys. Among other advantages, using the web to conduct surveys reduces costs and helps mitigate poor response rates among young households. Very few studies have been conducted on interview duration and its determinant using paradata from web travel surveys. Such knowledge is necessary to validate the context in which travel data are gathered and can be used to understand sample and data quality. Interview duration modeling is also essential for allocating survey servers and monitoring interviews during the data collection phase. This paper models interview duration using paradata from nine web surveys conducted in the Quebec province from 2010 to 2014. The main objectives of the model are to assist the monitoring of interviews by detecting outliers, provide a better estimate of the interview duration to respondents and survey managers during the interview, and allow a more precise evaluation of the server performance needed before conducting web travel surveys. Using a multiple regression model, we observed that the most important variables in explaining interview duration were number of car and transit trips as well as number of unique places visited during a day. Conducting the interview on a small-screen device also increased interview duration. The model also provides a baseline estimate of interview duration on the basis of demographic features and questionnaire design.
APA, Harvard, Vancouver, ISO, and other styles
7

Meenan, Richard T., Kim D. Reynolds, David B. Buller, et al. "Economic Evaluation of a Sun Protection Promotion Program in California Elementary Schools." American Journal of Health Promotion 34, no. 8 (2020): 848–56. http://dx.doi.org/10.1177/0890117120905217.

Full text
Abstract:
Background: An economic evaluation of Sun Safe Schools intervention designed to aid California elementary schools with implementing sun safety practices consistent with local board–approved policy. Design: Program cost analysis: intervention delivery and practice implementation. Setting: California elementary schools (58 interventions and 60 controls). Principals at 52 intervention and 53 control schools provided complete implementation data. Participants: Principals completing pre-/postintervention surveys assessing practice implementation. Intervention: Phone-based 45-minute session with a project coach on practice implementation, follow-up e-mails/phone contacts, $500 mini-grant. Schools chose from a list of 10 practices for implementation: ultraviolet monitoring, clothing, hats, and/or sunscreen recommendations, outdoor shade, class education, staff training and/or modeling, parent outreach, and resource allocation. The duration of intervention was 20 months. Rolling recruitment/intervention: February 2014 to December 2017. Measures: Intervention delivery and practice implementation costs. Correlations of school demographics and administrator beliefs with costs. Analysis: Intervention delivery activities micro-costed. Implemented practices assessed using costing template. Results: Intervention schools: 234 implemented practices, control schools: 157. Twenty-month delivery costs: $29 310; $16 653 (per school: $320) for project staff, mostly mini-grants and coaching time. Administrator costs: $12 657 (per school: $243). Per-student delivery costs: $1.01. Costs of implemented practices: $641 843 for intervention schools (per-school mean: $12 343, median: $6 969); $496 365 for controls (per-school mean: $9365, median: $3123). Delivery costs correlated with implemented practices (0.37, P < .01) and total practice costs (0.37, P < .05). Implemented practices correlated with principal beliefs about the importance of skin cancer prevention to student health (0.46, P < .001) and parents (0.45, P < .001). Conclusion: Coaching of elementary school personnel can stimulate sun safety practice implementation at a reasonable cost. Findings can assist schools in implementing appropriate sun safety practices.
APA, Harvard, Vancouver, ISO, and other styles
8

Lazaridis, Alexandros, Iosif Mporas, Todor Ganchev, George Kokkinakis, and Nikos Fakotakis. "Improving phone duration modelling using support vector regression fusion." Speech Communication 53, no. 1 (2011): 85–97. http://dx.doi.org/10.1016/j.specom.2010.07.005.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Goubanova, Olga, and Simon King. "Bayesian networks for phone duration prediction." Speech Communication 50, no. 4 (2008): 301–11. http://dx.doi.org/10.1016/j.specom.2007.10.002.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Chung, Soonwan, and Jae B. Kwak. "Realistic warpage evaluation of printed board assembly during reflow process." Soldering & Surface Mount Technology 27, no. 4 (2015): 137–45. http://dx.doi.org/10.1108/ssmt-12-2014-0023.

Full text
Abstract:
Purpose – This paper aims to develop an estimation tool for warpage behavior of slim printed circuit board (PCB) array while soldering with electronic components by using finite element method. One of the essential requirements for handheld devices, such as smart phone, digital camera, and Note-PC, is the slim design to satisfy the customers’ desires. Accordingly, the printed circuit board (PCB) should be also thinner for a slim appearance, which would result in decreasing the PCB’s bending stiffness. This means that PCB deforms severely during the reflow (soldering) process where the peak temperature goes up to 250°C. Therefore, it is important to estimate PCB deformation at a high temperature for thermo-mechanical quality/reliability after reflow process. Design/methodology/approach – A numerical simulation technique was devised and customized to accurately estimate the behavior of a thin printed board assembly (PBA) during reflow by considering all components, including PCB, microelectronic packages and solder interconnects. Findings – By applying appropriate constraints and boundary conditions, it was found that PBA’s warpage can be accurately predicted during the reflow process. The results were also validated by warpage measurement, which showed a fairly good agreement with one and another. Research limitations/implications – For research limitations, there are many assumptions regarding numerical modeling. That is, the viscoplastic material property of solder ball is ignored, the reflow profile is simplified and the accurate heat capacity is not considered. Furthermore, the residual stress within the PCB, generated at PCB manufacturing process, is not included in this paper. Practical implications – This paper shows how to calculate PBA warpage during the reflow process as accurately as possible. This methodology helps a PCB designer and surface-mount technology (SMT) process manager to predict a PBA warpage issue and modify PCB design before PCB real fabrication. Practically, this modeling and simulation process can be easily performed by using a graphical user interface (GUI) module, so that the engineer can handle an issue by inputting some numbers and clicking some buttons. Social implications – In a common sense manner, a numerical simulation method can decrease time and cost in manufacturing real samples. This PCB warpage method can also decrease product development duration and produce a new product earlier. Furthermore, PCB is a common component in all the electronic devices. So, this PCB warpage method can have various applications. Originality/value – Because of an economic advantage, the development of a numerical simulation tool for estimating the thin PBA warpage behaviour during reflow process was attempted. The developed tool contains the features of detailed modeling for electronic components and contact boundary conditions of the supporting rails in the reflow oven.
APA, Harvard, Vancouver, ISO, and other styles
More sources

Dissertations / Theses on the topic "Phone duration modeling"

1

Norkevičius, Giedrius. "Method for creating phone duration models using very large, multi-speaker, automatically annotated speech corpus." Doctoral thesis, Lithuanian Academic Libraries Network (LABT), 2011. http://vddb.laba.lt/obj/LT-eLABa-0001:E.02~2011~D_20110201_144440-12017.

Full text
Abstract:
Two heretofore unanalyzed aspects are addressed in this dissertation: 1. Building a model capable of predicting phone duration of Lithuanian. All existing investigations of phone durations of Lithuanian were performed by linguists. Usually these investigations are the kind of exploratory statistics and are limited to a single factor, affecting phone duration, analysis. Phone duration dependencies on contextual factors were estimated and written in explicit form (decision tree) in this work by means of machine learning method. 2. Construction of language independent method for creating phone duration models using very large, multi-speaker, automatically annotated speech corpus. Most of the researchers worldwide use speech corpus that are: relatively small scale, single speaker, manually annotated or at least validated by experts. Usually the referred reasons are: using multi-speaker speech corpora is inappropriate because different speakers have different pronunciation manners and speak in different speech rate; automatically annotated corpuses lack accuracy. The created method for phone duration modeling enables the use of such corpus. The main components of the created method are: the reduction of noisy data in speech corpus; normalization of speaker specific phone durations by using phone type clustering. The performed listening tests of synthesized speech, showed that: the perceived naturalness is affected by the underlying phones durations; The use of contextual... [to full text]<br>Disertacijoje nagrinėjamos dvi iki šiol netyrinėtos problemos: 1. Lietuvių kalbos garsų trukmių prognozavimo modelių kūrimas Iki šiol visi darbai, kuriuose yra nagrinėjamos lietuvių kalbos garsų trukmės, yra atlikti kalbininkų, tačiau šie tyrimai yra daugiau aprašomosios statistikos pobūdžio ir apsiriboja pavienių požymių įtakos garso trukmei analize. Šiame darbe, mašininio mokymo algoritmo pagalba, požymių įtaka garsų trukmei yra išmokstama iš duomenų ir užrašoma sprendimo medžio pavidalu. 2. Nuo kalbos nepriklausomų garsų trukmių prognozavimo modelių kūrimo metodas, naudojant didelės apimties daugelio, kalbėtojų automatiškai, anotuotą garsyną. Dėl skirtingų kalbėtojų tarties specifikos ir dėl automatinio anotavimo netikslumų, kuriant garsų trukmės modelius visame pasaulyje yra apsiribojama vieno kalbėtojo ekspertų anotuotais nedidelės apimties garsynais. Darbe pasiūlyti skirtingų kalbėtojų tarties ypatybių normalizavimo ir garsyno duomenų triukšmo atmetimo algoritmai leidžia garsų trukmių modelių kūrimui naudoti didelės apimties, daugelio kalbėtojų automatiškai anotuotus garsynus. Darbo metu atliktas audicinis tyrimas, kurio pagalba parodoma, kad šnekos signalą sudarančių garsų trukmės turi įtakos klausytojų/respondentų suvokiamam šnekos signalo natūralumui; kontekstinės informacijos panaudojimas garsų trukmių prognozavimo uždavinio sprendime yra svarbus faktorius įtakojantis sintezuotos šnekos natūralumą; natūralaus šnekos signalo atžvilgiu, geriausiai vertinamas yra... [toliau žr. visą tekstą]
APA, Harvard, Vancouver, ISO, and other styles
2

Sandra, Sovilj-Nikić. "Razvoj matematičkog modela trajanja glasova u automatskoj sintezi govora na srpskom jeziku." Phd thesis, Univerzitet u Novom Sadu, Fakultet tehničkih nauka u Novom Sadu, 2014. https://www.cris.uns.ac.rs/record.jsf?recordId=85851&source=NDLTD&language=en.

Full text
Abstract:
U okviru ove disertacije razvijeno je više različitih modela trajanja glasova u srpskom jeziku primenom odgovarajućih metoda automatskog učenja. Izvršena je objektivna evaluacija razvijenih modela i njihovo međusobno poređenje na osnovu kvantitativnih pokazatelja kao što su RMSE(engl. root-mean-squared error), MAE (engl. mean absolute error) i CC (engl. correlation coefficient). Takođe je izvršeno poređenje modela za srpski jezik sa performansama modela razvijenih za druge jezike, pri čemu je uočeno da su performanse modela razvijenih u ovoj disertaciji uporedljive ili čak prevazilaze performanse modela koji su razvijeni za druge jezike.<br>In this dissertation several different phone duration models of the Serbainlanguage using appropriate machine learning algorithms were developed.The objective evaluation of the models obtained and their mutual comparisonbased on quantitative measures such as RMSE (root-mean-squared error),MAE (mean absolute error) and CC (correlation coefficient) were performed.The comparison of the models developed for the Serbian language with theperformances of the models developed for other languages is also carriedout. It was observed that the performances of the models developed in thisdissertation are comparable or even outperform the performances of themodels that have been developed for other languages.
APA, Harvard, Vancouver, ISO, and other styles
3

Norkevičius, Giedrius. "Garsų trukmių modelių kūrimo metodas, naudojant didelės apimties daugelio kalbėtojų garsyną." Doctoral thesis, Lithuanian Academic Libraries Network (LABT), 2011. http://vddb.laba.lt/obj/LT-eLABa-0001:E.02~2011~D_20110201_144413-26783.

Full text
Abstract:
Disertacijoje nagrinėjamos dvi iki šiol netyrinėtos problemos: 1. Lietuvių kalbos garsų trukmių prognozavimo modelių kūrimas Iki šiol visi darbai, kuriuose yra nagrinėjamos lietuvių kalbos garsų trukmės, yra atlikti kalbininkų, tačiau šie tyrimai yra daugiau aprašomosios statistikos pobūdžio ir apsiriboja pavienių požymių įtakos garso trukmei analize. Šiame darbe, mašininio mokymo algoritmo pagalba, požymių įtaka garsų trukmei yra išmokstama iš duomenų ir užrašoma sprendimo medžio pavidalu. 2. Nuo kalbos nepriklausomų garsų trukmių prognozavimo modelių kūrimo metodas, naudojant didelės apimties daugelio, kalbėtojų automatiškai, anotuotą garsyną. Dėl skirtingų kalbėtojų tarties specifikos ir dėl automatinio anotavimo netikslumų, kuriant garsų trukmės modelius visame pasaulyje yra apsiribojama vieno kalbėtojo ekspertų anotuotais nedidelės apimties garsynais. Darbe pasiūlyti skirtingų kalbėtojų tarties ypatybių normalizavimo ir garsyno duomenų triukšmo atmetimo algoritmai leidžia garsų trukmių modelių kūrimui naudoti didelės apimties, daugelio kalbėtojų automatiškai anotuotus garsynus. Darbo metu atliktas audicinis tyrimas, kurio pagalba parodoma, kad šnekos signalą sudarančių garsų trukmės turi įtakos klausytojų/respondentų suvokiamam šnekos signalo natūralumui; kontekstinės informacijos panaudojimas garsų trukmių prognozavimo uždavinio sprendime yra svarbus faktorius įtakojantis sintezuotos šnekos natūralumą; natūralaus šnekos signalo atžvilgiu, geriausiai vertinamas yra... [toliau žr. visą tekstą]<br>Two heretofore unanalyzed aspects are addressed in this dissertation: 1. Building a model capable of predicting phone duration of Lithuanian. All existing investigations of phone durations of Lithuanian were performed by linguists. Usually these investigations are the kind of exploratory statistics and are limited to a single factor, affecting phone duration, analysis. Phone duration dependencies on contextual factors were estimated and written in explicit form (decision tree) in this work by means of machine learning method. 2. Construction of language independent method for creating phone duration models using very large, multi-speaker, automatically annotated speech corpus. Most of the researchers worldwide use speech corpus that are: relatively small scale, single speaker, manually annotated or at least validated by experts. Usually the referred reasons are: using multi-speaker speech corpora is inappropriate because different speakers have different pronunciation manners and speak in different speech rate; automatically annotated corpuses lack accuracy. The created method for phone duration modeling enables the use of such corpus. The main components of the created method are: the reduction of noisy data in speech corpus; normalization of speaker specific phone durations by using phone type clustering. The performed listening tests of synthesized speech, showed that: the perceived naturalness is affected by the underlying phones durations; The use of contextual... [to full text]
APA, Harvard, Vancouver, ISO, and other styles
4

Λαζαρίδης, Αλέξανδρος. "Prosody modelling using machine learning techniques for neutral and emotional speech synthesis." Thesis, 2011. http://nemertes.lis.upatras.gr/jspui/handle/10889/4553.

Full text
Abstract:
In this doctoral dissertation three proposed approaches were evaluated using two databases of different languages, one American-English and one Greek. The proposed approaches were compared to the state-of-the-art models in the phone duration modelling task. The SVR model outperformed all the other individual models evaluated in this dissertation. Their ability to outperform all the other models is mainly based on their advantage of coping in a better way with high-dimensionality feature spaces in respect to the other models used in phone duration modelling, which makes them appropriate even for the case when the amount of the training data would be small respectively to the number of the feature set used. The proposed fusion scheme, taking advantage of the observation that different prediction algorithms perform better in different conditions, when implemented with SVR (SVR-fusion), contributed to the improvement of the phone duration prediction accuracy over that of the best individual model (SVR). Furthermore the SVR-fusion model managed to reduce the outliers in respect to the best individual model (SVR). Moreover, the proposed two-stage scheme using individual phone duration models as feature constructors in the first stage and feature vector extension (FVE) in the second stage, implemented with SVR (SVR-FVE), improved the prediction accuracy over the best individual predictor (SVR), and the SVR-fusion scheme and moreover managed to reduce the outliers in respect to the other two proposed schemes (SVR and SVR-fusion). The SVR two-stage scheme confirms in this way their advantage over all the other algorithms of coping well with high-dimensionality feature sets. The improved accuracy of phone duration modelling contributes to a better control of the prosody, and thus quality of synthetic speech. Furthermore, the first proposed method (SVR) was also evaluated on the phone duration modelling task in emotional speech, outperforming all the state-of-the-art models in all the emotional categories. Finally, perceptual tests were performed evaluating the impact of the proposed phone duration models to synthetic speech. The perceptual test for both the databases confirmed the results of objective tests showing the improvement achieved by the proposed models in the naturalness of synthesized speech.<br>Η παρούσα διδακτορική διατριβή πραγματεύεται προβλήματα που αφορούν στο χώρο της τεχνολογίας ομιλίας, με στόχο την μοντελοποίηση προσωδίας με χρήση τεχνικών μηχανικής μάθησης στα πλαίσια ουδέτερης και συναισθηματικής συνθετικής ομιλίας. Μελετήθηκαν τρεις καινοτόμες μέθοδοι μοντελοποίησης προσωδίας, οι οποίες αξιολογήθηκαν με αντικειμενικά τεστ και με υποκειμενικά τεστ ποιότητας ομιλίας για την συνεισφορά τους στην βελτίωση της ποιότητα της συνθετικής ομιλίας: Η πρώτη τεχνική μοντελοποίησης διάρκειας φωνημάτων, βασίζεται στην μοντελοποίηση με χρήση Μηχανών Υποστήριξης Διανυσμάτων (Support Vector Regression – SVR). Η μέθοδος αυτή δεν έχει χρησιμοποιηθεί έως σήμερα στην πρόβλεψη διάρκειας φωνημάτων. Η μέθοδος αυτή συγκρίθηκε και ξεπέρασε σε απόδοση όλες τις μεθόδους της επικρατούσας τεχνολογίας (state-of-the-art) στη μοντελοποίηση της διάρκειας φωνημάτων. Η δεύτερη τεχνική, βασίζεται στην μοντελοποίηση διάρκειας φωνημάτων με συνδυαστικό μοντέλο πολλαπλών προβλέψεων. Συγκεκριμένα, οι προβλέψεις διάρκειας φωνημάτων από ένα σύνολο ανεξάρτητων μοντέλων πρόβλεψης διάρκειας φωνημάτων χρησιμοποιούνται ως είσοδος σε ένα μοντέλο μηχανικής μάθησης, το οποίο συνδυάζει τις εξόδους από τα ανεξάρτητα μοντέλα πρόβλεψης και επιτυγχάνει μοντελοποίηση της διάρκειας φωνημάτων με μεγαλύτερη ακρίβεια, μειώνοντας επιπλέον και τα μεγάλα σφάλματα (outliers), δηλαδή τα σφάλματα που βρίσκονται μακριά από το μέσο όρο των σφαλμάτων. Η τρίτη τεχνική, είναι μια μέθοδος μοντελοποίησης διάρκειας φωνημάτων δύο σταδίων με κατασκευή νέων χαρακτηριστικών και επέκταση του διανύσματος χαρακτηριστικών. Συγκεκριμένα, στο πρώτο στάδιο, ένα σύνολο ανεξάρτητων μοντέλων πρόβλεψης διάρκειας φωνημάτων που χρησιμοποιούνται ως παραγωγοί νέων χαρακτηριστικών εμπλουτίζουν το διάνυσμα χαρακτηριστικών. Στο δεύτερο στάδιο, το εμπλουτισμένο διάνυσμα χρησιμοποιείται για να εκπαιδευτεί ένα μοντέλο πρόβλεψης διάρκειας φωνημάτων το οποίο επιτυγχάνει υψηλότερη απόδοση σε σχέση με όλες τις προηγούμενες μεθόδους, και μειώνει τα μεγάλα σφάλματα. Επιπλέον εφαρμόστηκε η πρώτη μέθοδος σε συναισθηματική ομιλία. Το προτεινόμενο SVR μοντέλο επιτυγχάνει την υψηλότερη απόδοση συγκρινόμενο με όλα τα state-of-the-art μοντέλα. Τέλος, πραγματοποιήθηκαν υποκειμενικά τεστ ποιότητας ομιλίας ώστε να αξιολογηθεί η συνεισφορά των τριών προτεινόμενων μεθόδων στη βελτίωση της ποιότητας της συνθετικής ομιλίας. Τα τεστ αυτά επιβεβαίωσαν την αξία των προτεινόμενων μεθόδων και τη συνεισφορά τους στη βελτίωση της ποιότητας στην συνθετική ομιλία.
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "Phone duration modeling"

1

Lazaridis, Alexandros, Todor Ganchev, Iosif Mporas, Theodoros Kostoulas, and Nikos Fakotakis. "Feature Selection for Improved Phone Duration Modeling of Greek Emotional Speech." In Artificial Intelligence: Theories, Models and Applications. Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-12842-4_43.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Phone duration modeling"

1

Hadian, Hossein, Daniel Povey, Hossein Sameti, and Sanjeev Khudanpur. "Phone Duration Modeling for LVCSR Using Neural Networks." In Interspeech 2017. ISCA, 2017. http://dx.doi.org/10.21437/interspeech.2017-1680.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Alumäe, Tanel, and Rena Nemoto. "Phone duration modeling using clustering of rich contexts." In Interspeech 2013. ISCA, 2013. http://dx.doi.org/10.21437/interspeech.2013-445.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Lazaridis, Alexandros, Iosif Mporas, Todor Ganchev, and Nikos Fakotakis. "Support vector regression fusion scheme in phone duration modeling." In ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2011. http://dx.doi.org/10.1109/icassp.2011.5947412.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Rallabandi, Sai Krishna, Sai Sirisha Rallabandi, Padmini Bandi, and Suryakanth V. Gangashetty. "Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis." In 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE, 2015. http://dx.doi.org/10.1109/asru.2015.7404782.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Lo, Wai-Kit, Alissa M. Harrison, and Helen Meng. "Statistical phone duration modeling to filter for intact utterances in a computer-assisted pronunciation training system." In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2010. http://dx.doi.org/10.1109/icassp.2010.5494988.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Lazaridis, Alexandros, Pierre-Edouard Honnet, and Philip N. Garner. "SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis." In 7th International Conference on Speech Prosody 2014. ISCA, 2014. http://dx.doi.org/10.21437/speechprosody.2014-198.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Bonilla-Escribano, Pablo, David Ramirez, and Antonio Artes-Rodriguez. "Modeling Phone Call Durations via Switching Poisson Processes with Applications in Mental Health." In 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, 2020. http://dx.doi.org/10.1109/mlsp49062.2020.9231856.

Full text
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography