Log in

Relevant bibliographies by topics / VoiceXML / Dissertations / Theses

To see the other types of publications on this topic, follow the link: VoiceXML.

Dissertations / Theses on the topic 'VoiceXML'

Author: Grafiati

Published: 4 June 2021

Last updated: 1 February 2022

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 21 dissertations / theses for your research on the topic 'VoiceXML.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Will, Thomas. "Creating a dynamic speech dialogue : how to implement dialogue initiatives and question selection strategies with VoiceXML agents /." Saarbrücken : VDM Verlag Dr. Müller, 2007. http://deposit.d-nb.de/cgi-bin/dokserv?id=3040741&prov=M&dok_var=1&dok_ext=htm.

Full text

APA, Harvard, Vancouver, ISO, and other styles

2

King, Adam. "Constructing a low-cost, open-source, VoiceXML." Thesis, Rhodes University, 2007. http://hdl.handle.net/10962/d1004735.

Full text

Abstract:

Voice-enabled applications, applications that interact with a user via an audio channel, are used extensively today. Their use is growing as speech related technologies improve, as speech is one of the most natural methods of interaction. They can provide customer support as IVRs, can be used as an assistive technology, or can become an aural interface to the Internet. Given that the telephone is used extensively throughout the globe, the number of potential users of voice-enabled applications is very high. VoiceXML is a popular, open, high-level, standard means of creating voice-enabled applications which was designed to bring the benefits of web based development to services. While VoiceXML is an ideal language for creating these applications, VoiceXML gateways, the hardware and software responsible for interpreting VoiceXML applications and interfacing with the PSTN, are still expensive and so there is a need for a low-cost gateway. Asterisk, and open-source, TDM/VoIP telephony platform, can be used as a low-cost PSTN interface. This thesis investigates adding a VoiceXML service to Asterisk, creating a low-cost VoiceXML prototype gateway which is able to render voice-enabled applications. Following the Component-Based Software Engineering (CBSE) paradigm, the VoiceXML gateway is divided into a set of components which are sourced from the open-source community, and integrated to create the gateway. The browser requires a VoiceXML interpreter (OpenVXI), a Text-To-Speech engine (Festival) and a speech recognition engine (Sphinx 4). The integration of the components results in a low-cost, open-source VoiceXML gateway. System tests show that the integration of the components was successful, and that the system can handle concurrent calls. A fully compliant version of the gateway can be used in the real world to render voice-enabled applications at a low cost.<br>KMBT_363<br>Adobe Acrobat 9.55 Paper Capture Plug-in

APA, Harvard, Vancouver, ISO, and other styles

3

Trinh, Viet. "Using voicexml to provide real-time traffic information." Honors in the Major Thesis, University of Central Florida, 2002. http://digital.library.ucf.edu/cdm/ref/collection/ETH/id/307.

Full text

Abstract:

This item is only available in print in the UCF Libraries. If this is your Honors Thesis, you can help us make it available online for use by researchers around the world by following the instructions on the distribution consent form at http://library.ucf.edu/Systems/DigitalInitiatives/DigitalCollections/InternetDistributionConsentAgreementForm.pdf You may also contact the project coordinator, Kerri Bottorff, at kerri.bottorff@ucf.edu for more information.<br>Bachelors<br>Engineering<br>Computer Engineering

APA, Harvard, Vancouver, ISO, and other styles

4

Nxumalo, Mduduzi E. "Transcoding multilingual and non-standard web content to voiceXML." Master's thesis, University of Cape Town, 2010. http://hdl.handle.net/11427/5132.

Full text

Abstract:

Includes abstract.<br>Includes bibliographical references (leaves 112-119).<br>Transcoding systems redesign and reformat already existing web interfaces into other formats so that they can be available to other audiences. For example, change it into audio, sign language or other medium. The bene_t of such systems is less work on meeting the needs of di_erent audiences. This thesis describes the design and the implementation details of a transcoding system called Dinaco. Dinaco is targeted at converting HTML web pages which are created using Extensible MarkupLanguage (XML) technologies to speech interfaces. The di_erentiating feature ofDinaco is that it uses separated annotations during its transcoding process, while previous transcoding systems use HTML dependent annotations. These separated annotations enable Dinaco to pre-normalize non-standard words and to generate VoiceXML interfaces which have semantics of content. The semantics help Textto-Speech (TTS) tools to read multilingual text and to do text normalization. The results from experiments indicate that pre-normalizing non-standard words and appending semantics enable Dinaco to generate VoiceXML interfaces which are more usable than those which are generated by transcoding systems which use HTML dependent annotations. The thesis uses the design of Dinaco to demonstrate how separating annotations makes it possible to write descriptions of content which cannot be written using external HTML dependent annotations and how separating annotations makes it easy to write, maintain, re-use and share annotations.

APA, Harvard, Vancouver, ISO, and other styles

5

Magno, Andrade Maciel Alexandre. "Investigação de um ambiente para o processamento de voz utilizando VoiceXML." Universidade Federal de Pernambuco, 2007. https://repositorio.ufpe.br/handle/123456789/2689.

Full text

Abstract:

Made available in DSpace on 2014-06-12T16:00:18Z (GMT). No. of bitstreams: 2 arquivo6359_1.pdf: 1628775 bytes, checksum: 08db22b2b12255c4a2cdff785454508f (MD5) license.txt: 1748 bytes, checksum: 8a4605be74aa9ea9d79846c1fba20a33 (MD5) Previous issue date: 2007<br>Desde a criação das primeiras máquinas, no século XVI, o homem busca estabelecer algum tipo de comunicação verbal com elas. Hoje, mais de 200 anos depois, a disponibilidade e a facilidade do uso do telefone para dialogar e a capacidade e a diversidade de conteúdos disponíveis na Web, podem ser unidos através das tecnologias de reconhecimento e síntese de voz, originando uma forma de comunicação verbal homem-máquina com grandes potenciais. Este trabalho tem com base o estudo das tecnologias de voz e contextualização das mesmas para a criação de uma interface com modo de interação via telefone e o levantamento dos modos de iniciativa de diálogo (do sistema, do usuário e mista). Para isso, são estudadas duas linguagens de marcação para a criação de aplicações de voz : VoiceXML e SALT. A linguagem VoiceXML foi escolhida pela sua maior disponibilidade e por estar numa etapa de amadurecimento maior que a linguagem SALT.Foram estudadas as principais características, funcionalidades e arquitetura para o desenvolvimento, hospedagem e publicação das aplicações desenvolvidas. Foram desenvolvidas três aplicações com modo de interação apenas por telefone e com modos de iniciativa distintos. Os sistemas construídos apresentam fluxo de diálogo e exemplos de execução para melhor entendimento. Por fim, realizou-se uma análise com pontos positivos, negativos e sugestões de aplicabilidade para cada uma delas

APA, Harvard, Vancouver, ISO, and other styles

6

Schnelle, Dirk. "Context Aware Voice User Interfaces for Workflow Support." Phd thesis, Saarbrücken VDM Verlag Dr. Müller, 2007. https://tuprints.ulb.tu-darmstadt.de/876/1/PhD.pdf.

Full text

Abstract:

Audio is a significant factor in the design of the human computer interface in ubiquitous computing. The characteristics of the medium allow for a hands-free interaction without the need to switch the focus to a display (eyes-free).Moreover, determining the graphical capabilities of the wearable device to adapt the output to the device is not needed. But audio based interfaces are also challenging, since humans are visually oriented. The ubiquitous computing community recognized the advantages of the audio channel, but the restrictions inherent to the medium are mostly ignored. Authors of such systems who know about these challenges often look for a solution by using additional modalities, preferably visually oriented. This thesis analyses these challenges with respect to the human’s cognitive capabilities and shows a possible solution using audio design patterns. Users on the move face additional challenges, since wearable devices do not have the computational power and storage capacity of desktop PCs which are required to use speech recognizers and speech synthesizers off the shelf. Thus recognition and synthesis software running on such a device does not have the capabilities of their desktop size counterparts. Streaming technologies can use high-end voice technology on a server at the cost of network dependency. This thesis introduces a new approach combining both worlds. In order to aid the user best, context information is used, location at first place. The context information is obtained from the current environment and from the task that she wants to accomplish. workflow engines are a standard for storing and controlling tasks as processes. Additionally, workflow engines consider the data flow which is a fundamental aspect of each process. Initial approaches to using context information in workflow engines exist but they lack a reusable concept.This thesis introduces such a concept. The process descriptions in workflow engines are modality independent in order to have reusable business logic. Modality dependant specifications of the current activity reside outside the workflow engine and are stored in a meta format that can be used to address multiple modalities. The user interfaces are generated by code generators at run-time. Current concepts consider neither a separation of modality independent business logic nor do they regard data flow. If a user is working on multiple processes in parallel, she needs support in returning into the context of an interrupted process. The challenges inherent to audio are not well suited to aid her resuming the task efficiently. In this thesis, we introduce a first concept of an audio-only support for task resumption.

APA, Harvard, Vancouver, ISO, and other styles

7

Bouyer, Antoine. "Plateformes et services multimodaux basés sur des interfaces plastiques." Caen, 2010. http://www.theses.fr/2010CAEN2048.

Full text

Abstract:

Cette thèse est à la jointure de deux domaines que sont la multimodalité et la plasticité des interfaces. La multimodalité consiste à pouvoir accéder à un système simultanément au travers de diff´erents modes de communication. La mise en place de telles infrastructures est r´eputée être difficile à cause de la diversité, technique comme fonctionnelle, des périphériques pouvant s’y connecter. Tout autrement, la plasticité consiste à d´efinir de manière abstraite une seule interface qui sera ensuite transformée vers différents formats, en fonctions de critères prédéfinis. Même si ces deux domaines ont été très étudiés par le passé, personne n’a pour l’instant utilisé la plasticité pour faciliter la conception et le développement de services multimodaux. Le travail r´ealisé durant cette thèse a étébde réfléchir, concevoir et mettre en place une telle plateforme. Cette thèse se décompose en trois parties. La première partie d´ecrit l’existant au d´ebut de la thèse. Nous retraçons en particulier l’´etat de l’art des deux domaines que nous utilisons par la suite. Nous évoquons aussi l’architecture de la plateforme PMX précédemment d´eveloppée et dont nous réutilisons certains de ces concepts. La seconde partie traite de la r´eflexion menée tout au long de nos travaux, et qui nous a amené à développer successivement deux plateformes multimodales. Nous expliquons en d´etails les raisons et les enjeux nous ayant poussés à effectuer des améliorations de l’existant. La dernière partie présente la validation et la concrétisation de nos travaux au travers des différents services mis en place et expérimentés, ainsi que des débouchés industriels actuellement en cours de transfert<br>This thesis brings together two research domains, namely multimodality and plasticity of interfaces. Multimodality allows for the accessing of a system simultaneously through different modes of communication. The realization of such infrastructure is difficult because of the technical and functional diversity of the devices that may be connected. Plasticity defines abstract interfaces that are then converted to different formats. Although these two domains have been studied extensively in the past, no one has, to date, used plasticity to facilitate the design and the development of multimodal services. The work carried out during this thesis was to think through, design, and implement such a platform. This thesis is divided into three parts. The first part describes the existing technology at the beginning of the thesis period. More precisely, we present the state of the art of the two domains. We also discuss the architecture of the PMX platform previously developed and from which we reuse some concepts. The second part deals with our reasoning throughout this study. We have developed two successive multimodal platforms, and we explain in detail the reasons and the issues that have pushed us to improve upon the existing technology. The final section presents the validation and implementation of our work through the different services we have developed and experimented. The market opportunities currently being transferred are also presented

APA, Harvard, Vancouver, ISO, and other styles

8

Maema, Mathe. "OVR : a novel architecture for voice-based applications." Thesis, Rhodes University, 2011. http://hdl.handle.net/10962/d1006694.

Full text

Abstract:

Despite the inherent limitation of accessing information serially, voice applications are increasingly growing in popularity as computing technologies advance. This is a positive development, because voice communication offers a number of benefits over other forms of communication. For example, voice may be better for delivering services to users whose eyes and hands may be engaged in other activities (e.g. driving) or to semi-literate or illiterate users. This thesis proposes a knowledge based architecture for building voice applications to help reduce the limitations of serial access to information. The proposed architecture, called OVR (Ontologies, VoiceXML and Reasoners), uses a rich backend that represents knowledge via ontologies and utilises reasoning engines to reason with it, in order to generate intelligent behaviour. Ontologies were chosen over other knowledge representation formalisms because of their expressivity and executable format, and because current trends suggest a general shift towards the use of ontologies in many systems used for information storing and sharing. For the frontend, this architecture uses VoiceXML, the emerging, and de facto standard for voice automated applications. A functional prototype was built for an initial validation of the architecture. The system is a simple voice application to help locate information about service providers that offer HIV (Human Immunodeficiency Virus) testing. We called this implementation HTLS (HIV Testing Locator System). The functional prototype was implemented using a number of technologies. OWL API, a Java interface designed to facilitate manipulation of ontologies authored in OWL was used to build a customised query interface for HTLS. Pellet reasoner was used for supporting queries to the knowledge base and Drools (JBoss rule engine) was used for processing dialog rules. VXI was used as the VoiceXML browser and an experimental softswitch called iLanga as the bridge to the telephony system. (At the heart of iLanga is Asterisk, a well known PBX-in-a-box.) HTLS behaved properly under system testing, providing the sought initial validation of OVR.<br>LaTeX with hyperref package

APA, Harvard, Vancouver, ISO, and other styles

9

Chuffart, Florent. "Conception d'une plate-forme de services ubiquitaires intégrant des interfaces multimodales distribuées." Phd thesis, Université de Caen, 2007. http://tel.archives-ouvertes.fr/tel-00203240.

Full text

Abstract:

Les technologies web ont permis de définir des services accessibles par tous, n'importe où et depuis de multiples terminaux. Si les utilisateurs intègrent maintenant l'usage des interfaces graphiques traditionnelles, la miniaturisation des équipements et l'émergence de nouvelles modalités offrent de nouvelles perspectives en matière d'informatique ubiquitaire.<br /> Dans ce mémoire, nous modélisons l'interaction ubiquitaire selon les principes fondamentaux des modèles de la littérature. Nous proposons une implémentation de notre modèle UbiArch sous forme d'une plate-forme de services et nous mettons à la disposition du concepteur d'applications web un ensemble d'outils interactifs, prêts à être intégrés dans des interfaces web.<br /> Nous illustrons les principes de notre modèle et le fonctionnement de notre plate-forme au travers de trois démonstrateurs. Chacune de ces réalisations intègre les capacités d'interaction des téléphones dans une application web et illustre les apports de notre travail en terme d'interaction multimodale distribuée.

APA, Harvard, Vancouver, ISO, and other styles

10

Zhao, Yuxiao. "XML-based Frameworks for Internet Commerce and an Implementation of B2B e-procurement." Licentiate thesis, Linköping University, Linköping University, PELAB - Programming Environment Laboratory, 2001. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5735.

Full text

Abstract:

<p>It is not easy to apply XML in e-commerce development for achieving interoperability in heterogeneous environments. One of the reasons is a multitude of XML-based Frameworks for Internet Commerce (XFIC), or industrial standards. This thesis surveys 15 frameworks, i.e., ebXML, eCo Framework, UDDI, SOAP, BizTalk, cXML, ICE, Open Applications Group, RosettaNet, Wf-XML, OFX, VoiceXML, RDF, WSDL and xCBL.</p><p>This thesis provides three models to systematically understand how the 15 frameworks meet the requirements of e-commerce. A hierarchical model is presented to show the purpose and focus of various XFIC initiatives. A relationship model is given to show the cooperative and competitive relationships between XFIC. A chronological model is provided to look at the development of XFIC. In addition, the thesis offers guidelines for how to apply XFIC in an e-commerce development.</p><p>We have also implemented a B2B e-procurement system. That not only demonstrates the feasibility of opensource or freeware, but also validates the complementary roles of XML and Java: XML is for describing contents and Java is for automating XML documents (session handling). Auction-based dynamic pricing is also realized as a feature of interest. Moreover, the implementation shows the suitability of e-procurement for educational purposes in e-commerce development.</p><br>Report code: LiU-Tek-Lic-2001:19.

APA, Harvard, Vancouver, ISO, and other styles

11

楊繡如. "Automatic Table Dialog Model on VoiceXML." Thesis, 2004. http://ndltd.ncl.edu.tw/handle/13036517084660491040.

Full text

Abstract:

碩士<br>國立臺灣師範大學<br>資訊教育研究所<br>92<br>ABSTRACT Recently, because of the development of technology, people rely more and more on various information services on Internet in their daily life. In the past, people using computers to access information services yielded to traditional Input/Output interface, for example, keyboard and mouse. Now, the appearance of mobile telecommunication and speech technology enable people to browse web pages by their voice and telephone, and this has become a new trend for using information services. Besides, these technologies can help disabilities, especially the sight-impaired people, to browse web pages and access information services by dialog interaction. Since the mid-1990s, the touch-tone interactive voice response (IVR) system was born. IVR systems only provide static voice service by sound recording. In 2000, VoiceXML came up. It not only provides more flexible voice services by speech recognition and speech synthesis but also integrates telecommunication and Internet for information services. However, VoiceXML is complicated and hard to develop. Consequently, this thesis proposed a methodology to transcode HTML to VoiceXML. This research focuses on transcoding the HTML table information and classifies HTML tables to six types. According to each type of HTML tables, the dialog models corresponding to each type of HTML tables is designed. Also, the VTG (Voice Table Generator) system which converts HTML tables to VoiceXML and VXPB (VoiceXML Portal Builder) system which helps user to create VoiceXML portal are presented. By means of VTG and VXPB, web page designer can build voice portal by easy operation. Telephone users can access voice portal using their voice to obtain the HTML table information. Therefore, people can obtain the information not only by “seeing” the web page but also “listening” the auditory web pages. Moreover, in order to test and verify VXPB and VTG, this research also uses VXPB and VTG to build voice portal with query functionality, such as "Web Bookstore Information" and "Portal of Department Information".

APA, Harvard, Vancouver, ISO, and other styles

12

Wang, Chih-Yuan, and 王至遠. "AutoVoice：A Translator from HTML to VoiceXML." Thesis, 2007. http://ndltd.ncl.edu.tw/handle/90261537895830926021.

Full text

Abstract:

碩士<br>國立中正大學<br>資訊工程所<br>96<br>Nowadays, WWW provides abundant information and services to people via graphical devices. With the prevalence of personal mobile voice devices, it would be very beneficial to provide the abundant information and services of WWW via personal mobile voice device.This thesis describes a translator, called AutoVoice, that automatically or semi-automatically translates HTML files into VoiceXML files. The structure is used to maintain the structure of relation in the WWW. AutoVoice consists of three components: a structure generator, a structure editor, and a VoiceXML generator. The structure generator automatically constructs the structure by extracting the data relation from HTML files. The structure editor provides the flexibility of editing the structure. The VoiceXML generator automatically translates the structure into VoiceXML files. AutoVoice could significantly reduce the cost of creating and maintaining a voice website.

APA, Harvard, Vancouver, ISO, and other styles

13

簡璽恩. "Extending Library Information Services by VoiceXML." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/66743627570772224581.

Full text

Abstract:

碩士<br>國立交通大學<br>電機資訊學院碩士在職專班<br>94<br>The Internet information environment is developing into a compound communication model with multimodal access, various interfaces and single data source. New technologies are continuously employed to construct new services. Facing a predicament of budget shortage, libraries still endeavor to sustain qualified service level; however, few libraries have enough budget and technique skill to provide their reader with new and creative services. Self-service voice systems are popular nowadays and have applied in commerce for many years, but rarely in the library domain. Tremendous investment and difficulty of the traditional voice technology is the main obstacle for libraries to implement self-service voice system. In view of this, this thesis attempts exploiting the VoiceXML standard to develop a low-cost voice service in libraries for expanding the scope of library information service This thesis aims at proposing a method that integrates VoiceXML with an existent WebPAC system to provide a voice service in a library. With the application of dynamic VoiceXML programming for capturing necessary information from HTTP responses, developers can overcome many mechanisms that are protected by vendors of the WebPAC system, including its authentication mechanism, without changing its data or architecture. In this manner, libraries can develop an appropriate voice service quickly and integrate it into the underlying WebPAC system easily and safely. Furthermore, libraries can provide their users a new channel to interact with WebPAC systems by a telephone or a voice interface.

APA, Harvard, Vancouver, ISO, and other styles

14

Chia-Wei, Chang, and 張家維. "SIVE：An Implementation of Simple Interactive VoiceXML Editor." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/02963082885271965403.

Full text

Abstract:

碩士<br>國立中正大學<br>資訊工程研究所<br>89<br>In the current world of the Internet, web browsers are the most popular application for everyone. The VoiceXML browser’s input and output use audio, (e.g. a microphone and speaker or a telephone). Certainly, for people who are not familiar with a computer, the latter is more user-friendly and more attractive. VoiceXML is a kind of XML (eXtensible Markup Language) language. It has the characteristics of the XML language, and it is used to process speech sounds. Speech Recognition Grammar Format and Speech Synthesis Markup Language are the speech technologies that apply in speech engine platforms by W3C. In this text, I describe a simplified, user-friendly VoiceXML interactive editor. The core adopts a DOM (Document Object Model) tree parser technology, and generates VoiceXML documents. Especially, I add the module of Elements and Attributes class in our SIVE implementations. In the future, this module can be expanded for the new version if VoiceXML 2.0 is released. This editor is also a part of a CAI (computer-assisted instruction) software project. The CAI software uses just-in-time online code. We are adding its voice functions. According to the students’ vocal responses, we can diagnose their study achievement and adjust the degree of difficulty of teaching material.

APA, Harvard, Vancouver, ISO, and other styles

15

Chung-Chun, Chen, and 陳宗俊. "A Simple VoiceXML Component for Computer Assisted Instruction." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/31081212750146431088.

Full text

Abstract:

碩士<br>國立中正大學<br>資訊工程研究所<br>90<br>VoiceXML (Voice Extensible Markup Language) is a markup language based on XML. It was proposed by W3C and designed mainly for telephony applications over telephones and cell phones. VoiceXML provides a speech user interface for the users to access web pages with telephones or cell phones by listening to the spoken prompts from system and submitting a request according to the user's spoken input. Our VoiceXML component (“VXP” for short) here is implemented as an ActiveX Control to be embedded into any ActiveX host application such as “Microsoft Internet Explorer”. Our VXP supports the syntax and semantics defined in the new VoiceXML 2.0 specification and is dedicated to provide the Text-to-Speech (TTS) and Speech Recognition (SR) functions that we need in our Computer Assisted Instruction (CAI) application.

APA, Harvard, Vancouver, ISO, and other styles

16

Jia-Yih, Lee, and 李佳益. "SVB: An Implementation of a Simple VoiceXML Browser." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/40901452444763681020.

Full text

Abstract:

碩士<br>國立中正大學<br>資訊工程研究所<br>89<br>VoiceXML is the Voice Extensible Markup Language. VoiceXML is a member of many standard XML-based markup languages. VoiceXML inherits XML's characteristics, like strict syntax and structure, easy implementation and processing, and extensibility. But because of these characteristics and the fact that the specification did not describe each VoiceXML element very clearly, few existing implementations are fully compatible with one another, especially in describing element relations. Here, we will illustrate each element very clearly and describe the elements relations with each other.

APA, Harvard, Vancouver, ISO, and other styles

17

Cho, Ming-Hua, and 卓明華. "The Establishment of VoiceXML Service Environment and Design of the Development Tools." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/73592982733505186860.

Full text

Abstract:

碩士<br>元智大學<br>資訊工程學系<br>90<br>Traditionally, due to the restriction of existing various API interfaces, there is no way to design a uniform speech-application development language. As a result, the developing procedure of an application system is time-consuming and it is usually a serious problem to transfer the works among the programmers. VoiceXML is a technology standard developed and managed by VoiceXML Forum, founded by IBM, AT&T, Lucent Technologies and Motorola, with the aim of driving the market for voice-enabled Internet access. The release of the VoiceXML Specification has already been presented to the W3C (The World Wide Web Consortium) for approval, thus developing application programs for VoiceXML will be as easy as those for HTML in the near future. The main goal is to bring about the rich contents of the Internet to the voice services, and enable the users to get the online information through telephones. However, at present, the browsing and editing tools are both designed by XML-based markup languages without the function of transforming the voice to text in Mandarin. Therefore, it is essential to develop VXML editorial and browsing tools supporting Chinese language. In this thesis, we will establish a VoiceXML service environment using a Cisco AS5300 VOIP Gateway, RTSP, TFTP and Web server. Based on IVR, Microsoft XML and Visual Basic, a VXML editing tool is designed and TTS (Text To Speech) for Chinese is also constructed. In this way, the developers are able to create VXML automatically by using the graphical interfaces designed in the custom tool. In order to provide the convenience of testing and monitoring for the developers, the input of DTMF and the Chinese voice browsing process are able to be heard through computer audio facilities.

APA, Harvard, Vancouver, ISO, and other styles

18

Chen, Mu-Yen, and 陳牧言. "The Study of the VoiceXML Mandarin Dialog System in the E-Commerce Networking Environment." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/34940795585903894718.

Full text

Abstract:

碩士<br>國立交通大學<br>資訊管理所<br>90<br>The web-based application by VoiceXML service on the Internet is gradually accepted for human-machine interaction because it provides the speech-enabled functionality and makes the telephone access a reality. However, it is not cost efficient to build voice only stand-alone web implementation and is more reasonable that voice interfaces should be retrofitted to be compatible or collaborated with the existing HTML or XML based web applications. Therefore, this thesis considers that the web site construction should be able to incorporate multiple access modes so that human beings can perceive and interact with either visual or speech response simultaneously. Under this principle, our research develops an integration web based Mandarin dialog system which adopts ASR, TTS, VoiceXML browser, and VoIP technologies to create user friendly interfaces for GUI and VUI. The user can use either traditional telephone line, cellular phone connection, or even VoIP by personal computer to interact with the VoiceXML server. In the mean time, browse the web content from the Internet and access the same document. The implementation system shows excellent performance and can be easily constructed into banks, tourisms, and e-commerce transactions with VoiceXML for wide accessibility.

APA, Harvard, Vancouver, ISO, and other styles

19

Chang, Tsung-Chuan, and 張聰泉. "The Interactive Voice Response System with Functions of Speech Recognition and Synthesis based on VoiceXML." Thesis, 2003. http://ndltd.ncl.edu.tw/handle/01251913527754000130.

Full text

Abstract:

碩士<br>國立高雄第一科技大學<br>電腦與通訊工程所<br>91<br>The telephone network is today''s most widely usedcommunication tool. Utilizing telephone for informationaccess, however, usually requires interactive with machine and human. The most traditional method is done through keystrokes notifying the information supplier what information to obtain.This type of primitive interaction method is inconvenient,the action of keyboard entry is cumbersome especially in the current rapid development of mobile communication. The ability to develop a human machine interface capable of accepting the user''s spoken dialog will fulfill the optimal dialog model desired by information suppliers. This dialog model includes the features to automatically recognize the user''s spoken language, and to synthesize voice for transformation of alphanumeric data into voice information. The objective of this work is to integrate speech recognition and synthesis. The integration of speech recognition and synthesis in this research is based on VoiceXML language specification, a technology jointly developed by IBM, AT&T, Lucent, Motorola, and other companies to allow consumers surf the web by means of voice interaction. The most notable advantage of the specification published by W3C is easy integration of Automatic Speech Recognition and Synthesis, and easier control and arrangement of dialog flow. These features make it suitable for developing voice application languages. To make voice applications more flexible, JSP language is utilized to dynamically generate VoiceXML files. This combination will enable more effective and more flexible development of voice applications. In this work we discuss VoiceXML language grammar rules,and articulate the interrelations of each dialog control elements. In-depth discussion on the related technologies supported by J2EE, especially the combined application of JSP and JavaBeans, enables more dynamic application of these related languages in developing voice application systems. Finally we discuss the design concepts of voice user interface, and analyze the structure of dialog flow. We compile a popular standard dialog control model and integrate it in a VoiceXML aided design network. It allows designers to conveniently query the needed design information and to apply the standard model, and to provide a real test environment to facilitate development of voice application systems. In our experiments we apply the research on related languages as described above by constructing an automatic Voice Response Central System for computer training courses to verify our research results, and build a Interactive Voice Response Central System with function of Speech Recognition and Synthesis using VoiceXML as the development language.

APA, Harvard, Vancouver, ISO, and other styles

20

Chaochun, Wang, and 王詔君. "Research of building and integrating of parent-teacher communication system of elementary school with VoiceXML." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/wa796a.

Full text

Abstract:

碩士<br>國立台北師範學院<br>數學暨資訊教育學系(含數學教育碩士班)<br>93<br>As the time changes, there are more and more dual-earner families in Taiwan. Parents usually have no time to get along with their children in the day time. Also, in the call sound of the educational reform, parents should pay more and more attention to children's behavior and learning in school. However, up to now, there is no good communication platform for teachers and parents. Usually, parents and teachers can communicate with each other by using traditional communication books. And, the teacher may use E-mail to deliver the messages of the class to the students and parents. However, some families have no computer network at home yet, and the computer is too cumbersome to carry, so message is unable to receive immediately. Otherwise by telephone network system, though convenient, users can receive the message immediately, but both the users should be on line at the same time. And, limited to the small telephone-set button, it is difficult to operate when users want to send the messages. Therefore, users always feel beset with difficulties while sending messages. So, in this research we use VoiceXML to integrate two major networks, computer network and PSTN, and to construct out a multi-functionally communication platform for parents and teachers of the elementary school. In order to let class members be able to communicate with each other in the most convenient way, we designed both web and voice application programs, and adopt the systematic structure of three tier architecture to integrate them. In the part of web application, user can not only receive and send messages, deliver or answer questionnaires, manage personal information, but also can monitor the replied messages so as to make sure the message is properly delivered to the recipient. In the part of voice application, we use VoiceXML to develop the interactive voice response system. The user not only can listen to the latest message and answer the questionnaire designed by the teacher, but also can find the person that he want to contact with by using the automated voice switching system. We tested this system in a class of a elementary school. For evaluating the effects of the proposed platform on communication of parents and teachers, we carried out some experiments for testing the system efficiency and user satisfaction. Experimental results show that our designed system is a useful communication platform for students, parents and teachers in elementary schools.

APA, Harvard, Vancouver, ISO, and other styles

21

Wu, Guo-Wei, and 吳國瑋. "Design of EIML / VoiceXML-based Dialogue Learning Gateway for Interactive Voice Learning System based on SCORM Content Aggregation Model." Thesis, 2004. http://ndltd.ncl.edu.tw/handle/06164566848862943300.

Full text

Abstract:

碩士<br>國立中正大學<br>電機工程研究所<br>92<br>The environment on the real world is more and more convenient for visually disabled people. But there are still huge amount of obstacles on the Internet for visually disabled people. They can not search information on the Internet with mice, keyboards, screens and their eyes. The major goal of this thesis is to propose a dialogue Learning Gateway. This gateway includes some states. These states can guide visually disabled user to voice browse the websites. This thesis also designs a tag mapping mechanism. There is a new markup language devised first. This new markup language is called Extensible Interface Markup Language (EIML) which is derived from XML. This tap mapping mechanism can map HTML tags to VoiceXML tags through EIML tags. Visually disabled users can search information on the Internet through the system which is consisted of the Dialogue Learning Gateway and tag mapping mechanism. They can browser the websites like the people with ordinary eyes. Finally, there is a case study. It shows a learning system. This learning system can supplies the courses to visually disabled learners. Visually disabled learners can take learning activities on the Internet through this Dialogue Learning Gateway.

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!