Dissertations / Theses on the topic 'Shan (The Chinese word)'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Shan (The Chinese word).'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Lo, Wing Sze. "Statistics-based Chinese word segmentation and new word detection /." View Abstract or Full-Text, 2002. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202002%20LOW.
Full textIncludes bibliographical references (leaves 83-86). Also available in electronic version. Access restricted to campus users.
Chen, Yiping. "Word recognition and reading in Chinese." Thesis, University of Oxford, 1993. http://ora.ox.ac.uk/objects/uuid:8f034ff4-f07c-459b-95f7-4b0a72e07c45.
Full textHu, Wenze. "Functional Perspectives and Chinese Word Order." The Ohio State University, 1995. http://rave.ohiolink.edu/etdc/view?acc_num=osu1392908184.
Full textHung, Chun-tak. "Chinese workbench : an integrated environment for Chinese writers /." [Hong Kong : University of Hong Kong], 1992. http://sunzi.lib.hku.hk/hkuto/record.jsp?B13227695.
Full text鄭佩芳 and Pui-fong Cheng. "A study on parts of speech, word formation, and the change of word meaning in modern Chinese." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1994. http://hub.hku.hk/bib/B31234124.
Full textLu, Xiaofei. "Hybrid models for Chinese unknown word resolution." Columbus, Ohio : Ohio State University, 2006. http://rave.ohiolink.edu/etdc/view?acc%5Fnum=osu1154631880.
Full text馮陳善奇 and Sydney S. K. Fung. "The poetry of Han-shan in English: a culturalapproach." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2001. http://hub.hku.hk/bib/B31224386.
Full text洪進德 and Chun-tak Hung. "Chinese workbench: an integrated environment for Chinese writers." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1992. http://hub.hku.hk/bib/B31210314.
Full textNolan, J. E. "Investigations on Gor-Kan-Shan-Lien-Tan, a Chinese herbal medicine." Thesis, Cardiff University, 1986. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.372346.
Full textPiao, Scott. "Sentence and word alignment between Chinese and English." Thesis, Lancaster University, 2000. http://eprints.lancs.ac.uk/52143/.
Full textMing, Yan, Reinhold Kliegl, Hua Shu, Jinger Pan, and Xiaolin Zhou. "Parafoveal Load of Word N+1 Modulates Preprocessing Effectivenessof Word N+2 in Chinese Reading." Universität Potsdam, 2010. http://opus.kobv.de/ubp/volltexte/2011/5710/.
Full textDai, Yee-wa, and 戴綺華. "The effectiveness of component word acquisition on the first grade students' repertoire of word." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2012. http://hub.hku.hk/bib/B50178052.
Full textpublished_or_final_version
Education
Master
Master of Education
Liu, Lezhong. "A Corpus-based Approach to the Chinese Word Segmentation." Diss., lmu, 2005. http://nbn-resolving.de/urn:nbn:de:bvb:19-56621.
Full textLau, Kai-yan Dustin, and 劉啟欣. "Compound word processing: development and disorder." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2012. http://hub.hku.hk/bib/B47249675.
Full textpublished_or_final_version
Speech and Hearing Sciences
Doctoral
Doctor of Philosophy
Fung, Chan Shin-kei Sydney. "The poetry of Han-shan in English : a cultural approach /." Hong Kong : University of Hong Kong, 2001. http://sunzi.lib.hku.hk/hkuto/record.jsp?B2327301x.
Full textZhang, Phyllis Ni. "Word order variation and end focus in Chinese : pragmatic functions /." Access Digital Full Text version, 1994. http://pocketknowledge.tc.columbia.edu/home.php/bybib/11714827.
Full textTypescript; issued also on microfilm. Sponsor: Clifford A. Hill. Dissertation Committee: Franklin E. Horowitz. Includes bibliographical references (leaves 125-128).
Hu, Xiaoling. "Topics in Chinese syntax : word order in synchrony and diachrony." Thesis, Durham University, 1999. http://etheses.dur.ac.uk/1196/.
Full textZheng, Zhonghua. "Chinese characters and word recognition: An analysis and a model." Thesis, University of Ottawa (Canada), 1990. http://hdl.handle.net/10393/5573.
Full textChen, Jing. "Character, Word, and Student Characteristics Contributing to Concept of Word in Chinese: Cross-Classified Multilevel Logistic Models." The Ohio State University, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=osu1448290601.
Full textZhou, Xiaolin. "The mental representation of Chinese disyllabic words." Thesis, University of Cambridge, 1992. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.259648.
Full text周韻琼 and Wan-king Janice Chow. "Urban villa for Chinese folk arts and crafts." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2002. http://hub.hku.hk/bib/B31986390.
Full textWen, Yun. "Behavioural and electrophysiological investigations of Chinese translation activation during English word recognition in Chinese-English bilinguals." Thesis, University of Nottingham, 2017. http://eprints.nottingham.ac.uk/43208/.
Full textWong, Leung-wai, and 王良慧. "Investigating consistency and orthographic neighbourhood density effects in Chinese character processing." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2012. http://hdl.handle.net/10722/197118.
Full textpublished_or_final_version
Linguistics
Master
Master of Philosophy
Lau, Ka-po Natalie. "Semantic interference of Chinese words in the picture-word interference task /." View the Table of Contents & Abstract, 2006. http://sunzi.lib.hku.hk/hkuto/record/B36846181.
Full textLau, Ka-po Natalie, and 劉家寶. "Semantic interference of Chinese words in the picture-word interference task." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2006. http://hub.hku.hk/bib/B45007597.
Full textLu, Ching Ching, and 呂菁菁. "Chinese Word Recognition." Thesis, 1996. http://ndltd.ncl.edu.tw/handle/76563553576796520363.
Full textChou, Tsai-jung, and 周彩蓉. "A Study of Polysemous Words 'Shang' and 'Xia' in Chinese." Thesis, 1999. http://ndltd.ncl.edu.tw/handle/26051142098246236972.
Full text國立清華大學
語言學研究所
87
Human beings use language to express their thoughts and ideas. Polysemy, i.e. a property that a linguistic form has more than one meaning, is common in a natural language system. This thesis attempts to furnish a cognitive perspective toward lexical semantics by the observations and analyses of related issues concerning shang (上) and xia (下). Polysemous words shang and xia can be used not only as locative words, but also as main verbs and verbal complements, and sense extension occurs ranging from spatial domain to non-spatial domain. The network of these related senses also attests to cognition as the foundation of linguistic analysis. The methodology adopted in this thesis is conceptual structure approach (Jackedoff 1983, 1990). The surface expressions concerning shang and xia are decomposed into a set of finite semantic elements, and the mapping from conceptual structures to surface forms are made possible by lexicalization patterns. Expressions of locative meaning are represented by image-schema in order to illustrate how we transform spatial configuration in physical world into linguistic forms. The mechanisms involved in sense extension are the notion of compositionality, metaphor, metonymy and inference from context or real world knowledge. The basic sense and extended senses form a radial structure category. Meanwhile, we also find that shang and xia are in the process of grammaticalization. Both sense extension and grammaticalization are motivated by the fact that vertical spatial notion is human beings' basic cognitive ability.
Franck, Mary E. "Chinese color word evolution/." 2002. https://scholarworks.umass.edu/theses/1525.
Full textLin, Alex, and 林士揚. "Grassroots Innovation: Chinese “Shan Zhai” Mobile Phones." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/96899127038283773652.
Full text國立清華大學
經營管理碩士在職專班
98
In 2008, “Shan Zhai” went mainstream, particularly “Shan Zhai” mobile phones” which have flooded China during past few years. In this process, MediaTek, the so-called king of boxing in Taiwan’s semiconductor industry, played a key role. However, can the success of “Shan Zhai” mobile phones be attributed to MediaTek only? This research expects to base on “disruptive innovation” theory to explore the development and evolution of “Shan Zhai” mobile phones value chains and the role that the Chinese government played. Through linking data to theories, this research aims to decipher the secrets of how “Shan Zhai” mobile phones achieved success and where the sources of disruptive innovation in “Shan Zhai” mobile phones came from. In this research, I found behind Mediatek's success is the development of a cheap and easy-to-develop SOC (system on chip) that integrates the software for all complex functions a mobile phone needs on a single chip. With its innovative chipset, Mediatek has lowered the barrier for producing mobile phones to a minimal level. Besides this, we also cannot neglect the fact of China’s being the world’s biggest factory agglomerated particularly at Shenzhen. The “ant army” nurtured through the whole process of supply chain formation, has been growing since the Chinese economic reform and has formed a “virtual factory” led by semiconductor firms. By using speed and cost as its main strategic advantages, this factory nibbled away like a silkworm or swallowed like a whale the market share of the Chinese mobile phone market. To the surprise of the Chinese government, the policy that was designed to create the 2nd Lenovo, Huei-Wei, or the strong intervention that was geared towards the development of China’s own telecom standard, have resulted in the rise of “Shan Zhai” mobile phones. The success of “Shan Zhai” mobile phones was an accident. As a follower, the vendors of “Shan Zhai” mobile phones were brave to try something new and create an innovative environment. Moreover, low-end customers in the countryside which consisted of the majority of China’s population, generated a lot of demand for China’s domestic market and became the major source of “grass-roots” innovation. In the end, the research came out the conclusion and contribution as the following: “Shan Zhai” mobile phones bring not only advantages but also disadvantages to suppliers across the Taiwan Strait. We can see many issues that vendors of “Shan Zhai” mobile phones are facing in the short term such as over-competition, difficult to reform, being resisted by many countries. These issues concern not only vendors of “Shan Zhai” mobile phones but also Taiwanese suppliers which locate at middle- and down- stream of the whole supply chain. Taiwan’s government should pay more attention to the economic influence of “Shan Zhai” culture across the Strait, and to compromise and negotiate with the Chinese government by establishing bilateral institutions. After all, given that the conditions of “Shan Zhai” mobile phones keeps involving and outside environment is constantly changing, Taiwanese firms can never satisfy with the currently situation, no matter system firms like EMS or supplier firms like equipments and components. They must have the attitude to increase their competitive capabilities through continuously improving technologies, speed, human resource, strategies, marketing and branding. Nobody can foresee “Shan Zhai” mobile phone will be phase-out or remain mainstream in the market so far.
Ho, Chia Yun, and 何佳芸. "Chinese word recognition: evidence from word superiority effect." Thesis, 1996. http://ndltd.ncl.edu.tw/handle/29309921461752611114.
Full textYANG, WAN-CHUN, and 楊婉君. "The Relation of Chinese Word Clustering and Word Chunking." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/36277945631514920982.
Full text國立交通大學
電信工程研究所
101
Word Chunking is an important task in NLP. The difference between the structure of word chunk, Part of Speech and the structure of word has effects on the recognition of word chunk. Good word chunker can be used in many applications of NLP, such as search engine, speech recognition and text to speech. Constrcuting a good Chinese word chunker is a challenging task because Chinese is more complicated than other languages in many aspects. For instances, Chinese grammar structure is very complicated, there is no boundary between words, and Chinese lexicon is large. This study focuses on Chinese word clustering and word chunking. We first use the TFIDF technique to create a lexicon with 180k words whose coverage is greater than 99%. We then find the relations of preceding-word and following-word to create feature vectors, and use symmetric Kullback-Leibler divergence (KL) to cluster words into classes. The results of word clustering are then used in two word chunking studies. One employs the condition random field (CRF) method to build a base-phrase chunk model from the Sinica Treebank Corpus. The features used include POS, information of word clusters, and sematic information in E-Hownet. In another study, we use the features of M-value and boundary entropy to find rules for chunking words and word-classes.
鄭茂隆. "Chinese Word Clustering for compounding." Thesis, 2014. http://ndltd.ncl.edu.tw/handle/75548826491619740204.
Full text國立交通大學
電信工程研究所
103
The thesis preposes to use word clustering to assist in Chinese word compounding. It first uses co-occurrence frequencies of the current word and two nearest neighboring words to define a symmetric Kullback-Leibler divergence (KLD) as the distance measure of two words. Using the KL distance, words of the same part-of-speech (POS) are divided into clusters. Further clustering is then performed based on semantic distance of words. The forward and backward bigrams algorithm is then applied to find highly correlated word-cluster pairs for compound-word construction. Word-cluster pairs of some unreasonable POS pairs are excluded. Besides, pseudo compound words are detected by using a stataistics-based method. The proposed method was evaluated using Sinica Treebank sentences comprising two-word basic phrases. Experimental results showed that an F-measure of 0.58 was achieved. The performance was better than the method using KLD-based clustering by 0.14. The word compounding results are then used together with POS, word length, and some word-level features to detect Chinese basic phrases. Two methods using conditional random field (CRF) and support vector machine (SVM) are studied. Experimental results on the Sinica Treebank Corpus showed that the F-measure of the SVM-based method is 0.857 which is 0.011 better than that of the CRF-based method
Chen, Shan-Long, and 陳山龍. "The study of Chinese Word." Thesis, 1994. http://ndltd.ncl.edu.tw/handle/97670729667497026044.
Full textLou, Yi-Hsuan, and 樓逸軒. "Using the Word Restructure Method to Improve Chinese Word Segmentation." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/jf7r73.
Full text中原大學
資訊管理研究所
104
In this paper, we propose a word restructure method which is based on simple probability to reorganize the high probabilities combinations of Chinese sentences after Chinese segment processes, also use word restructure method to conduct terms restructure, computing of restructure probability and comparing of restructure combinations, then generate the higher probability of restructure combinations to represent word units. Finally, use CBMI to achieve optimized Chinese word segmentation of two or more terms. CKIP has great segment performance in terms of short length by 99.69%, but the accuracy and segment performance will have biased error within the terms of length increase. Hung proposed CBMI that added the domain concepts from WOMs and modified the fitness function of Genetic Algorithm which use the statistical technique of Mutual Information to enhance segmentation results more correctly and efficiently, it can find not only the relevance between Chinese words but also increase the influence of domain concepts and Chinese words connections. In this paper, we proposed a processes of Chinese word segmentation which are (1) CKIP, (2) Word Restructure and (3) CBMI to improve CKIP results of Chinese word segmentation, also (1) and (3) can change other Chinese word segmentation to show the applicability of this processes. Otherwise, we improve the CBMI which added the factor of terms length to influence the computing of weightings in GA fitness function. Finally, by the comparison of this proposed model, CKIP for domain classification of Chinese documentation, we then can conclude which has better performances in Chinese word segmentation. We also take three different domain concepts WOMs (movie, food and cosmetic reviews) which are crawled on the internet as datasets for experiments in this paper. As the results, the Chinese segmentation model which we proposed has the best performance of accuracy in move and food domain datasets.
Huang, Pei-ru, and 黃珮茹. "Research on constructing Chinese word-relationship ontology to improve word selection in English-to-Chinese translation." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/44507865443137884003.
Full text中國文化大學
資訊管理學系
98
Translation between languages is a high challenge work due to difficulties occurred during the translation process. Mapping and analysis between sentence structures, proper semantic selection of polysemous words, translation of idioms, and so on are major factors causing difficulties in translation between different language systems. Because of the above factors, machine language translators cannot meet users’ sa-tisfaction even thought they have been published for years. The research emphasizes on Chinese word selection in English-to-Chinese translation, especially on polysemous words. The associated relationships among Chinese words are discovered to construct an ontology. The Chinese word-relationship ontology is capable of improve the accuracy in English-to-Chinese translation. The exact meaning of a word is determined on its part of speech and its associated pre-words and post-words in the sentence. In this thesis research, five rules which con-tain important and meaningful part of speech such as verb, noun, adjective, adverb, are established to detect word relationships and word status of translating language in a sentence. They are (R1) noun + noun; (R2) adjective + noun; (R3) adverb + verb; (R4) noun + verb, and (R5) verb + noun. By recursively applying these rules along with the knowledge in the Chinese word-relationship ontology, the Chinese translation of the English word is determined. Our experiments showed that use of the ontology and increase the number of rules in the ontology can improve the accuracy of word selection obviously.
Chen, Train-Min, and 陳春敏. "Morphological Encoding in Chinese Word Production." Thesis, 2007. http://ndltd.ncl.edu.tw/handle/94717112452368421826.
Full text國立中正大學
心理學所
95
The dissertation investigated the issue of whether the production of a Mandarin Chinese word involves the processing of its constituent morphemes. Experiments 1-6 employed the implicit priming task. In the task, the participants first learned a set of word pairs, the members of which were associatively related. Then, they were shown the first word of each pair and had to say out loud the second word as quickly and correctly as possible. When the second words to be produced in a block shared the initial portion, production latencies were typically shorter than when they did not share the initial portion. This has been referred to as the implicit priming effect. Experiment 1 demonstrated a reliable implicit priming effect when the disyllabic compound words to be spoken shared the first tonal syllable (i.e., syllable plus tone), but the effect did not vary whether the shared tonal syllables had the same orthography. Experiment 2 showed that the implicit priming effect was observable only when the disyllabic compound words to be spoken shared the first character, but not when they shared the second character. Experiment 3 showed that characters and their corresponding tonal syllables produced equal sizes of implicit priming effect. This was replicated in Experiment 4 using disyllabic monomorphemic words. Experiment 5 observed comparable implicit priming effects whether or not the shared characters corresponded to the same morpheme. Experiment 6 showed that the implicit priming effect of the shared morpheme was insensitive to the frequency of the morpheme. The combined evidence from the implicit priming task gave no support for the involvement of morpheme in Chinese word production. To examine whether the null results were restricted to the implicit priming task, the picture-word interference task was adopted in the next set of experiments. The task involved having participants name a picture while ignoring a distractor word that was morphologically related to the name of the picture (overlapped on morpheme, character, and tonal syllable), character related to it (overlapped on character, and tonal syllable), phonologically related to it (overlapped on tonal syllable), semantically related to it (overlapped on meaning), or unrelated to it. Under the conditions when the distractor word preceded the picture by 100 ms (i.e., SOA = -100 ms), there were reliable effects for morphologically and character related trials (both being facilitatory), but not for semantically or phonologically related trials. When the distractor word was directly superposed on the picture (i.e., SOA = 0 ms), there were reliable facilitatory effects for morphologically, character, and phonologically related trials. At both SOAs, morphological facilitation was equal to character facilitation, and character facilitation was greater than phonological facilitation. When the pictures were replaced by their names and the SOA was set at -100 ms, the results showed reliable morphological, character, and phonological facilitations, all of equal size. Finally, when the distractor word and the paired picture were separated by 7-10 unrelated trials (words or pictures), the result showed reliable morphological and character facilitations, and morphological facilitation was slightly greater (by 8 ms) than character facilitation. The evidence from the picture-word interference task suggested that constituent morphemes are processed in the course of word production in Chinese, but they contribute very little to the outcome of word production. The dissertation concludes with a discussion of why such might be the case.
Lin, Qian-Xiang, and 林千翔. "Chinese Word Segmentation using Specialized HMM." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/jea684.
Full text國立中央大學
資訊工程研究所
94
The first step in Chinese language processing tasks is word segmentation. Various methods have been proposed to address this problem in previous studies, e.g. heuristic-based approaches, statistical-based approaches, etc. HMM is a statistical machine learning approach that has been successfully applied in many fields, e.g. POS tagging, shallow parsing, and so on. However, we find that standard HMM achieved only 80% results in Chinese word segmentation. As is commonly known, segmentation ambiguity and unknown word occurrence are two main problems in Chinese word segmentation. In this paper, we proposed a two-stage specialized HMM by incorporating these information into the model. In the first stage, we combine the maximum matching heuristics to incorporate segmentation ambiguity and use a masking approach to handle unknown word information. By extending the observation symbols, the proposed M-HMM is improved from 0.812 to 0.953 in F-measure. At the second stage, we use lexicalization technique to further enrich HMM performance. The idea is to add new state symbols for high frequency characters or high tagging error symbols. Experimental results show that Lexicalized M-HMM is improved from 0.953 to 0.963 in F-measure.
Wu, Yi-An, and 吳怡安. "Word Sense Disambiguation with Chinese Wordnet." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/07643624890039959381.
Full text國立臺灣大學
語言學研究所
104
The aim of the thesis attempts to establish a sense tagger for the Chinese Wordnet, which is an important representations of lexical semantics and provides distinction of senses and lexical semantic relations. The construction of the sense tagger requires the techniques of Word Sense Disambiguation (WSD), which is an old but still unsolved problem in Natural Language Processing (NLP). The approaches of the study involve the supervised learning methods and the neural word embeddings for certain lexical samples. The training corpora are the human annotated texts from Sinica Corpus and PTT Corpus. The LOPE Text Analytics, which is the implementation of the study, provides several applications including sense tagger and other text analytic modules for researchers.
Pei-fen, Lee, and 李佩芬. "Chinese Bilingual Children’s Word Definition Skill." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/84977086321186624510.
Full text國立臺灣大學
語言學研究所
89
The aim of this study is to examine the word definition skills of three different groups of Chinese bilingual children. These three bilingual groups are: (1) Mandarin-English bilingual children in an immersion program (ME-I); (2) Mandarin-English bilingual children in a bilingual program (ME-B); (3) Mandarin-Taiwanese bilingual children in a monolingual program (MT-M). There are also three age levels: (1) middle class in the kindergarten (K2); (2) top class in the kindergarten (K3); and (3) first grade in the elementary school (G1). In addition to the Chinese word definition task, these children’s language proficiency in three languages (Mandarin, English, Taiwanese), comprehension of related superordinate terms and their flexibility in classifying various sorts of objects were also examined as controls. One of our major concerns is on the possible differences on the metalinguistic development of these three bilingual groups. Results from the word definition task indicated that Mandarin-English bilinguals of first graders who had once enrolled in an immersion program perform significantly better than their Mandarin-Taiwanese counterparts as well as the Mandarin-English bilinguals in the bilingual program. The differences between Mandarin-English bilinguals in immersion program and Mandarin-Taiwanese bilinguals in monolingual program would be caused by their different route in acquiring the second language while the differences between the two types of Mandarin-English bilinguals would be caused by their different amounts of input of the second language. In the two control tasks, the comprehension of the superordinate term and the object classification task, no differences were found among these three different groups. Besides, the possibility that the source of the significant growth in word definition might come from these children’s better language proficiency was also excluded by further examination. To conclude, only bilingual children who receive second language education at school with enough stimuli will show better metalinguistic skills.
Ming-Yu, Lin, and 林銘裕. "The Study in Chinese Word Segmentation." Thesis, 1993. http://ndltd.ncl.edu.tw/handle/88008624061686807896.
Full textChen, Qiyou. "Dang dai shan shui xiao pin wen yan jiu." 1985. http://catalog.hathitrust.org/api/volumes/oclc/18830707.html.
Full textZhan, Jia-Cheng, and 詹嘉丞. "Handling Non-Traditional Chinese Words in a Chinese Word Segmentation System." Thesis, 2009. http://ndltd.ncl.edu.tw/handle/99452650729173633924.
Full text國立臺灣海洋大學
資訊工程學系
97
This paper propose methods to handle non-Traditional Chinese words in a Traditional Chinese article when doing word segmentation. The cases of non-Traditional Chinese words include Japanese names (whether written in Japanese Kanji or Chinese), variant forms, and foreign words. Both input sentence and output sentence of our Chinese word segmentation is coded in UTF-8 to allow multi-language writing in one sentence. Besides using a dictionary, we also developed identification rules for many special word classes, including temporal expressions, numerical expressions, and person names. These resources are used to propose candidate words in the input sentence. All possible segmentation paths are then generated. The one with the highest probability is proposed as the final segmentation result. The sets of person name and number classes are huge and infinite. They also cause higher ambiguity in word segmentation. So we use their class probabilities instead of individual word probabilities to calculate the probability of segmentation. When dealing with person names, we propose a new model, name pattern probability model, to capture the probability of seeing a certain name pattern in an article. Experimental results show that the newly added probability model is helpful. When dealing with Japanese names, we need to map the writings into Chinese. We propose a method to map variant Hanzi into Traditional Chinese characters. By doing so, it is possible for our system to handle names written in Japanese Kanji and Chinese in the same sentence. With proposing candidate words of special word classes and Japanese or Chinese person names, the F-measure score of our word segmentation is improved from 94.16% to 96.23%. Moreover, 109 news articles with hand-tagged Japanese names are also tested. 81.4% of 862 Japanese names are correctly generated. The method of mapping variants can also be used to generate variant terms of Traditional Chinese words. By doing so, it is possible to propose all candidates whenever they are written in Traditional Chinese, Simplified Chinese, or variant forms.
Lin, Chun-Wei, and 林俊偉. "Searching Documents with Composite Chinese Word Segmentations." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/26152106905264225206.
Full text國立中央大學
企業管理研究所
98
In natural language, “word” is the most basic element. Owing to an article is constituted by lots of words; we must separate those words apart first then go to research. Because there is no word-spacing in Chinese article, like the one as the boundary between every word in English article; therefore, we need to divide Chinese words through Word Segmentation from the very first beginning then go to further analysis. In Chinese Word Segmentation Process, CKIP in Academia Sinica is the most prominent. However, due to the design principle of CKIP has been constrained by the words only in the data, as for the words not included (named Unidentified Word) are hardly to sort out correct word segmentation result. The purpose of this research is to solve the problem of Unidentified Word: to raise the accuracy of Word Segmentation System for further Keyword, which is really able to be represented the whole article, Search Processing. The foundation of this research is the result from Chinese Word Segmentation System, and I tried to combine the result to piece up the complete lexicon and original meaning, compensating for the deficiency of undiscovered when Unidentified Word being separated. Afterward I snatched further Keyword through the original word segmentation result and combined lexicon. The results showed that precision and recall of composite Chinese word segmentation are much better than CKIP and Google, it also verify the validity of the system.
Peng, Shu-Chen, and 彭淑貞. "Chinese Bilingual Children''s Word Learning Strategies." Thesis, 1999. http://ndltd.ncl.edu.tw/handle/39730357575423044211.
Full text國立臺灣大學
語言學研究所
87
The mutual exclusivity constraint, which is suggested to be one of the most often used strategies when children learn new words, refers to the finding that children often avoid two labels for the same object (Markman, 1994). This constraint is generally considered to have four effects-disambiguation, correction, rejection, and restriction effects (Merriman & Bowman, 1989). Davidson et al. (1997) further showed that the mutual exclusivity constraint would be affected by bilingualism; that is, bilingual children did not use the constraint as extensively as that of monolingual children. Pursuing this line of inquiry, the present study focused on Mandarin-Taiwanese bilingual children''s use of the mutual exclusivity constraint. Following Davidson et al. (1997), the present study comprised three tests; namely, the disambiguation test, the rejection test, and the restriction test, to examine whether bilingualism would affect children''s use of the mutual exclusivity constraints. Furthermore, these tests were accessed in three different input conditions, to examine these children''s sensitivities to different language inputs. These different input conditions included: (1) All-Mandarin: Throughout the test, only Mandarin was used, including the target artificial words; (2) Mixed-code: Throughout the test, only Mandarin was used, except that the target artificial names resemble to Taiwanese words in terms of phonological shapes; (3) All-Taiwanese: Throughout the test, only Taiwanese was used, including the target artificial words. 38 monolingual Mandarin-speaking and 58 Mandarin-Taiwanese bilingual children participated in the three tests, including younger (4-year-old) and older (6-year-old) ones. The results indicated that the influence of bilingualism was only significantly different in the restriction test. Bilingual children tended to suspend the use of the mutual exclusivity constraint more often than monolingual ones did. Besides, our results indicated that children''s sensitivities to different languages were influenced by bilingualism. In the disambiguation test and the restriction test, children of the bilingual group were more sensitive to different language inputs than monolingual children were. However, our results supported that neither children''s applications of the mutual exclusivity constraint nor their sensitivity to different language inputs were influenced by their developmental pattern, except that in the disambiguation test, only younger children of the monolingual group were sensitive to different language inputs, while older ones were not. Taken together, the findings of the present study suggested that young bilinguals in Taiwan are sensitive to the type of language they are spoken to and would invoke different learning strategies accordingly.
yi, Lin hsin, and 林欣怡. "Segmental Mismatch in Chinese Spoken Word Recognition." Thesis, 2008. http://ndltd.ncl.edu.tw/handle/27324463728021840592.
Full text國立高雄師範大學
聽力學與語言治療研究所
96
The purpose of the study was to investigate if phonological priming effect would be observed when the prime mismatches with the target by one segment in first constituent of Chinese disyllabic words. The study was a cross-modal priming paradigm, where an auditory disyallabic stimulus was followed by a disyllabic word or nonword visually presented. Subjects made lexical decision responses to disyllabic words or nonwords. In Experiment 1, the consonant in first position of word was varied, with the other part of word being held constant. Experiment 2, the vowel in first position of word was varied, with the other part of word being held constant. In the two experiments, the priming effect was not observed when the prime mismatches the onset syllable of disyllabic words by one segment. The results seemed to be consistent with the Cohort model which imposed the most stringent restriction as to the membership of the candidate word set, but against the TRACE model and the model of lexical representation of Chinese disyllabic words (Zhou & Marslen-Wilson, 1994)
Chen, Young-Teh, and 陳永德. "Using Word Segmentational Rules of Long word first, frequency contrast and front word first in reading Chinese." Thesis, 1997. http://ndltd.ncl.edu.tw/handle/01258303667318395067.
Full textTyng, Lim Uei, and 林威廷. "Development of Fo Guang Shan in Malaysia’s Chinese Community – Research study on Fo Guang Shan Monastery in Johor Bahru." Thesis, 2011. http://ndltd.ncl.edu.tw/handle/71251029700375872606.
Full textHUANG, SHI-ZHE, and 黃士哲. "The study of environmental perception of Chinese landscape (shan-shui) painters." Thesis, 1987. http://ndltd.ncl.edu.tw/handle/34834404236822693350.
Full textMei, Lin Ju, and 林如美. "Distributed word recognition teaching and Chinese Characters in groups teaching in word recognition difficulties student's word recognition performance." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/46934930829903599578.
Full text國立屏東師範學院
特殊教育學系碩士班
94
Abstract The major purpose of this study was to compare the effectiveness between the method of “Chinese Characters in groups teaching” to the method of “Distributed word recognition teaching” with three of Three-Grade elementary students with word recognition difficulties. The effectiveness included evaluation word recognition performance in the alternating and maintaining stage, and learning attitude of two methods. The research method adopted in this study was alternating treatment design of single subject research including base line stage, alternating stage, and maintaining stage. The findings were concluded as follow: 1. The alternating stage:The effectiveness of “ Distributed word recognition teaching” method was better than “Chinese Characters in groups teaching ”method in the word recognition performance of the subject. 2. The maintaining stage: The effectiveness of “ Distributed word recognition teaching” method was better than “Chinese Characters in groups teaching ” method in the word recognition performance of the subject. 3. Different assessment stage: The students’ performance of immediate phase was better than that of maintaining phase after the experimental instruction program had been completed. 4. Different assessment methods: The students performed the best in the word-recognition with context clues between two methods. 5. Error type analysis: In base line stage, alternating stage and maintaining phase, two word-recognition teachings had similar error types. But, there were some difference between two methods. 6. Learning attitude: Three students had different attitude toward two methods. They displayed less attention when the researcher use “Chinese Characters in groups teaching ”method. According to the results above, the researcher provided some recommendations to the further relevant studies and teachers to teach students with word recognition difficulties.
Chuang, Chun-Yu, and 莊淳宇. "Computer Word Entry Modifies Processing of Chinese Characters." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/85540224337716639904.
Full text