To see the other types of publications on this topic, follow the link: Syntactic features.

Dissertations / Theses on the topic 'Syntactic features'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 38 dissertations / theses for your research on the topic 'Syntactic features.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Brigadoi, Ivan. "Genre classification using syntactic features." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-454667.

Full text
Abstract:
This thesis work adresses text classification in relation to genre identification using different feature sets, with a focus on syntactic based features. We built our models by means of traditional machine learning algorithms, i.e. Naive Bayes, K-nearest neighbour, Support Vector Machine and Random Forest in order to predict the literary genre of books. We trained our models using as feature sets bag-of-words (BOW), bigrams, syntactic-based bigrams and emotional features, as well as combinations of features. Results obtained using the best features, i.e. BOW combined with bigrams based on syntactic relations between words, on the test set showed an enhancement in performance by 2% in F1-score over the baseline using BOW features, which translates into a positive impact of using syntactic information in the task of text classification.
APA, Harvard, Vancouver, ISO, and other styles
2

Lumsden, John Stewart. "Syntactic features : parametric variation in the history of English." Thesis, Massachusetts Institute of Technology, 1987. http://hdl.handle.net/1721.1/14702.

Full text
Abstract:
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Linguistics and Philosophy, 1987.
Title as it appears in M.I.T. Graduate List, Sept. 1987: Syntactic features--parameters in the history of English.
Bibliography: v. 2, leaves 418-422.
by John Stewart Lumsden.
Ph.D.
APA, Harvard, Vancouver, ISO, and other styles
3

Nădejde, Maria. "Syntactic and semantic features for statistical and neural machine translation." Thesis, University of Edinburgh, 2018. http://hdl.handle.net/1842/31346.

Full text
Abstract:
Machine Translation (MT) for language pairs with long distance dependencies and word reordering, such as German-English, is prone to producing output that is lexically or syntactically incoherent. Statistical MT (SMT) models used explicit or latent syntax to improve reordering, however failed at capturing other long distance dependencies. This thesis explores how explicit sentence-level syntactic information can improve translation for such complex linguistic phenomena. In particular, we work at the level of the syntactic-semantic interface with representations conveying the predicate-argument structures. These are essential to preserving semantics in translation and SMT systems have long struggled to model them. String-to-tree SMT systems use explicit target syntax to handle long-distance reordering, but make strong independence assumptions which lead to inconsistent lexical choices. To address this, we propose a Selectional Preferences feature which models the semantic affinities between target predicates and their argument fillers using the target dependency relations available in the decoder. We found that our feature is not effective in a string-to-tree system for German-English and that often the conditioning context is wrong because of mistranslated verbs. To improve verb translation, we proposed a Neural Verb Lexicon Model (NVLM) incorporating sentence-level syntactic context from the source which carries relevant semantic information for verb disambiguation. When used as an extra feature for re-ranking the output of a German-English string-to-tree system, the NVLM improved verb translation precision by up to 2.7% and recall by up to 7.4%. While the NVLM improved some aspects of translation, other syntactic and lexical inconsistencies are not being addressed by a linear combination of independent models. In contrast to SMT, neural machine translation (NMT) avoids strong independence assumptions thus generating more fluent translations and capturing some long-distance dependencies. Still, incorporating additional linguistic information can improve translation quality. We proposed a method for tightly coupling target words and syntax in the NMT decoder. To represent syntax explicitly, we used CCG supertags, which encode subcategorization information, capturing long distance dependencies and attachments. Our method improved translation quality on several difficult linguistic constructs, including prepositional phrases which are the most frequent type of predicate arguments. These improvements over a strong baseline NMT system were consistent across two language pairs: 0.9 BLEU for German-English and 1.2 BLEU for Romanian-English.
APA, Harvard, Vancouver, ISO, and other styles
4

Holmberg, Anders. "Word order and syntactic features in the Scandinavian languages and English /." Stockholm : Dept. of General Linguistics, University of Stockholm, 1986. http://bibpurl.oclc.org/web/33078.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Sundström, Alex. "Investigation into predicting unit test failure using syntactic source code features." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-233382.

Full text
Abstract:
In this thesis the application of software defect prediction to predict unit test failure is investigated. Data for this purpose was collected from a Continuous Integration development environment. Experiments were performed using semantic features from the source code. As the data was imbalanced with defective samples being in minority different degrees of oversampling were also evaluated. The data collection process revealed that even though several different code commits were available few ever failed a unit test. Difficulties with linking a failure to a specific file were also encountered. The machine learning model used in the project produced poor results when compared against related work, from which it was based on. In F-measure, it on average achieve 53% of the mean performance of state-of-the-art for software defect prediction on bugs in Java source files. Specifically, it would appear that very little information was available for the model to learn defects in files not present in training data.
I denna avhandling undersöks applikationen av prognos för mjukvarudefekter för att förutse enhetstestfel. Data för detta syfte samlades in från en utvecklingsmiljö med kontinuerlig integration. Experimenten utfördes med användning av semantiska särdrag samlade från källkod. Då data var obalanserat med defekta exempel i minoritet evaluerades olika grader av översampling. Datainsamlingsprocessen visade att även om det fanns många kodinlämningar så misslyckades få någonsin ett enhetstest. Svårigheter med att länka testmisslyckanden till en specifik fil påträffades också. Den använda maskininlärningsmodellen uppvisade också dåliga resultat i jämförelse med relaterade värk. Mätt i F-measure uppnåddes i genomsnitt 53% av genomsnittlig prestandan av bästa möjliga prognos av mjukvarudefekter av buggar i Java källkod. Specifikt så framträdde det att väldigt lite information verkar finnas för modellen att lära sig defekter i filer som ej fanns med i träningsdata.
APA, Harvard, Vancouver, ISO, and other styles
6

Lyon, Caroline. "The representation of natural language to enable neural networks to detect syntactic features." Thesis, University of Hertfordshire, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.387160.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Jiang, Ying Alisa. "An analysis of syntactic structures and semantic features of de-constructions in Chinese." HKBU Institutional Repository, 2010. http://repository.hkbu.edu.hk/etd_ra/1208.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Zhang, Mingjian 1958. "Syntactic features of the English interlanguage of learners of English as a second language." Monash University, School of Languages, Cultures and Linguistics, 2003. http://arrow.monash.edu.au/hdl/1959.1/7730.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Vega, Vilanova Jorge [Verfasser], and Susann [Akademischer Betreuer] Fischer. "Catalan Participle Agreement : Syntactic Features and Language Change / Jorge Vega Vilanova ; Betreuer: Susann Fischer." Hamburg : Staats- und Universitätsbibliothek Hamburg, 2020. http://d-nb.info/1214811892/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Seale, Jennifer Marie Chelliah Shobhana Lakshmi. "An analysis of the syntactic and lexical features of an Indian English oral narrative a pear story study /." [Denton, Tex.] : University of North Texas, 2007. http://digital.library.unt.edu/permalink/meta-dc-5123.

Full text
APA, Harvard, Vancouver, ISO, and other styles
11

Seale, Jennifer Marie. "An analysis of the syntactic and lexical features of an Indian English oral narrative: A Pear Story study." Thesis, University of North Texas, 2007. https://digital.library.unt.edu/ark:/67531/metadc5123/.

Full text
Abstract:
This pilot study addresses the distribution of nonstandard syntactic and lexical features in Indian English (IE) across a homogeneous group of highly educated IE speakers. It is found that nonstandard syntactic features of article use, number agreement and assignment of verb argument structure do not display uniform intragroup distribution. Instead, a relationship is found between nonstandard syntactic features and the sociolinguistic variables of lower levels of exposure to and use of English found within the group. While nonstandard syntactic features show unequal distribution, nonstandard lexical features of semantic reassignment, and mass nouns treated as count nouns display a more uniform intragroup distribution.
APA, Harvard, Vancouver, ISO, and other styles
12

Suen, Lee Wa Ann. "The similarities and differences between semantic and syntactic features of Mandarin perfective aspect marker le and Cantonese perfective aspect marker jo." HKBU Institutional Repository, 1997. http://repository.hkbu.edu.hk/etd_ra/120.

Full text
APA, Harvard, Vancouver, ISO, and other styles
13

Packer, Thomas L. "Surface Realization Using a Featurized Syntactic Statistical Language Model." Diss., CLICK HERE for online access, 2006. http://contentdm.lib.byu.edu/ETD/image/etd1195.pdf.

Full text
APA, Harvard, Vancouver, ISO, and other styles
14

Pienaar, Cheryl Leelavathie. "Towards a corpus of Indian South African English (ISAE) : an investigation of lexical and syntactic features in a spoken corpus of contemporary ISAE." Thesis, Rhodes University, 2008. http://hdl.handle.net/10962/d1002640.

Full text
Abstract:
There is consensus among scholars that there is not just one English language but a family of “World Englishes”. The umbrella-term “World Englishes” provides a conceptual framework to accommodate the different varieties of English that have evolved as a result of the linguistic cross-fertilization attendant upon colonization, migration, trade and transplantation of the original “strain” or variety. Various theoretical models have emerged in an attempt to understand and classify the extant and emerging varieties of this global language. The hierarchically based model of English, which classifies world English as “First Language”, “Second Language” and “Foreign Language”, has been challenged by more equitably-conceived models which refer to the emerging varieties as New Englishes. The situation in a country such as multi-lingual South Africa is a complex one: there are 11 official languages, one of which is English. However the English used in South Africa (or “South African English”), is not a homogeneous variety, since its speakers include those for whom it is a first language, those for whom it is an additional language and those for whom it is a replacement language. The Indian population in South Africa are amongst the latter group, as theirs is a case where English has ousted the traditional Indian languages and become a de facto first language, which has retained strong community resonances. This study was undertaken using the methodology of corpus linguistics to initiate the creation of a repository of linguistic evidence (or corpus), of Indian South African English, a sub-variety of South African English (Mesthrie 1992b, 1996, 2002). Although small (approximately 60 000 words), and representing a narrow age band of young adults, the resulting corpus of spoken data confirmed the existence of robust features identified in prior research into the sub-variety. These features include the use of ‘y’all’ as a second person plural pronoun, the use of but in a sentence-final position, and ‘lakker’ /'lVk@/ as a pronunciation variant of ‘lekker’ (meaning ‘good’, ‘nice’ or great’). An examination of lexical frequency lists revealed examples of general South African English such as the colloquially pervasive ‘ja’, ‘bladdy’ (for bloody) and jol(ling) (for partying or enjoying oneself) together with neologisms such as ‘eish’, the latter previously associated with speakers of Black South African English. The frequency lists facilitated cross-corpora comparisons with data from the British National Corpus and the Corpus of London Teenage Language and similarities and differences were noted and discussed. The study also used discourse analysis frameworks to investigate the role of high frequency lexical items such as ‘like’ in the data. In recent times ‘like’ has emerged globally as a lexicalized discourse marker, and its appearance in the corpus of Indian South African English confirms this trend. The corpus built as part of this study is intended as the first building block towards a full corpus of Indian South African English which could serve as a standard for referencing research into the sub-variety. Ultimately, it is argued that the establishment of similar corpora of other known sub-varieties of South African English could contribute towards the creation of a truly representative large corpus of South African English and a more nuanced understanding and definition of this important variety of World English.
APA, Harvard, Vancouver, ISO, and other styles
15

Chang, Che Ho Anthony. "The similarities and differences between semantic and syntactic features of Mandarin imperfective aspect marker zhe(著) and Cantonese imperfective aspect marker jyuh(住)." HKBU Institutional Repository, 1999. http://repository.hkbu.edu.hk/etd_ra/398.

Full text
APA, Harvard, Vancouver, ISO, and other styles
16

Gangireddy, Siva Reddy. "Recurrent neural network language models for automatic speech recognition." Thesis, University of Edinburgh, 2017. http://hdl.handle.net/1842/28990.

Full text
Abstract:
The goal of this thesis is to advance the use of recurrent neural network language models (RNNLMs) for large vocabulary continuous speech recognition (LVCSR). RNNLMs are currently state-of-the-art and shown to consistently reduce the word error rates (WERs) of LVCSR tasks when compared to other language models. In this thesis we propose various advances to RNNLMs. The advances are: improved learning procedures for RNNLMs, enhancing the context, and adaptation of RNNLMs. We learned better parameters by a novel pre-training approach and enhanced the context using prosody and syntactic features. We present a pre-training method for RNNLMs, in which the output weights of a feed-forward neural network language model (NNLM) are shared with the RNNLM. This is accomplished by first fine-tuning the weights of the NNLM, which are then used to initialise the output weights of an RNNLM with the same number of hidden units. To investigate the effectiveness of the proposed pre-training method, we have carried out text-based experiments on the Penn Treebank Wall Street Journal data, and ASR experiments on the TED lectures data. Across the experiments, we observe small but significant improvements in perplexity (PPL) and ASR WER. Next, we present unsupervised adaptation of RNNLMs. We adapted the RNNLMs to a target domain (topic or genre or television programme (show)) at test time using ASR transcripts from first pass recognition. We investigated two approaches to adapt the RNNLMs. In the first approach the forward propagating hidden activations are scaled - learning hidden unit contributions (LHUC). In the second approach we adapt all parameters of RNNLM.We evaluated the adapted RNNLMs by showing the WERs on multi genre broadcast speech data. We observe small (on an average 0.1% absolute) but significant improvements in WER compared to a strong unadapted RNNLM model. Finally, we present the context-enhancement of RNNLMs using prosody and syntactic features. The prosody features were computed from the acoustics of the context words and the syntactic features were from the surface form of the words in the context. We trained the RNNLMs with word duration, pause duration, final phone duration, syllable duration, syllable F0, part-of-speech tag and Combinatory Categorial Grammar (CCG) supertag features. The proposed context-enhanced RNNLMs were evaluated by reporting PPL and WER on two speech recognition tasks, Switchboard and TED lectures. We observed substantial improvements in PPL (5% to 15% relative) and small but significant improvements in WER (0.1% to 0.5% absolute).
APA, Harvard, Vancouver, ISO, and other styles
17

Holmberg, Mattias. "A Construction Grammar Analysis of the expression /on the one hand...on the other hand/." Thesis, Stockholm University, Department of English, 2008. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-8124.

Full text
Abstract:

The expression /on the one hand…on the other hand/ (OH1 OH2) is a fixed linguistic pattern which is used to emphasize the comparison between two possibly complex propositions (henceforth X and Y). The static syntactic form of the pattern and the specific semantic comparison it evokes are strong indicators that it is a construction of the type discussed in the analytical method Construction Grammar (henceforth CxG). Thus, the aim of this essay is to argue that the pattern OH1X OH2Y is a CxG construction with specific syntactic and semantic constraints, and at the same time to give a descriptive account of the features of the construction. The British National Corpus was used to get examples containing the pattern. The syntactic and semantic features of these examples were analysed and the results were compared with how the traditional descriptive grammarians account for the pattern.

APA, Harvard, Vancouver, ISO, and other styles
18

Anderson, Michael Don. "Ellipsis as a Diagnostic Tool of Feature Strength and the Syntactic Structure of Ilocano." Diss., The University of Arizona, 2009. http://hdl.handle.net/10150/195692.

Full text
Abstract:
This dissertation examines Ilocano, an Austronesian Filipino language, within the Minimalist Framework, in an effort to tease apart the general syntactic properties of the language. I show that Ilocano underlying structure can easily be captured within the standard syntactic structures proposed for languages generally (Universal Grammar). In my examination of ellipsis in Ilocano, I concern myself strictly with syntactic and not semantic properties. I show that syntactic feature distribution (e.g. [+FOC], [+NEG], [+DET]) in combination with the two basic operations of the Minimalist Program: FEATURE-CHECKING and MERGE can account for both the underlying structure of Ilocano utterances as well as the word-order at Spell-Out, without making any stipulations not found in languages generally.My research also reveals new insights and corrects existing assumptions about certain previously undiscovered underlying structural properties of Ilocano. I account for the restrictive word ordering and structure found in Ilocano by assigning a universally applicable, non-controversial set of functional and lexical features to morphemes. These features satisfy, individually or collectively, feature-checking requirements in the language, resulting in the attested output of Ilocano. The types of ellipsis considered as a diagnostic toward that end are: NP-ellipsis, Bare Argument Ellipsis/Stripping, Gapping, Sluicing and Psuedogapping. I argue that the primary mechanism which licenses ellipsis in Ilocano is FOCUS-RAISING which allows extraction of remnant material prior to ellipsis of the TP in the case of all verbal-type ellipsis in Ilocano; or the DP in terms of Ilocano NP-ellipsis.
APA, Harvard, Vancouver, ISO, and other styles
19

Goshorn, Rachel Elizabeth. "Syntactical classification of extracted sequential spectral features adapted to priming selected interference cancelers /." Diss., Connect to a 24 p. preview or request complete full text in PDF format. Access restricted to UC campuses, 2005. http://wwwlib.umi.com/cr/ucsd/fullcit?p3170219.

Full text
APA, Harvard, Vancouver, ISO, and other styles
20

Ssempuuma, Jude [Herausgeber]. "Morphological and Syntactic Feature Analysis of Ugandan English : Influence from Luganda, Runyankole-Rukiga, and Acholi-Lango." Frankfurt a.M. : Peter Lang GmbH, Internationaler Verlag der Wissenschaften, 2018. http://d-nb.info/118148801X/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
21

Ssempuuma, Jude [Verfasser], Christiane [Gutachter] Meierkord, and Ulrike [Gutachter] Gut. "Morphological and syntactic feature analysis of Ugandan english : influence from Luganda, Runyankole-Rukiga, and Acholi-Lango / Jude Ssempuuma ; Gutachter: Christiane Meierkord, Ulrike Gut ; Fakultät für Philologie." Bochum : Ruhr-Universität Bochum, 2019. http://d-nb.info/1182682553/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
22

Kim, So-Yun. "Propriétés syntaxico-sémantiques des verbes à complément en -e en coréen." Phd thesis, Université Paris-Est, 2010. http://tel.archives-ouvertes.fr/tel-00626249.

Full text
Abstract:
Cette étude est une classification générale des constructions verbales et une description des propriétés syntaxico-sémantiques des verbes à complément essentiel introduit par la postposition e en coréen. Ce travail a pour modèle théorique le lexique-grammaire, qui a été élaboré par M. Gross (1975), sur la base des principes de Z. S. Harris (1968). Dans cette étude, nous avons examiné les propriétés syntaxico-sémantiques des 3000 verbes demandant ce complément et nous les avons regroupés en 8 classes. Ces propriétés constitueront des informations syntaxiques qui servent à l'analyse de structure d'une phrase et au traitement automatique des textes en coréen.
APA, Harvard, Vancouver, ISO, and other styles
23

Sanchez, Monica Eszter. "Syntactic features in agrammatic production." Thesis, 1996. http://hdl.handle.net/2429/6621.

Full text
Abstract:
This thesis examines the nature of the language deficit called agrammatism, the linguistic syndrome usually associated with Broca's aphasia. I focus on the narratives produced by agrammatics of five different languages, English, Dutch, German, French, and Italian, the transcripts of which are collected in Menn & Obler (1990). My goal is to account for the omission and substitution errors that characterize agrammatic production. Agrammatic utterances with omissions display appropriate word order. Among these are structures that include adverbs, negation, and verb-second matrix clauses. These structures are derived by movement to functional projections. I argue that agrammatic clauses include minimally three functional projections above VP. I propose that the full array of functional projections is present in agrammatic speech. I argue that the most concise account of agrammatic production is one in which Universal Grammar governs agrammatic speech. Although any syntactic category may be omitted, not all categories are omitted with the same frequency. Lexical categories are better retained than functional categories; and nominal categories are better retained than verbal categories. I propose a Principle of Robustness whereby the more Formal features a category is specified for, the more Robust it is. The net result is that the more features a syntactic category is specified for, the more likely it is retrieved. This results in the following Retrieval Hierarchy: N > V, A , D > P, T, K > C, where ">" means "better retained than". In addition to omissions, agrammatic speech includes substitutions: Syntactic substitutions display two striking characteristics. First, substitutions are not cross-categorial. Second, substitutions are subject to the Single Feature Constraint: only one optional Formal feature from agreement (person, number and gender), Case and tense is altered. To derive these characteristics, I argue that the structure of the Lexicon is paradigmatic. Both omissions and substitutions lead me to a discussion of Lexical Insertion, the process by which words are inserted into syntactic structures. I conclude that the agrammatic deficit lies outside the phonological, syntactic and semantic components proper. Instead, omissions and substitutions result from an impairment to the interface mechanisms between the Lexicon, the Syntax and the Phonology.
APA, Harvard, Vancouver, ISO, and other styles
24

Yang, Su-Mei, and 楊素梅. "The Syntactic Features of The Jinwen Shangshu." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/42161090044265113673.

Full text
APA, Harvard, Vancouver, ISO, and other styles
25

Lin, Jia-Hung, and 林家宏. "Semantic Segmentation of Multimedia Data Based on Audiovisual Syntactic Features." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/66192777489109783560.

Full text
Abstract:
碩士
國立成功大學
電機工程學系碩博士班
94
In this thesis, we propose techniques for audio and visual segmentation in syntactic features for multimedia. In audio segmentation, we detect the speaker/scene change boundaries between different speakers/scenes, and then separate the audio segments. In visual segmentation, we propose a detection method of the camera motion. The director uses the camera motion and audio intonation to dramatize movies or teleplays. We analyze these characteristics so that we can know dramatization of video. In visual segmentation, this algorithm uses the motion vectors to estimate the parameters of camera motion. We built the motion vector fields and analyze the fields with camera motion characteristics. We propose two methods which are divergence method and Infinite Impulse Response (IIR) method to estimate the camera motion. We use the divergence theory to estimate the parameter of zoom scenes, and also extract the parameters of pan and tilt by modified equations. In audio segmentation, we use a feature vector of Mel-Frequency Cepstrum Coefficients (MFCC) and Bayesian Information Criterion (BIC) with windows of fixed size to detect the changing speaker. We use the fixed size window of BIC to speed up calculations. A maximum filter is adopted to remove fault segments caused by audio noise, and a post process with non-fixed size window is used to reduce false alarms.
APA, Harvard, Vancouver, ISO, and other styles
26

Khanh, Pham Thuy Quoc, and 范瑞國卿. "Exploring Syntactic and Semantic Features of Parallelism in Sayings of Happiness." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/a9hwmc.

Full text
Abstract:
碩士
南臺科技大學
應用英語系
107
Abstract This research was focused on the syntactic and semantic features of parallel sayings about happiness in English. Each aspect would explore what revealed in terms of lexical words, syntactic structure, and sematic expressions. The data to be examined was essentially a mini corpus composed of more than150 samples which listed in the appendix part. Most of the examples were available in online websites, books, literature, magazine, etc... These findings emerging were, to the effect, (a) Most sayings were written in a well-organized syntactic structure, no matter what lexica adopted, to show deep semantic expressions. (b) Figurative languages were crafted for not only the ambiguous purposes of literary expressions, but also put the attempts to combine idiomatic and cultural backgrounds in writing. What was implied further in this study would be twofold in a nutshell. Theoretically, linguistic structure conveyed more diversified in appearance than in essence. Empirically, however, the findings of this study contributed to the interesting and innovative field for English teaching and learning in pedagogy and linguistic analysis in cultures. Keywords: Syntactic features, Semantic features, Parallelism, sayings
APA, Harvard, Vancouver, ISO, and other styles
27

Esseesy, Mohssen. "Morphological and syntactic features of Arabic numerals as evidence of their diachronic evolution." 2000. http://catalog.hathitrust.org/api/volumes/oclc/60361483.html.

Full text
APA, Harvard, Vancouver, ISO, and other styles
28

Chang, Li-ping, and 張莉萍. "Salient Linguistic Features of Chinese Lexical Items and Syntactic Structures Across the CEFR Levels." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/w6pnj7.

Full text
Abstract:
博士
國立臺灣師範大學
華語文教學系
101
The study aims to explore the salient linguistic features of Chinese lexical items and syntactic structures across the CEFR levels. The research based on corpus linguistics and contrastive interlanguage analysis method, including comparing learner corpus (i.e. L2 corpus) and native-speaker corpus (i.e. L1 corpus), as well as different native language background L2 sub-corpora. The L2 corpus which consists of more than 1.14 million Chinese words from novice proficiency to advanced learners’ texts is mainly from the computer-based writing Test of Chinese as a Foreign Language (TOCFL). L1 corpus is from the Academia Sinica balanced corpus. In lexical part, we try to seek possible criterial features (Hawkins &; Buttery, 2009) from the aspects of vocabulary size, high frequency information, error rate of word use and keyword-keyness analysis. Some salient features are discovered. For examples, English L2 shows the unusual high frequency on pronouns and unusual low frequency on sentential final particles in Chinese writing. And Japanese as well as Korean L2s tend to overuse the post form ‘de hua’ when expressing the If sentence, and overuse ‘suoyi’ when expressing the cause-result relation. The dissertation also provides possible explanations for the findings from the aspects of L2s’ native language typology, linguistic structure, syntactic category and culture. In structure part, we observe and discuss the syntactic features from the aspects of nominal phrase structure, frequent patterns and word order expression by different proficiency learners. We find the A2 level learners have acquired the canonical Chinese word order, for example, SVO. The specific structures like Ba sentence, Bei sentence, and Lian sentence are proved as the criterial features to discriminate the different proficiency levels. And the features of frequency of using relative clause and the expansion ability of nominal phrase are also discriminative for Chinese learners.
APA, Harvard, Vancouver, ISO, and other styles
29

"Apply syntactic features in a maximum entropy framework for English and Chinese reading comprehension." Thesis, 2008. http://library.cuhk.edu.hk/record=b6074652.

Full text
Abstract:
Automatic reading comprehension (RC) systems integrate various kinds of natural language processing (NLP) technologies to analyze a given passage and generate or extract answers in response to questions about the passage. Previous work applied a lot of NLP technologies including shallow syntactic analyses (e.g. base noun phrases), semantic analyses (e.g. named entities) and discourse analyses (e.g. pronoun referents) in the bag-of-words (BOW) matching approach. This thesis proposes a novel RC approach that integrates a set of NLP technologies in a maximum entropy (ME) framework to estimate candidate answer sentences' probabilities being answers. In contrast to previous RC approaches, which are in English-only, the presented RC approach is the first one for both English and Chinese, the two languages used by most people in the world. In order to support the evaluation of the bilingual RC systems, a parallel English and Chinese corpus is also designed and developed. Annotations deemed relevant to the RC task are also included in the corpus. In addition, useful NLP technologies are explored from a new perspective---referring the pedagogical guidelines of humans, reading skills are summarized and mapped to various NLP technologies. Practical NLP technologies, categorized as shallow syntactic analyses (i.e. part-of-speech tags, voices and tenses) and deep syntactic analyses (i.e. syntactic parse trees and dependency parse trees) are then selected for integration. The proposed approach is evaluated on an English corpus, namely Remedia and our bilingual corpus. The experimental results show that our approach significantly improves the RC results on both English and Chinese corpora.
Xu, Kui.
Adviser: Helen Mei-Ling Meng.
Source: Dissertation Abstracts International, Volume: 70-06, Section: B, page: 3618.
Thesis (Ph.D.)--Chinese University of Hong Kong, 2008.
Includes bibliographical references (leaves 132-141).
Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Electronic reproduction. [Ann Arbor, MI] : ProQuest Information and Learning, [200-] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Abstracts in English and Chinese.
School code: 1307.
APA, Harvard, Vancouver, ISO, and other styles
30

"Sensitivity to differences between speech and writing: Hong Kong students' use of syntactic features in English." Thesis, 2011. http://library.cuhk.edu.hk/record=b6075097.

Full text
Abstract:
Analysis on most heavily overused and underused syntactic features shows that, when compared with native speakers, Hong Kong students favour the use of present tense, tentative style, simple noun phrase structure and explicit clause-relation marking in oral presentations. They prefer using present tense constructions, adopting pronouns for nominal functions and using an interactive tone in written essays. In both the learner speech data and the learner writing data, quite a number of the overused and underused syntactic features are closely related to the differences between speech and writing found in Biber (1988), hinting at Hong Kong students' inadequate sensitivity to mode differences in English.
Analysis on textual dimensions suggests that Hong Kong students' oral presentations differ from comparable native-speaker oral production by being more written-like in terms of the use of explicit/situation-dependent reference and the inclusion of abstract/non-abstract information. Hong Kong students' written essays deviate from comparable native-speaker written production by being more spoken-like in terms of the involved/informational focus and the use of explicit/situation-dependent reference. Moreover, both the learner speech data and the learner writing data bear some resemblance to common native-speaker genres in the opposite mode.
The present study clearly demonstrates that Hong Kong students have limited sensitivity to the conventional usage of syntactic features in spoken and written English. The teaching profession should help the students develop better sociolinguistic competence when teaching grammar, speaking and writing. Future research on second language acquisition should also focus more on the learners' sociolinguistic development so that second language learners' communicative ability can be better understood.
This thesis investigates Hong Kong secondary school students' sensitivity to differences between spoken and written English by examining their use of 67 syntactic features. A model specifying how native speakers vary their use of syntactic features across speech and writing, Biber (1988), has been adopted as the theoretical framework. Fifty-two oral presentations delivered by Form 6 students and 52 public examination essays written by Form 7 students, both of which total about 10,000 words, have been analysed. The students' performance is compared with native speakers' performance in similar spoken and written production on the level of syntactic features and the level of textual dimensions.
Chui, Sze Yan.
"December 2010."
Adviser: Gerald Nelson.
Source: Dissertation Abstracts International, Volume: 73-04, Section: A, page: .
Thesis (Ph.D.)--Chinese University of Hong Kong, 2011.
Includes bibliographical references (leaves 255-266).
Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Electronic reproduction. [Ann Arbor, MI] : ProQuest Information and Learning, [201-] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Abstract also in Chinese; some appendixes also in Chinese.
APA, Harvard, Vancouver, ISO, and other styles
31

"An analysis of the syntactic and lexical features of an Indian English oral narrative: A Pear Story study." UNIVERSITY OF NORTH TEXAS, 2008. http://pqdtopen.proquest.com/#viewpdf?dispub=1452028.

Full text
APA, Harvard, Vancouver, ISO, and other styles
32

Crossley, Susan L. "The syntactic features of South African Indian English among students in Natal, with regard to use and attitudes towards usage." Thesis, 1987. http://hdl.handle.net/10413/9684.

Full text
Abstract:
Two tests were developed and administered to 122 South African students of Indian descent and 70 South African students of European descent. One test elicited use of certain syntactic constructions through requesting subjects to make specified grammatical changes to verbally presented sentences. The other test elicited attitudes regarding acceptability and beliefs about own and others' use of nonstandard syntactic forms. In both cases, test items were based on non-standard syntactic forms reported to occur in the speech of Indian South Africans. Results were analysed statistically and on the basis of significant group differences, fourteen syntactic features of South African Indian English were identified. These results, as well as those concerning language attitudes and beliefs, are discussed in terms of the literature and implications for clinical application and further research. Sex differences in use and attitudes were also investigated but were found to be of little significance.
Thesis (M.Speech Pathology)-University of Durban-Westville, 1987.
APA, Harvard, Vancouver, ISO, and other styles
33

Válková, Veronika. "Překlad povídky J. D. Salingera Dokonalý den pro banánové rybičky. Lingvostylistická analýza." Master's thesis, 2019. http://www.nusl.cz/ntk/nusl-404045.

Full text
Abstract:
This thesis aims to analyse several translations of J.D. Salinger's short story A Perfect Day for Bananafish: namely two translations and two edited versions of the latter one. The contrastive view is used to discern the main differences in meaning of the original and the translated texts. There are several points of view adopted to recognise the differences. On the syntactic level, the contrast between the texts was found to be present in the treatment of the author's style, predominantly in the narrative perspective. Not employing nominal structures a great deal, the author's style was not lost in the translation process. Comparing the translated texts, a tendency to adhere to the syntactic surface structure of the original text was found in the less recent versions. Another level of analysis explored the temporal relations of the texts, with the original text relying mostly on structures expressing sequences of events and also simultaneity of events, but not the perfective aspect to a great degree. These time relations are also found to be communicated in the translated texts successfully. It is not only the narrator that is discussed in this thesis, as a great portion of the short story consist of dialogues. The analysis aimed, using certain excerpts of the story, to find to what degree...
APA, Harvard, Vancouver, ISO, and other styles
34

Chan, Chun-ting, and 詹鈞婷. "The Relation between Chinese and English Syntactic Features amd the Changing Ear-voice Span: A Study on Simultaneous Interpreting from English to Chinese." Thesis, 2015. http://ndltd.ncl.edu.tw/handle/98231339923716688748.

Full text
Abstract:
碩士
國立彰化師範大學
翻譯研究所
103
Interpreting tactics play a crucial role in interpreting training programs. However, most previous research focused on the relation between interpreting strategies and the quality of renditions, rather than on the relation between interpreting tactics and the syntactic features. The current study highlights one of the most widely used tactics, which is changing ear-voice span. The purpose of this study is to observe the relation between Chinese and English syntactic features and the changing ear-voice span. Seven GITI interpreting trainees participated in this experiment. All of them were Chinese native speakers, and their B language was English. The experiment consisted of two speeches. A complete speech was used to observe whether all interpreters utilized the tactic of changing ear-voice span, and the other speech was used to further illustrate the relation between Chinese and English syntactic features and the changing ear-voice span. Results of the present study shows that interpreters often need to adjust their ear-voice span to generate the Chinese renditions when the Chinese and English syntactic features are divergent. Furthermore, this study also concludes ten strategies for using the tactic of changing ear-voice span.
APA, Harvard, Vancouver, ISO, and other styles
35

Huang, Shih-Yuan, and 黃世源. "Applying Syntactic Analysis to News Feature Extraction." Thesis, 2012. http://ndltd.ncl.edu.tw/handle/29602483145200280961.

Full text
Abstract:
碩士
國立雲林科技大學
資訊管理系碩士班
100
In Chinese Natural Language Processing, Chinese Segmentation has become an important and basic issue. Many researches on text segmentation use corpus segmentation method in experiments and improvements, but still there are issues that need to be solve, for example, 1) the expensive cost of corpus maintenance, 2) unable to capture unknown words, 3) improve performance, etc. On the other hand, previous document retrieval researches could shorten the original text by automatic summarization or use DTD feature in XML to define keywords. Whether the content in documents is able to show terse words of concept features, assisting readers to remove long messages could simply enhance the timeliness of reading. In our research we proposed a new method by artificial observation, selection, and induction through syntax implicit rules in news articles. By implementing Syntactic Analysis and cross-match sentences, we could extract important keywords including names, events, places, and time. Our rules follow two criteria: 1) news sentences corresponding to rules to extract feature terms, 2) rules corresponding to news sentences to confirm the feature terms are precisely extracted. In our overall system evaluation, precision performed 76%, showed the highest recall of 80%, and F-measure presented 78%, sufficiently show our extraction feature words has a great performance.
APA, Harvard, Vancouver, ISO, and other styles
36

Loewen, Gina. "The syntactic structure of noun phrases in Indonesian." 2011. http://hdl.handle.net/1993/4876.

Full text
Abstract:
Recent developments in linguistic theory carried out within the Minimalist Program (Chomsky 1995; Adger 2003;) provide a functional and concrete framework for an analysis of noun phrases in the Indonesian language, a Western-Malayo Polynesian sub-branch of the Austronesian language family. An analysis of Indonesian noun phrase structure within this framework demonstrates that the head noun occurs in a base-generated position, at the bottom of a DP, while pre- and post-nominal modifiers are contained within a number of additional projections that merge above the head noun. In this thesis, the proposal is made for a relatively unrestricted adjunction analysis, whereby head adjunction via Merge allows for the direct expansion of the head N at various levels of the Indonesian DP. Evidence is presented to show that the adjoined status of attributive nouns and adjectives, a plural feature [PL], and the feature [DEF] generates a complex hierarchical structure in which there is no predefined order between a specifier or complement and the head noun. In addition, it is argued that bare nouns are neutral with respect to number and, given that number-marking, possession and (in)definiteness are optional, all projections that merge above the head N are optional and context is needed to accurately interpret an Indonesian bare noun.
APA, Harvard, Vancouver, ISO, and other styles
37

Hadouche, Fadila. "Annotation syntaxico-sémantique des actants en corpus spécialisé." Thèse, 2010. http://hdl.handle.net/1866/5032.

Full text
Abstract:
L’annotation en rôles sémantiques est une tâche qui permet d’attribuer des étiquettes de rôles telles que Agent, Patient, Instrument, Lieu, Destination etc. aux différents participants actants ou circonstants (arguments ou adjoints) d’une lexie prédicative. Cette tâche nécessite des ressources lexicales riches ou des corpus importants contenant des phrases annotées manuellement par des linguistes sur lesquels peuvent s’appuyer certaines approches d’automatisation (statistiques ou apprentissage machine). Les travaux antérieurs dans ce domaine ont porté essentiellement sur la langue anglaise qui dispose de ressources riches, telles que PropBank, VerbNet et FrameNet, qui ont servi à alimenter les systèmes d’annotation automatisés. L’annotation dans d’autres langues, pour lesquelles on ne dispose pas d’un corpus annoté manuellement, repose souvent sur le FrameNet anglais. Une ressource telle que FrameNet de l’anglais est plus que nécessaire pour les systèmes d’annotation automatisé et l’annotation manuelle de milliers de phrases par des linguistes est une tâche fastidieuse et exigeante en temps. Nous avons proposé dans cette thèse un système automatique pour aider les linguistes dans cette tâche qui pourraient alors se limiter à la validation des annotations proposées par le système. Dans notre travail, nous ne considérons que les verbes qui sont plus susceptibles que les noms d’être accompagnés par des actants réalisés dans les phrases. Ces verbes concernent les termes de spécialité d’informatique et d’Internet (ex. accéder, configurer, naviguer, télécharger) dont la structure actancielle est enrichie manuellement par des rôles sémantiques. La structure actancielle des lexies verbales est décrite selon les principes de la Lexicologie Explicative et Combinatoire, LEC de Mel’čuk et fait appel partiellement (en ce qui concerne les rôles sémantiques) à la notion de Frame Element tel que décrit dans la théorie Frame Semantics (FS) de Fillmore. Ces deux théories ont ceci de commun qu’elles mènent toutes les deux à la construction de dictionnaires différents de ceux issus des approches traditionnelles. Les lexies verbales d’informatique et d’Internet qui ont été annotées manuellement dans plusieurs contextes constituent notre corpus spécialisé. Notre système qui attribue automatiquement des rôles sémantiques aux actants est basé sur des règles ou classificateurs entraînés sur plus de 2300 contextes. Nous sommes limités à une liste de rôles restreinte car certains rôles dans notre corpus n’ont pas assez d’exemples annotés manuellement. Dans notre système, nous n’avons traité que les rôles Patient, Agent et Destination dont le nombre d’exemple est supérieur à 300. Nous avons crée une classe que nous avons nommé Autre où nous avons rassemblé les autres rôles dont le nombre d’exemples annotés est inférieur à 100. Nous avons subdivisé la tâche d’annotation en sous-tâches : identifier les participants actants et circonstants et attribuer des rôles sémantiques uniquement aux actants qui contribuent au sens de la lexie verbale. Nous avons soumis les phrases de notre corpus à l’analyseur syntaxique Syntex afin d’extraire les informations syntaxiques qui décrivent les différents participants d’une lexie verbale dans une phrase. Ces informations ont servi de traits (features) dans notre modèle d’apprentissage. Nous avons proposé deux techniques pour l’identification des participants : une technique à base de règles où nous avons extrait une trentaine de règles et une autre technique basée sur l’apprentissage machine. Ces mêmes techniques ont été utilisées pour la tâche de distinguer les actants des circonstants. Nous avons proposé pour la tâche d’attribuer des rôles sémantiques aux actants, une méthode de partitionnement (clustering) semi supervisé des instances que nous avons comparée à la méthode de classification de rôles sémantiques. Nous avons utilisé CHAMÉLÉON, un algorithme hiérarchique ascendant.
Semantic role annotation is a process that aims to assign labels such as Agent, Patient, Instrument, Location, etc. to actants or circumstants (also called arguments or adjuncts) of predicative lexical units. This process often requires the use of rich lexical resources or corpora in which sentences are annotated manually by linguists. The automatic approaches (statistical or machine learning) are based on corpora. Previous work was performed for the most part in English which has rich resources, such as PropBank, VerbNet and FrameNet. These resources were used to serve the automated annotation systems. This type of annotation in other languages for which no corpora of annotated sentences are available often use FrameNet by projection. Although a resource such as FrameNet is necessary for the automated annotation systems and the manual annotation by linguists of a large number of sentences is a tedious and time consuming work. We have proposed an automated system to help linguists in this task so that they have only to validate annotations proposed. Our work focuses on verbs that are more likely than other predicative units (adjectives and nouns) to be accompanied by actants realized in sentences. These verbs are specialized terms of the computer science and Internet domains (ie. access, configure, browse, download) whose actantial structures have been annotated manually with semantic roles. The actantial structure is based on principles of Explanatory and Combinatory Lexicology, LEC of Mel’čuk and appeal in part (with regard to semantic roles) to the notion of Frame Element as described in the theory of frame semantics (FS) of Fillmore. What these two theories have in common is that they lead to the construction of dictionaries different from those resulting from the traditional theories. These manually annotated verbal units in several contexts constitute the specialized corpus that our work will use. Our system designed to assign automatically semantic roles to actants is based on rules and classifiers trained on more than 2300 contexts. We are limited to a restricted list of roles for certain roles in our corpus have not enough examples manually annotated. In our system, we addressed the roles Patient, Agent and destination that the number of examples is greater than 300. We have created a class that we called Autre which we bring to gether the other roles that the number of annotated examples is less than 100. We subdivided the annotation task in the identification of participant actants and circumstants and the assignment of semantic roles to actants that contribute to the sense of the verbal lexical unit. We parsed, with Syntex, the sentences of the corpus to extract syntactic informations that describe the participants of the verbal lexical unit in the sentence. These informations are used as features in our learning model. We have proposed two techniques for the task of participant detection: the technique based in rules and machine learning. These same techniques are used for the task of classification of these participants into actants and circumstants. We proposed to the task of assigning semantic roles to the actants, a partitioning method (clustering) semi supervised of instances that we have compared to the method of semantic role classification. We used CHAMELEON, an ascending hierarchical algorithm.
APA, Harvard, Vancouver, ISO, and other styles
38

PLECHINGEROVÁ, Jitka. "Výskyt reprodukcí cizí a vlastní řeči v běžné komunikaci a mluvenostní prostředky v nich užívané." Master's thesis, 2014. http://www.nusl.cz/ntk/nusl-174519.

Full text
Abstract:
This thesis aims to map the occurrence of reproduction of own and other people's language in everyday communication and focus on characteristic features of spokenness. After the first part, which will study this issue in theoretical level, proceed to the analysis. Everything will be completed by pondering on the research results and their implications. The material from which the work is based, consists monologues and dialogues of 65 people of different ages, gender and social inclusion. By these criteria the material is also categorized and analyzed in the context of reproduction and of characteristic features of spokenness in all layers of simple communicative style. All the material isn't older than two years and using by a recording equipment is maintained in spoken and subsequently also in written form.
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography