Academic literature on the topic 'Corpora analysis'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Corpora analysis.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Corpora analysis"

1

Park, Chanjun, Midan Shim, Sugyeong Eo, et al. "Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWC." Applied Sciences 12, no. 11 (2022): 5545. http://dx.doi.org/10.3390/app12115545.

Full text
Abstract:
The machine translation system aims to translate source language into target language. Recent studies on MT systems mainly focus on neural machine translation. One factor that significantly affects the performance of NMT is the availability of high-quality parallel corpora. However, high-quality parallel corpora concerning Korean are relatively scarce compared to those associated with other high-resource languages, such as German or Italian. To address this problem, AI Hub recently released seven types of parallel corpora for Korean. In this study, we conduct an in-depth verification of the quality of corresponding parallel corpora through Linguistic Inquiry and Word Count (LIWC) and several relevant experiments. LIWC is a word-counting software program that can analyze corpora in multiple ways and extract linguistic features as a dictionary base. To the best of our knowledge, this study is the first to use LIWC to analyze parallel corpora in the field of NMT. Our findings suggest the direction of further research toward obtaining the improved quality parallel corpora through our correlation analysis in LIWC and NMT performance.
APA, Harvard, Vancouver, ISO, and other styles
2

Orzigul Ablakulova. "Enhancing English Language Teaching through Corpora Analysis." Texas Journal of Philology, Culture and History 28 (March 6, 2024): 24–26. http://dx.doi.org/10.62480/tjpch.2024.vol28.pp24-26.

Full text
Abstract:
This article discusses the integration of corpora analysis into English language teaching. It highlights the benefits, methodologies, and potential challenges associated with incorporating corpora analysis in the classroom. The author explains that corpora, which are large collections of written and spoken texts, provide teachers and learners with authentic language data to facilitate language learning and teaching. The article presents various methods for implementing corpora analysis, including selecting suitable corpora, introducing learners to corpora tools and software, and incorporating corpus-based activities into the curriculum
APA, Harvard, Vancouver, ISO, and other styles
3

Juhary, Jowati, Erda Wati Bakar, Mardziah Shamsudin, and Asniah Alias. "Understanding Malay Corpora: A Content Analysis of 15 Malay Corpora." JOURNAL OF ADVANCES IN LINGUISTICS 12 (October 8, 2021): 18–26. http://dx.doi.org/10.24297/jal.v12i.9122.

Full text
Abstract:
Corpus research becomes an important area of research of late, especially in Malaysia and for the national language, Malay language. A corpus includes texts and transcriptions of speeches for variety of situations. For this short paper, the focus is on Malay language, which is the national and official language of Malaysia. The purposes of this paper are to identify features and types of Malay Corpora and to determine the needs for a military biased Malay Corpus. In so doing, as a short paper, the methodology involves only content analysis of relevant documents on the development of Malay language corpora. Preliminary findings suggest that there are at least 15 Malay corpora in existence, and that some of the features in these corpora overlap. Further, the researchers argue for the need for a Malay Corpus for Military Operations since the existing corpora do not fully cater for this type of corpus.
APA, Harvard, Vancouver, ISO, and other styles
4

Adolphs, Svenja, Dawn Knight, and Ronald Carter. "Capturing context for heterogeneous corpus analysis." International Journal of Corpus Linguistics 16, no. 3 (2011): 305–24. http://dx.doi.org/10.1075/ijcl.16.3.02ado.

Full text
Abstract:
Heterogeneous corpora are emergent multi-modal datasets which comprise a variety of different records of everyday communication, from SMS/MMS messages to interactions in virtual environments, and from GPS data to phone and video calls. By tracking a person’s specific (inter)actions over time and place, the analysis of such “ubiquitous” corpora enables more detailed investigations of the interface between different communicative modes. This paper outlines some of the ways in which multi-modal, heterogeneous corpora can be utilised in corpus-based analyses of language-in-use and how we can construct richer descriptions of language use in relation to context. The paper further illustrates how the compilation of such corpora may enable us to extrapolate further information about communication across different speakers, media and environments, helping to generate useful insights into the extent to which everyday language and communicative choices are determined by different spatial, temporal and social contexts.
APA, Harvard, Vancouver, ISO, and other styles
5

Van Thin, Dang, Ngan Luu-Thuy Nguyen, Tri Minh Truong, Lac Si Le, and Duy Tin Vo. "Two New Large Corpora for Vietnamese Aspect-based Sentiment Analysis at Sentence Level." ACM Transactions on Asian and Low-Resource Language Information Processing 20, no. 4 (2021): 1–22. http://dx.doi.org/10.1145/3446678.

Full text
Abstract:
Aspect-based sentiment analysis has been studied in both research and industrial communities over recent years. For the low-resource languages, the standard benchmark corpora play an important role in the development of methods. In this article, we introduce two benchmark corpora with the largest sizes at sentence-level for two tasks: Aspect Category Detection and Aspect Polarity Classification in Vietnamese. Our corpora are annotated with high inter-annotator agreements for the restaurant and hotel domains. The release of our corpora would push forward the low-resource language processing community. In addition, we deploy and compare the effectiveness of supervised learning methods with a single and multi-task approach based on deep learning architectures. Experimental results on our corpora show that the multi-task approach based on BERT architecture outperforms the neural network architectures and the single approach. Our corpora and source code are published on this footnoted site. 1
APA, Harvard, Vancouver, ISO, and other styles
6

Hodková, Kateřina. "Les relations sémantiques au carrefour des champs conceptuels du droit." Studia Romanistica 22, no. 1 (2022): 57–71. http://dx.doi.org/10.15452/sr.2022.22.0004.

Full text
Abstract:
The present study concerns the analysis of semantic relationships that exist between legal concepts in Czech and French law. The study combines textual approach, which is necessary for identification of relationships, and the approach of constructing conceptual fields by applying the theory of semic analysis, which help to distinguish terminological and conceptual units from other linguistic units in the texts. Two corpora of legal texts serve as source of legal concepts. These corpora concern the same thematic domain and were established for the purpose of this study. After the theorical and methodological delimitations of the key notions (the definition of concept and term, conceptual field, semic analysis, content of corpora), the study proceeds to a detailed description of linguistic relations within the corpora. This paper focuse on semantic relationships and analyses the following ones: synonymy, opposition (antonymy and contrastivity) and hierarchical relationships (hyperonymy, meronymy and hierarchy of conceptual fields). The analysis concerning two languages and two legal systems enables to compare the data related to the given corpora. For each relationship this study offers a short explanation of the nature of the relationship, its frequency in the two corpora, examples borrowed from the corpora and, if present, the description of other phenomena encountered during the research. These phenomena include, among other things, different types of synonymy, the absence of hyperonyme or holonyme in some hierarchical structures or different types of meronymy.
APA, Harvard, Vancouver, ISO, and other styles
7

Ledinek, Nina. "Skladenjska analiza slovenščine in slovenski jezikoslovno označeni korpusi." Jezik in slovstvo 63, no. 2-3 (2024): 103–16. http://dx.doi.org/10.4312/jis.63.2-3.103-116.

Full text
Abstract:
The article deals with the possibilities of using linguistically annotated corpora of Slovenian for syntactic analyses. Due to the inadequately developed Slovenian language infrastructure – at least eight syntactically annotated corpora of Slovenian are available to users, but due to their small size they only allow a limited scope of syntactic analysis – there is a small number of systematic and comprehensive corpus-based studies on Slovenian syntax, most of which rely on the analysis of morphosyntactically annotated corpora of Slovenian.
APA, Harvard, Vancouver, ISO, and other styles
8

Beigman Klebanov, Beata, Chaitanya Ramineni, David Kaufer, Paul Yeoh, and Suguru Ishizaki. "Advancing the validity argument for standardized writing tests using quantitative rhetorical analysis." Language Testing 36, no. 1 (2017): 125–44. http://dx.doi.org/10.1177/0265532217740752.

Full text
Abstract:
Essay writing is a common type of constructed-response task used frequently in standardized writing assessments. However, the impromptu timed nature of the essay writing tests has drawn increasing criticism for the lack of authenticity for real-world writing in classroom and workplace settings. The goal of this paper is to contribute evidence to a validity argument for standardized writing tests. Using measurements of distances between rhetorical profiles in the corpora of interest, we examined connections between argumentative writing on standardized assessments and in external writing situations; namely, opinionated writing in academic and real-life settings. The results show that test corpora, focusing on argumentation in two standardized tests, are rhetorically similar to academic argumentative writing in a graduate-school setting, and about as similar as a corpus of civic writing in the same genre. The proximity between the test corpora and corpora representing external criteria of interest support the assessment use argument. The argumentative writing skills employed on the test are similar to the skills employed in academic and civic settings, despite the differences in the nature of the settings under which the writing samples for these different corpora are produced.
APA, Harvard, Vancouver, ISO, and other styles
9

Newman, John. "Corpora and cognitive linguistics." Revista Brasileira de Linguística Aplicada 11, no. 2 (2011): 521–59. http://dx.doi.org/10.1590/s1984-63982011000200010.

Full text
Abstract:
Corpora are a natural source of data for cognitive linguists, since corpora, more than any other source of data, reflect "usage" - a notion which is often claimed to be of critical importance to the field of cognitive linguistics. Corpora are relevant to all the main topics of interest in cognitive linguistics: metaphor, polysemy, synonymy, prototypes, and constructional analysis. I consider each of these topics in turn and offer suggestions about which methods of analysis can be profitably used with available corpora to explore these topics further. In addition, I consider how the design and content of currently used corpora need to be rethought if corpora are to provide all the types of usage data that cognitive linguists require.
APA, Harvard, Vancouver, ISO, and other styles
10

Crossley, Scott, and Max M. Louwerse. "Multi-dimensional register classification using bigrams." International Journal of Corpus Linguistics 12, no. 4 (2007): 453–78. http://dx.doi.org/10.1075/ijcl.12.4.02cro.

Full text
Abstract:
A corpus linguistic analysis investigated register classification using frequency of bigrams in nine spoken and two written corpora. Four dimensions emerged from a factor analysis using bigram frequencies shared across corpora: (1) Scripted vs. Unscripted Discourse, (2) Deliberate vs. Unplanned Discourse, (3) Spatial vs. Non-Spatial Discourse, and (4) Directional vs. Non-Directional Discourse. These findings were replicated in a second analysis. Both analyses demonstrate the strength of bigrams for classifying spoken and written registers, especially in locating distinct collocations among spoken corpora, as well as revealing syntactic and discourse features through a data-driven approach.
APA, Harvard, Vancouver, ISO, and other styles
More sources

Dissertations / Theses on the topic "Corpora analysis"

1

Panteli, Maria. "Computational analysis of world music corpora." Thesis, Queen Mary, University of London, 2018. http://qmro.qmul.ac.uk/xmlui/handle/123456789/36696.

Full text
Abstract:
The comparison of world music cultures has been considered in musicological research since the end of the 19th century. Traditional methods from the field of comparative musicology typically involve the process of manual music annotation. While this provides expert knowledge, the manual input is timeconsuming and limits the potential for large-scale research. This thesis considers computational methods for the analysis and comparison of world music cultures. In particular, Music Information Retrieval (MIR) tools are developed for processing sound recordings, and data mining methods are considered to study similarity relationships in world music corpora. MIR tools have been widely used for the study of (mainly) Western music. The first part of this thesis focuses on assessing the suitability of audio descriptors for the study of similarity in world music corpora. An evaluation strategy is designed to capture challenges in the automatic processing of world music recordings and different state-of-the-art descriptors are assessed. Following this evaluation, three approaches to audio feature extraction are considered, each addressing a different research question. First, a study of singing style similarity is presented. Singing is one of the most common forms of musical expression and it has played an important role in the oral transmission of world music. Hand-designed pitch descriptors are used to model aspects of the singing voice and clustering methods reveal singing style similarities in world music. Second, a study on music dissimilarity is performed. While musical exchange is evident in the history of world music it might be possible that some music cultures have resisted external musical influence. Low-level audio features are combined with machine learning methods to find music examples that stand out in a world music corpus, and geographical patterns are examined. The last study models music similarity using descriptors learned automatically with deep neural networks. It focuses on identifying music examples that appear to be similar in their audio content but share no (obvious) geographical or cultural links in their metadata. Unexpected similarities modelled in this way uncover possible hidden links between world music cultures. This research investigates whether automatic computational analysis can uncover meaningful similarities between recordings of world music. Applications derive musicological insights from one of the largest world music corpora studied so far. Computational analysis as proposed in this thesis advances the state-of-the-art in the study of world music and expands the knowledge and understanding of musical exchange in the world.
APA, Harvard, Vancouver, ISO, and other styles
2

Sudhahar, Saatviga. "Automated analysis of narrative text using network analysis in large corpora." Thesis, University of Bristol, 2015. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.685924.

Full text
Abstract:
In recent years there has been an increased interest in computational social sciences, digital humanities and political sciences to perform automated quantitative narrative analysis (QNA) of text in large scale, by studying actors, actions and relations in a given narration. Social scientists have always relied on news media content to study opinion biases and extraction of socio-historical relations and events. Yet in order to perform analysis they had to face labour-intensive coding where basic narrative information was manually extracted from text and annotated by hand. This PhD thesis addresses this problem using a big-data approach based on automated information extraction using state of the art Natural Language Processing, Text mining and Artificial Intelligence tools. A text corpus is transformed into a semantic network formed of subject-verb-object (SVO) triplets, and the resulting network is analysed drawing from various theories and techniques such as graph partitioning, network centrality, assortativity, hierarchy and structural balance. Furthermore we study the position of actors in the network of actors and actions; generate scatter plots describing the subject/object bias, positive/ negative bias of each actor; and investigate the types of actions each actor is most associated with. Apart from QNA, SVO triplets extracted from text can also be used to summarize documents. Our findings are demonstrated on two different corpora containing English news articles about US elections and Crime and a third corpus containing ancieilt folklore stories from the Gutenberg Project. Amongst potentially interesting findings we found the 2012 US elections campaign was very much focused on 'Economy' and 'Rights'; and overall, the media reported more frequently positive statements for the Democrats than the Republicans. In the Crime study we found that the network identified men as frequent perpetrators, and women and children as victims, of violent crime. A network approach to text based on semantic graphs is a promising approach to analyse large corpora of texts and, by retaining relational information pertaining to actors and objects, this approach can reveal latent and hidden patterns, and therefore has relevance in the social sciences and humanities.
APA, Harvard, Vancouver, ISO, and other styles
3

Kura, Deekshit. "Categorization of Large Corpora of Malicious Software." ScholarWorks@UNO, 2013. http://scholarworks.uno.edu/td/1746.

Full text
Abstract:
Malware is computer software written by someone with mischievous or, more usually, malicious and/or criminal intent and specifically designed to damage data, hosts or networks. The variety of malware is increasing proportionally with the increase in computers and we are not aware of newly emerging malware. Tools are needed to categorize families of malware, so that analysts can compare new malware samples to ones that have been previously analyzed and determine steps to detect and prevent malware infections. In this thesis, I developed a technique to catalog and characterize the behavior of malware, so that malware families, the level of potential threat, and the effects of malware can be identified. Combinations of complementary techniques, including third-party tools, are integrated to scan and illustrate how malware may harm a target machine, search for related malware behavior, and organize malware into families, based on a number of characteristics.
APA, Harvard, Vancouver, ISO, and other styles
4

Lucas, Christopher G. "Patent semantics : analysis, search and visualization of large text corpora." Thesis, Massachusetts Institute of Technology, 2004. http://hdl.handle.net/1721.1/33146.

Full text
Abstract:
Thesis (M. Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004.<br>Includes bibliographical references (leaves 47-48).<br>Patent Semantics is system for processing text documents by extracting features capturing their semantic content, and searching, clustering, and relating them by those same features. It is set apart from existing methodologies by combining a visualization scheme that integrates retrieval and clustering, providing a variety of ways to find and relate documents depending on their goals. In addition, the system provides an explanatory mechanism that makes the retrieval an understandable process rather than a black box. The domain in which the system currently works is biochemistry and molecular biology patents but it is not intrinsically constrained to any document set.<br>by Christopher G. Lucas.<br>M.Eng.and S.B.
APA, Harvard, Vancouver, ISO, and other styles
5

Ghanem, Amer G. "Identifying Patterns of Epistemic Organization through Network-Based Analysis of Text Corpora." University of Cincinnati / OhioLINK, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1448274706.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Gashteovski, Kiril [Verfasser], and Rainer [Akademischer Betreuer] Gemulla. "Compact open information extraction: methods, corpora, analysis / Kiril Gashteovski ; Betreuer: Rainer Gemulla." Mannheim : Universitätsbibliothek Mannheim, 2021. http://d-nb.info/123650285X/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Kwan, Yu Hang. "Assessing pre-service teaching practicum: a corpus-assisted discourse analysis of field experience supervision forms." HKBU Institutional Repository, 2014. https://repository.hkbu.edu.hk/etd_oa/117.

Full text
Abstract:
This study analyses the moves, linguistic realisations and mitigation devices of four teaching practicum supervisors' cmmnents written to eighteen supervisees on fifty- four standard field experience supervision forms. Broadly speaking, the results reveal that the supervisors use evaluative adjectives, modality markers and imperatives to give praise and acknowledge good practice, identify weaknesses, and suggest improvements in relation to teaching and managing learning. As the supervision exercise can be face-threatening, the supervisors demonstrate sensitivity to redress their negative comments through such mitigation strategies as hedging, praise-criticism pairs, rhetorical questions and personal attributions, although the strengths of such devices may vary according to contextual issues. These findings enable readers to understand how the pragma-linguistic resources realise two global communicative purposes, i.e., "Assessment of Learning" and "Assessment for Learning".
APA, Harvard, Vancouver, ISO, and other styles
8

Cid, Uribe Miriam Elizabeth. "Contrastive analysis of English and Spanish intonation using computer corpora - a preliminary study." Thesis, University of Leeds, 1989. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.236136.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Sawalha, Majdi Shaker Salem. "Open-source resources and standards for Arabic word structure analysis : fine grained morphological analysis of Arabic text corpora." Thesis, University of Leeds, 2011. http://etheses.whiterose.ac.uk/2165/.

Full text
Abstract:
Morphological analyzers are preprocessors for text analysis. Many Text Analytics applications need them to perform their tasks. The aim of this thesis is to develop standards, tools and resources that widen the scope of Arabic word structure analysis - particularly morphological analysis, to process Arabic text corpora of different domains, formats and genres, of both vowelized and non-vowelized text. We want to morphologically tag our Arabic Corpus, but evaluation of existing morphological analyzers has highlighted shortcomings and shown that more research is required. Tag-assignment is significantly more complex for Arabic than for many languages. The morphological analyzer should add the appropriate linguistic information to each part or morpheme of the word (proclitic, prefix, stem, suffix and enclitic); in effect, instead of a tag for a word, we need a subtag for each part. Very fine-grained distinctions may cause problems for automatic morphosyntactic analysis – particularly probabilistic taggers which require training data, if some words can change grammatical tag depending on function and context; on the other hand, finegrained distinctions may actually help to disambiguate other words in the local context. The SALMA – Tagger is a fine grained morphological analyzer which is mainly depends on linguistic information extracted from traditional Arabic grammar books and prior knowledge broad-coverage lexical resources; the SALMA – ABCLexicon. More fine-grained tag sets may be more appropriate for some tasks. The SALMA –Tag Set is a theory standard for encoding, which captures long-established traditional fine-grained morphological features of Arabic, in a notation format intended to be compact yet transparent. The SALMA – Tagger has been used to lemmatize the 176-million words Arabic Internet Corpus. It has been proposed as a language-engineering toolkit for Arabic lexicography and for phonetically annotating the Qur’an by syllable and primary stress information, as well as, fine-grained morphological tagging.
APA, Harvard, Vancouver, ISO, and other styles
10

Van, Olmen Daniel. "The imperative in English and Dutch : a functional analysis in comparable and parallel corpora." Thesis, Lancaster University, 2011. http://eprints.lancs.ac.uk/66233/.

Full text
APA, Harvard, Vancouver, ISO, and other styles
More sources

Books on the topic "Corpora analysis"

1

Schmidt, Thomas, and Kai Wörner, eds. Multilingual Corpora and Multilingual Corpus Analysis. John Benjamins Publishing Company, 2012. http://dx.doi.org/10.1075/hsm.14.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Schmidt, Thomas, and Kai Wörner. Multilingual corpora and multilingual corpus analysis. John Benjamins Pub. Co., 2012.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
3

Michael, Hoey, ed. Text, discourse, and corpora: Theory and analysis. Continuum, 2007.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
4

Facchinetti, Roberta. News as changing texts: Corpora, methodologies and analysis. Cambridge Scholars Pub., 2012.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
5

Reppen, Randi. Using corpora in the language classroom. Cambridge University Press, 2010.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
6

Gavioli, Laura. Exploring corpora for ESP learning. John Benjamins, 2005.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
7

Marina, Bondi, Gavioli Laura, and Silver Marc, eds. Academic discourse, genre and small corpora. Officina edizioni, 2004.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
8

1958-, Kawaguchi Yuji, Minegishi Makoto, and Durand Jacques 1947-, eds. Corpus analysis and variation in linguistics. John Benjamins Pub. Co., 2009.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
9

Linda, Lombardo, ed. Using corpora to learn about language and discourse. Peter Lang, 2009.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
10

Miklavčič, Jana Zemljarič. Govorni korpusi. Znanstvena založba Filozofske fakultete, 2008.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
More sources

Book chapters on the topic "Corpora analysis"

1

Coxhead, Averil. "Analysis of corpora." In The Routledge Handbook of Research Methods in Applied Linguistics. Routledge, 2019. http://dx.doi.org/10.4324/9780367824471-39.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Minker, Wolfgang, Alex Waibel, and Joseph Mariani. "Applications and Corpora." In Stochastically-Based Semantic Analysis. Springer US, 1999. http://dx.doi.org/10.1007/978-1-4615-5255-0_3.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Liu, Xinghua, and Anne McCabe. "Appraisal Analysis." In Corpora and Intercultural Studies. Springer Singapore, 2017. http://dx.doi.org/10.1007/978-981-10-6415-9_5.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Lam Sut I, Michelle. "Case Analysis." In Corpora and Intercultural Studies. Springer Nature Singapore, 2023. http://dx.doi.org/10.1007/978-981-99-1195-0_4.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Matsusaka, Yosuke, Yasuhiro Katagiri, Masato Ishizaki, and Mika Enomoto. "Unsupervised Clustering in Multimodal Multiparty Meeting Analysis." In Multimodal Corpora. Springer Berlin Heidelberg, 2009. http://dx.doi.org/10.1007/978-3-642-04793-0_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Reidsma, Dennis, Dirk Heylen, and Rieks op den Akker. "On the Contextual Analysis of Agreement Scores." In Multimodal Corpora. Springer Berlin Heidelberg, 2009. http://dx.doi.org/10.1007/978-3-642-04793-0_8.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Bednarek, Monika. "Enacting Affect: Pragmatic Analysis." In Emotion Talk Across Corpora. Palgrave Macmillan UK, 2008. http://dx.doi.org/10.1057/9780230285712_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Gentile, Federico Pio. "Contrastive Analysis and Results." In Corpora, Corpses and Corps. Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-78276-4_7.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Angermeyer, Philipp S., Bernd Meyer, and Thomas Schmidt. "Sharing community interpreting corpora." In Multilingual Corpora and Multilingual Corpus Analysis. John Benjamins Publishing Company, 2012. http://dx.doi.org/10.1075/hsm.14.19ang.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Seki, Yohei. "Opinion Analysis Corpora Across Languages." In Evaluating Information Retrieval and Access Tasks. Springer Singapore, 2020. http://dx.doi.org/10.1007/978-981-15-5554-1_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Corpora analysis"

1

Galeshchuk, Svitlana, Ju Qiu, and Julien Jourdan. "Sentiment Analysis for Multilingual Corpora." In Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics, 2019. http://dx.doi.org/10.18653/v1/w19-3717.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Aizawa, Akiko. "Analysis of source identified text corpora." In the 41st Annual Meeting. Association for Computational Linguistics, 2003. http://dx.doi.org/10.3115/1075096.1075145.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Gupta, Madhumita, and Sreya Guha. "Topic Based Analysis of Text Corpora." In Second International Conference on Advances in Computer Science and Information Technology. Academy & Industry Research Collaboration Center (AIRCC), 2016. http://dx.doi.org/10.5121/csit.2016.61403.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Cruz, Andre Ferreira, Gil Rocha, and Henrique Lopes Cardoso. "Exploring Spanish Corpora for Portuguese Coreference Resolution." In 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS). IEEE, 2018. http://dx.doi.org/10.1109/snams.2018.8554705.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Mover, Sergio, Sriram Sankaranarayanan, Rhys Braginton Pettee Olsen, and Bor-Yuh Evan Chang. "Mining framework usage graphs from app corpora." In 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 2018. http://dx.doi.org/10.1109/saner.2018.8330216.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

binti Khamis, Noorli, and Imran Ho bin Abdullah. "Correspondence analysis: Comparing wordlists across specialised corpora." In 2012 IEEE Colloquium on Humanities, Science and Engineering Research (CHUSER). IEEE, 2012. http://dx.doi.org/10.1109/chuser.2012.6504377.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Ren, Xiang, Yuanhua Lv, Kuansan Wang, and Jiawei Han. "Comparative Document Analysis for Large Text Corpora." In WSDM 2017: Tenth ACM International Conference on Web Search and Data Mining. ACM, 2017. http://dx.doi.org/10.1145/3018661.3018690.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Karna, Hrvoje, Anita Gudelj, and Silvana Kokan. "Text Analysis of the Hybrid Digital Corpora." In 2021 International Conference on Software, Telecommunications and Computer Networks (SoftCOM). IEEE, 2021. http://dx.doi.org/10.23919/softcom52868.2021.9559119.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Medhat, Walaa, Ahmed H. Yousef, and Hoda K. Mohamed. "Component analysis of a Sentiment Analysis framework on different corpora." In 2014 9th International Conference on Computer Engineering & Systems (ICCES). IEEE, 2014. http://dx.doi.org/10.1109/icces.2014.7030976.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

"Hot Events Analysis based on Japanese Corpora Processing." In 2017 8th International Computer Systems and Education Management Conference. Francis Academic Press, 2017. http://dx.doi.org/10.25236/icsemc.2017.02.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Reports on the topic "Corpora analysis"

1

Carlson, Andrew, Tom M. Mitchell, and Ian Fette. Data Analysis Project: Leveraging Massive Textual Corpora Using n-Gram Statistics. Defense Technical Information Center, 2008. http://dx.doi.org/10.21236/ada485623.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Charumathi, Dr B. Developing a corporate social justice index. Indian School of Development Management, 2024. https://doi.org/10.58178/246.1047.

Full text
Abstract:
This study aims to provide a comprehensive understanding of how large companies address and report on social justice issues, ultimately fostering greater transparency and accountability with a sustainability focus. This study develops a conceptual framework which involves identifying major social justice themes and corporate stakeholders, recognising social justice issues that have been addressed and grouping them into sub-indices and to develop an unweighted Corporate Social Justice Disclosure Index (CSJDI). After checking the reliability and validity of the index using content analysis, this study measures the extent of disclosure in the annual reports of BSE 100 companies over a seven-year period (FY 2016-17 to FY 2022-23). Using qualitative data analysis, it studies the determinants of disclosure and analyses its impact on the performance indicators. There is an increasing trend in the CSJ disclosures and its components over the years and there exist significant year, company, and industry-wise differences, especially before and after Business Responsibility and Sustainability Report (BRSR). Revenue, women on board and ESG disclosure scores positively and significantly influences the CSJ disclosures. The CSJ disclosure positively and significantly influences current market capitalisation, revenue and ESG scores. This study encourages corporations to enhance their disclosure practices and makes regulators bring changes in the CSJ reporting landscape, ultimately contributing to the advancement of corporate social responsibility and broader societal well-being.
APA, Harvard, Vancouver, ISO, and other styles
3

Morck, Randall, Andrei Shleifer, and Robert Vishny. Management Ownership and Corporate Performance: An Empirical Analysis. National Bureau of Economic Research, 1986. http://dx.doi.org/10.3386/w2055.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

White, Michelle. Economic Analysis of Corporate and Personal Bankruptcy Law. National Bureau of Economic Research, 2005. http://dx.doi.org/10.3386/w11536.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

KOSHEVENKO, S. THE KEY HR TRENDS IN CORPORATE TRAINING. Science and Innovation Center Publishing House, 2021. http://dx.doi.org/10.12731/2070-7568-2021-10-5-1-49-53.

Full text
Abstract:
The article examines the main trends in corporate training in 2021. The author conducted an analysis of research in the field of hr-training, highlighted the main needs and directions of changes in corporate personnel training systems. The article identifies the main trends in the training of personnel of organizations in the period after the pandemic in 2021.
APA, Harvard, Vancouver, ISO, and other styles
6

Johnson, M. L., M. S. Phifer, W. A. Bratten, and M. L. Emrich. Corporate data base machines: Market analysis for the Joint Staff. Office of Scientific and Technical Information (OSTI), 1989. http://dx.doi.org/10.2172/6004609.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Raso-Domínguez, Xavier, Diana Vazquez Espinosa, Gift Dembetembe, and Philippe Gugler. The Equality Cookbook: Swiss Companies’ Recipes for Gender Parity. Cantonal and University Library Fribourg, 2025. https://doi.org/10.51363/unifr.ewp.r7xxx6.

Full text
Abstract:
Corporations are under increasing pressure to align with SDG 5 (Gender Equality), ensuring equal opportunities for women and men in leadership, pay, and career growth. While studies on gender equality are growing, limited research explores the evolutionary paths leading to gender parity. This paper employs Time-Series fuzzy-set qualitative comparative analysis (TS/fsQCA) to examine the distinct and evolving paths that lead to gender parity within Swiss companies between 2020 and 2023. By analysing configurations of variables, including board independence, changes in women’s representation in managerial positions, executive compensation tied to ESG goals, CSR evaluations, and alignment with SDGs reported in annual disclosures, the study identifies how these elements combine to advance gender parity. The analysis reveals multiple pathways to achieving gender parity, with distinct combinations of conditions driving progress over time. While earlier years highlight the pivotal role of women in leadership positions, later years demonstrate increasing reliance on integrated strategies incorporating ESG-linked practices and SDG alignment. Furthermore, the study identifies the evolution of corporate approaches, showing that pathways have become more complex as firms adapt to changing regulatory and societal expectations. These findings contribute to understanding how corporate governance structures, strategic alignment with social goals, and evolving business practices support the advancement of gender parity in the corporate sector. By shedding light on temporal variations, the study offers actionable insights for policymakers and practitioners aiming to foster gender parity through sustainable corporate strategies.
APA, Harvard, Vancouver, ISO, and other styles
8

Alonso-Robisco, Andrés, Andrés Alonso-Robisco, José Manuel Carbó, et al. Empowering financial supervision: a SupTech experiment using machine learning in an early warning system. Banco de España, 2025. https://doi.org/10.53479/39320.

Full text
Abstract:
New technologies have made available a vast amount of new data in the form of text, recording an exponentially increasing share of human and corporate behavior. For financial supervisors, the information encoded in text is a valuable complement to the more traditional balance sheet data typically used to track the soundness of financial institutions. In this study, we exploit several natural language processing (NLP) techniques as well as network analysis to detect anomalies in the Spanish corporate system, identifying both idiosyncratic and systemic risks. We use sentiment analysis at the corporate level to detect sentiment anomalies for specific corporations (idiosyncratic risks), while employing a wide range of network metrics to monitor systemic risks. In the realm of supervisory technology (SupTech), anomaly detection in sentiment analysis serves as a proactive tool for financial authorities. By continuously monitoring sentiment trends, SupTech applications can provide early warnings of potential financial distress or systemic risks.
APA, Harvard, Vancouver, ISO, and other styles
9

Chong, Alberto E., and Florencio López-de-Silanes. Corporate Governance and Firm Value in Mexico. Inter-American Development Bank, 2006. http://dx.doi.org/10.18235/0010864.

Full text
Abstract:
The objective of this paper is twofold. On one hand, we undertake an analysis of the recent evolution of capital markets and their effect on the availability of external financing in Mexico in the last two decades. On the other hand, based on a newly assembled firm-level data set on corporate governance and firm performance, we show that better firm-level corporate governance practices are linked to higher valuations, better performance and more dividends disbursed to investors. These results hold after controlling for endogeneity. Overall, the evidence shows that the Mexican legal environment poses serious problems for access to capital.
APA, Harvard, Vancouver, ISO, and other styles
10

Sembler, Jose Ignacio, Ernesto Cuestas, Roni Szwedzki, et al. Corporate Evaluation: Evaluation of IDB Invest. Inter-American Development Bank, 2023. http://dx.doi.org/10.18235/0005014.

Full text
Abstract:
At the 2015 annual meeting in Busan, the Boards of Governors of the Inter-American Development Bank (IDB) and the Inter-American Investment Corporation (IIC) decided to consolidate the IDB Group's private-sector operations into the IIC. This process of consolidation and capitalization, known as the private sector merge-out, took effect on 1 January 2016. The Busan Resolution set forth a “Renewed Vision” for promoting development in the region through the private sector. This Renewed Vision provides a long-term framework (2016-2025) for IDB Invest and focuses on the objectives of: (i) strengthening effectiveness and additionality; (ii) maximizing synergies between the public and private sectors; and (iii) maximizing the efficient use of resources and ensuring long-term financial sustainability. This evaluation seeks to independently assess and report on the effectiveness of the implementation to date of the Renewed Vision, aimed at promoting development in the region through the private sector. Specifically, the general question that the evaluation aims to answer is the following: To what extent is IDB Invest on its way to achieving the end objectives set out in the Renewed Vision? To that end, the Office of Evaluation and Oversight (OVE) used a combination of complementary methods, including a review of strategic and corporate documents, financial and portfolio analyses, interviews and surveys, and documentary analyses of a sample of operations. This evaluation covers the 2016-2021 period and uses as reference the findings in OVE's 2017 midterm review of implementation of the merge-out to further analyze areas that had not yet matured at that time. The evaluation was also guided by a reference framework that linked the objectives of the Renewed Vision to the main activities and initiatives undertaken thus far to help achieve those objectives.
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!