Journal articles: 'Automated language translation'

1

Boitet, Christian. "Automated Translation." Revue française de linguistique appliquée VIII, no. 2 (2003): 99. http://dx.doi.org/10.3917/rfla.082.121.

Full text

APA, Harvard, Vancouver, ISO, and other styles

2

Rychtyckyj, Nestor, and Craig Plesco. "Applying Automated Language Translation at a Global Enterprise Level." AI Magazine 34, no. 1 (December 6, 2012): 43. http://dx.doi.org/10.1609/aimag.v34i1.2436.

Full text

Abstract:

In 2007 we presented a paper that described the application of Natural Language Processing (NLP) and Machine Translation (MT) for the automated translation of process build instructions from English to other languages to support Ford’s assembly plants in non-English speaking countries. This project has continued to evolve with the addition of new languages and improvements to the translation process. However, we discovered that there was a large demand for automated language translation across all of Ford Motor Company and we decided to expand the scope of our project to address these requirements. This paper will describe our efforts to meet all of Ford’s internal translation requirements with AI and MT technology and focus on the challenges and lessons that we learned from applying advanced technology across an entire corporation.

APA, Harvard, Vancouver, ISO, and other styles

3

Karamysheva, Iryna, Roksolyana Nazarchuk, and Kateryna Lishnievska. "PECULIARITIES OF TRANSLATION OF ENGLISH LANGUAGE INSTRUCTIONS WITH THE USE OF ADDITIONAL TOOLS IN SDL TRADOS STUDIO AND MEMOQ TRANSLATOR PRO ENVIRONMENTS." Research Bulletin Series Philological Sciences 1, no. 193 (April 2021): 376–82. http://dx.doi.org/10.36550/2522-4077-2021-1-193-376-382.

Full text

Abstract:

The presented research focuses upon the analysis of additional specific tools (namely translation memory (TM) technologies) of SDL Trados Studio 2017 and MemoQ Translator Pro 2017 automated translation systems and their application for translation of English-language instructions into Ukrainian. With the help of the above-mentioned software tools 60 English-language operating instructions for household appliances have been translated into Ukrainian (three projects were created in both systems, each containing 10 instructions). TM is a database consisting of segments of source text (sentences, paragraphs, headings, etc.) and translations of each of these segments. TM, used in both SDL Trados Studio and MemoQ Translator Pro systems, significantly improves the quality, speed, consistency and efficiency of each translation task. SDL Trados Studio 2017 and MemoQ Translator Pro 2017 compare content of the current segment of the source file with segments of the same language already contained in the TM. If the system finds a similar segment that is currently stored in the TM, it prompts the translator to use a ready-made translation. The degree of equivalence between the segment of the source document and the segment contained in the TM is expressed as a percentage. Thus, both software tools capture the cases of «Exact match», «Perfect match» and «Fuzzy match». SDL Trados Studio 2017 andMemoQ Translator Pro 2017 slightly differ in segment statuses and colour segment marking. Both systems do not make adjustments automatically, but their identification and navigation capabilities allow one to quickly correct such errors by hand. Unfortunately, the initial focus on the Russian-language market (and, consequently, on the Russian language system) has led to another peculiarity of automated translation into Ukrainian in SDL Trados Studio and MemoQ Translator Pro systems, namely a large number of stylistic errors that require quality personalized correction.

APA, Harvard, Vancouver, ISO, and other styles

4

Park, Mina, and Milam Aiken. "Automated Translation Preference with a Bilingual Chatbot." INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 19 (July 23, 2019): 7502–9. http://dx.doi.org/10.24297/ijct.v19i0.8343.

Full text

Abstract:

Free, online language translation services are being used by people around to the world to facilitate communication. However, it takes time and effort for a person to load the Web page in a browser, copy and paste text into the site, and translate words. The process quickly becomes tiresome. Instead, some computer programs are providing automated translation. However, no studies have been conducted to determine the efficiency or effectiveness of such an approach. In this study, we compare how students used an English-based chatbot with and without German automatic translation. Results show that students took nearly 1.5 times longer than their stated upper time limit to manually translate. In contrast, the automated translation was at least 30 times faster. In addition, the students were significantly more satisfied with the automated than the manual system.

APA, Harvard, Vancouver, ISO, and other styles

5

Vanallemeersch, Tom, and Hendrik Kockaert. "Automated detection of inconsistent phraseology translation." Southern African Linguistics and Applied Language Studies 28, no. 3 (November 2010): 283–90. http://dx.doi.org/10.2989/16073614.2010.545030.

Full text

APA, Harvard, Vancouver, ISO, and other styles

6

T., S., Yoshiki Mikami, AIC Mohideen, and K. G. "Automated Language Translation: Opportunities and Impact on the Society." International Journal of Computer Applications 178, no. 34 (July 17, 2019): 43–50. http://dx.doi.org/10.5120/ijca2019919232.

Full text

APA, Harvard, Vancouver, ISO, and other styles

7

Sugden, Don. "Machine Aids to Translation: Automated Language Processing System (ALPS)." Meta: Journal des traducteurs 30, no. 4 (1985): 403. http://dx.doi.org/10.7202/004310ar.

Full text

APA, Harvard, Vancouver, ISO, and other styles

8

Harris, Brian. "Origins and conceptual analysis of the term ‘traductologie/translatology’." Babel. Revue internationale de la traduction / International Journal of Translation 57, no. 1 (April 19, 2011): 15–31. http://dx.doi.org/10.1075/babel.57.1.02har.

Full text

Abstract:

The term traductologie was coined in the early 1970s to correspond to the establishment of translation as a valid object of scientific and academic study. Its English equivalent is usually translation studies but sometimes translatology.Traductologie has two conceptual levels: the metalevel of study and analysis and the object level of what is thus examined, namely translations and translating. Both levels are variegated. The metalevel can usefully be mapped into broad ‘paradigms’ or disciplinary approaches: literary, linguistic, semiotic, philosophical, historical, lexico-terminological, automated (MT), prescriptive and pedagogical, scientific-experimental, text and corpus oriented, process oriented, social etc. The object level is traditionally divided between written translation and oral translation (interpretation), and the former is often categorized by ‘text types’.Permutations of the meta and object categories characterize different varieties of traductologie. When discoursing about it, one should be aware that a statement that is meaningful in one variety may be meaningless in another.

APA, Harvard, Vancouver, ISO, and other styles

9

Mahardika, Rizka. "THE USE OF TRANSLATION TOOL IN EFL LEARNING: DO MACHINE TRANSLATION GIVE POSITIVE IMPACT IN LANGUAGE LEARNING?" Pedagogy : Journal of English Language Teaching 5, no. 1 (July 30, 2017): 49. http://dx.doi.org/10.32332/pedagogy.v5i1.755.

Full text

Abstract:

Translation tools are commonly used for translating a text written in one language (source language) into another language (target language). They are used to help translators in translating big numbers of translation works in effective time. There are three types of translation tools being studied in the article entitled Machine Translation Tools: Tools of the Translator’s Trade written by Peter Katsberg published in 2012. They are Fully Automated Machine Translation (or FAMT), Human Aided Machine Translation (or HAMT) and Machine Aided Human Translation (or MAHT). Katsberg analyzed how each translation tool works, the naturality and approriateness of its translation and the compatibility of using it. In this digital era, translation tools are not only popular among translators but also among EFL learners. Beginning with the use of portable dictionary such as Alfalink and expanding to the more sopisticated translation tool such as Google Translate. Some novice learners usually use this translation tools in doing their task without recorrecting the translation result. This happens perhaps because they do not have enough background knowledge to evaluate the translation result. Thus, it will be better when the learners have good mastery in basic English and train them to be aware in evaluating the result from translation tools. On the other words, Human Aided Machine Translation may be the wise choice to do translation task effectively and efficiently particularly in managing the time.

APA, Harvard, Vancouver, ISO, and other styles

10

Vukčević, Miodrag M. "Turns of the centuries. The Transkribus automated tool for recognition, transcription and translation of handwritten historical documents." Babel. Revue internationale de la traduction / International Journal of Translation 66, no. 2 (April 7, 2020): 294–310. http://dx.doi.org/10.1075/babel.00159.vuk.

Full text

Abstract:

Abstract The translation of handwritten historical documents faces many challenges due to variation in the writing style, local language, and an inevitable language change. Even the transliteration from Cyrillic to Latin characters is standardized by the bijective transliteration standard ISO 9. This presentation introduces a number of tools offered by Transkribus for the automated processing of documents, such as Handwritten Text Recognition (HTR) and Document Understanding, which are needed for the translation of historical documents. Next to the problem of decoding handwritten documents, written for example in Kurrentschrift using ancient terminology, changed meanings and different spelling have additionally to be considered during the translation of texts from earlier centuries. Resolution strategies on a case study show different methods for ensuring quality translations.

APA, Harvard, Vancouver, ISO, and other styles

11

Deckert, Mikołaj. "Translatorial and non-translatorial automaticity." Jezikoslovlje 20, no. 1 (2019): 85–118. http://dx.doi.org/10.29162/jez.2019.4.

Full text

Abstract:

The paper examines empirically a subset of cognitive processes in trainee translators with the objective of gaining an insight into their decision-making. Specifically, we are interested in the nature and role of automated processing – above all, how pronounced it can be and how it influences the quality of decisions. The paper’s objective is then to come up with an integrative view of the relationship between translatorial automaticity and cognitive automaticity in general, viz. that not associated with translation. This could help us better capture some of the characteristics of translator behaviour and supplement our understanding of translation competence. Results from experiments with trainees reported in the paper show no correlation between the two dimensions of automated processing, and indicate that translatorial automaticity could be harder to override than its more general counterpart.

APA, Harvard, Vancouver, ISO, and other styles

12

Kim, Hayan, Jin-Kook Lee, Jaeyoung Shin, and Jungsik Choi. "Visual language approach to representing KBimCode-based Korea building code sentences for automated rule checking." Journal of Computational Design and Engineering 6, no. 2 (August 20, 2018): 143–48. http://dx.doi.org/10.1016/j.jcde.2018.08.002.

Full text

Abstract:

Abstract The Building Information Modeling (BIM) and its applications enable an automatic building permit process based on 3D building models and their associated information. A crucial part of the building permit process is the interpretation and transformation of natural language-based building regulation into a computer-readable and executable format. As other countries and their projects have developed a certain type of rule-translation methods, KBimCode, part of the KBim application series, has been developed and supported by the Korean government to ignite an automatic, BIM-based building permit system on top of the current e-submission system, which is called Seumter. The rule translation process usually employs a computer hard-coded approach because of its ease of implementation, and there have been advances in making the computer understand the natural language-based building regulations using parametric input tables and script languages. This project includes a step for developing a logic rule-based approach for translating natural language into computer-executable code. However, the main contribution of this study is the introduction of an approach to represent such text-based regulations using visual language for novice programmers, architects, and rule reviewers. This paper describes a KBimCode visual language that is easy-to-write and intuitive because it uses visual symbols instead of textual coding. Highlights KBVL is an approach to computer-readable building code in visual form. KBVL supports easy-to-make rules for complicated building design rules. This paper demonstrates KBVL using KBimCode for Korean Building permit requirements.

APA, Harvard, Vancouver, ISO, and other styles

13

BEDNAR, Branislav, Zuzana BILANOVA, and Adrian PEKAR. "THE AUTOMATED TRANSLATION OF NATURAL LANGUAGE SENTENCES INTO INTENSIONAL LOGIC AT THE TYPE ANALYSIS AND CONSTRUCTION SYNTHESIS LEVELS." Acta Electrotechnica et Informatica 19, no. 4 (January 13, 2020): 3–7. http://dx.doi.org/10.15546/aeei-2019-0023.

Full text

APA, Harvard, Vancouver, ISO, and other styles

14

Mouratidis, Despoina, and Katia Kermanidis. "Ensemble and Deep Learning for Language-Independent Automatic Selection of Parallel Data." Algorithms 12, no. 1 (January 18, 2019): 26. http://dx.doi.org/10.3390/a12010026.

Full text

Abstract:

Machine translation is used in many applications in everyday life. Due to the increase of translated documents that need to be organized as useful or not (for building a translation model), the automated categorization of texts (classification), is a popular research field of machine learning. This kind of information can be quite helpful for machine translation. Our parallel corpora (English-Greek and English-Italian) are based on educational data, which are quite difficult to translate. We apply two state of the art architectures, Random Forest (RF) and Deeplearnig4j (DL4J), to our data (which constitute three translation outputs). To our knowledge, this is the first time that deep learning architectures are applied to the automatic selection of parallel data. We also propose new string-based features that seem to be effective for the classifier, and we investigate whether an attribute selection method could be used for better classification accuracy. Experimental results indicate an increase of up to 4% (compared to our previous work) using RF and rather satisfactory results using DL4J.

APA, Harvard, Vancouver, ISO, and other styles

15

Schaeffer, Moritz, and Michael Carl. "Shared representations and the translation process." Describing Cognitive Processes in Translation: Acts and events 8, no. 2 (November 29, 2013): 169–90. http://dx.doi.org/10.1075/tis.8.2.03sch.

Full text

Abstract:

The purpose of the present paper is to investigate automated processing during translation. We provide evidence from a translation priming study which suggests that translation involves activation of shared lexico-semantic and syntactical representations, i.e., the activation of features of both source and target language items which share one single cognitive representation. We argue that activation of shared representations facilitates automated processing. The paper revises the literal translation hypothesis and the monitor model (Ivir 1981; Toury 1995; Tirkkonen-Condit 2005), and re-defines it in terms of findings from translation process research. On the basis of the evidence, we propose a recursive model of translation.

APA, Harvard, Vancouver, ISO, and other styles

16

Svoboda, Tomáš. "No linguistic borders ahead? Looking beyond the knocked-down language barrier." TranscUlturAl: A Journal of Translation and Cultural Studies 9, no. 2 (September 22, 2017): 86. http://dx.doi.org/10.21992/t93q0f.

Full text

Abstract:

The article deals with the concept of borders and barriers in considering scenarios where the linguistic barrier is eventually lifted by technology one day. It begins with reflections on the biblical narrative of the Tower of Babel as an ancient representation of the concept of linguistic barriers between language communities. It gives numerous examples of the uptake of this narrative, from Translation Studies, to project calls and the marketing statements of machine translation technology. In the following section, examples of existing technology are presented, which could be considered as a first generation of automatic translation/interpretation systems. In the main section, several trends are predicted for both the translators’ profession and general economic/business/political/societal developments. The consequences are anticipated of a situation where ordinary cross-language communication will eventually have been almost fully taken over by automated systems. The article points to both the technology’s positive potential and, by showing the eventual risks involved, it equally rejects an attitude of the technology’s uncritical uptake. The article closes by pointing to the ethical dimension of machine translation systems linked with their types of uses and the choices reserved for their users.

APA, Harvard, Vancouver, ISO, and other styles

17

Cui, Chang Chang, and Guo Qi Li. "Translate the Stateflow Models into Alloy for Safety Analysis." Applied Mechanics and Materials 490-491 (January 2014): 1702–5. http://dx.doi.org/10.4028/www.scientific.net/amm.490-491.1702.

Full text

Abstract:

Safety analysis is widely appreciated for critical systems. Model-based safety analysis is emerged under the background of mode-based development. For automated model-based safety analysis, extended models should be translated into script of input of formal analyzer. Consequently, many translation systems are constructed. In this paper, we explore the translating method between Stateflow models and Alloy, which is a declarative specification language for expressing complex structural constraints and behavior in a software system. Concrete translation systems could be built based on the method.

APA, Harvard, Vancouver, ISO, and other styles

18

Pandey, Priyank, Manju Khari, Raghavendra Kumar, and Dac-Nhuong Le. "Automatic Generation of Synsets for Wordnet of Hindi Language." International Journal of Natural Computing Research 7, no. 2 (April 2018): 31–47. http://dx.doi.org/10.4018/ijncr.2018040103.

Full text

Abstract:

India is a land of 122 languages and numerous dialects. Lack of competent lexical resources for Indian languages is a ubiquitous fact, which negatively affects the development of tools for NLP of Indian languages. Recent advancements like the Indo WordNet project has significantly contributed to dealing with the scarcity of lexicons, but the progress and coverage is a matter of dispute. The bottlenecks, cost, time, and skilled lexicographers further slackens the progress. In this article, the authors propose a technique to automate the generation of lexical entries using a machine learning approach which visibly expedites the process of lexicon generation like WordNet. The reluctance to adopt an automated approach is majorly credited to a lack of accuracy, the inability to capture a regional touch of a language, incorrect back-translation, etc. To overcome this issue, the author will use Wikipedia to validate the synsets.

APA, Harvard, Vancouver, ISO, and other styles

19

Munyangeyo, Théophile. "A Reflective Account on Human Translation and Interpreting Faced with the Automated Text and Speech Processing Tools." Studies in Linguistics and Literature 4, no. 4 (September 9, 2020): p1. http://dx.doi.org/10.22158/sll.v4n4p1.

Full text

Abstract:

This reflection aims to depict the prospective position and role of translation and interpreting in the globalised world. Demographic factors point to a long-lasting multilingualism, which reflects the co-existence of linguistic identities within a variety of settings. From schools, to workplaces and communities, different languages are and will continue to be in use. In many countries, there is an increasing wave of using vernacular and migrant languages in education. However, the current global academic discourse on language situations does not sufficiently reflect this new looming reality. The focus of translation and interpreting studies has traditionally been placed on those languages that were perceived as internationally important. One would assume that economic and diplomatic interests have influenced that approach and attitude. With changes affecting the globalised world in relation to the rise of some emerging economies and new resources, it is clear that the interlingual communication will be one of the greatest challenges of the coming age. In this regard, a new paradigm in overall language promotion and education must be formulated within which human translation and interpretation continue be seen as important skills to be generally acquired.

APA, Harvard, Vancouver, ISO, and other styles

20

Chatzikoumi, Eirini. "How to evaluate machine translation: A review of automated and human metrics." Natural Language Engineering 26, no. 2 (September 11, 2019): 137–61. http://dx.doi.org/10.1017/s1351324919000469.

Full text

Abstract:

AbstractThis article presents the most up-to-date, influential automated, semiautomated and human metrics used to evaluate the quality of machine translation (MT) output and provides the necessary background for MT evaluation projects. Evaluation is, as repeatedly admitted, highly relevant for the improvement of MT. This article is divided into three parts: the first one is dedicated to automated metrics; the second, to human metrics; and the last, to the challenges posed by neural machine translation (NMT) regarding its evaluation. The first part includes reference translation–based metrics; confidence or quality estimation (QE) metrics, which are used as alternatives for quality assessment; and diagnostic evaluation based on linguistic checkpoints. Human evaluation metrics are classified according to the criterion of whether human judges directly express a so-called subjective evaluation judgment, such as ‘good’ or ‘better than’, or not, as is the case in error classification. The former methods are based on directly expressed judgment (DEJ); therefore, they are called ‘DEJ-based evaluation methods’, while the latter are called ‘non-DEJ-based evaluation methods’. In the DEJ-based evaluation section, tasks such as fluency and adequacy annotation, ranking and direct assessment (DA) are presented, whereas in the non-DEJ-based evaluation section, tasks such as error classification and postediting are detailed, with definitions and guidelines, thus rendering this article a useful guide for evaluation projects. Following the detailed presentation of the previously mentioned metrics, the specificities of NMT are set forth along with suggestions for its evaluation, according to the latest studies. As human translators are the most adequate judges of the quality of a translation, emphasis is placed on the human metrics seen from a translator-judge perspective to provide useful methodology tools for interdisciplinary research groups that evaluate MT systems.

APA, Harvard, Vancouver, ISO, and other styles

21

Rodriguez-Vivas, Angela, Oscar Mauricio Caicedo, Armando Ordoñez, Jéferson Campos Nobre, and Lisandro Zambenedetti Granville. "NORA: An Approach for Transforming Network Management Policies into Automated Planning Problems." Sensors 21, no. 5 (March 4, 2021): 1790. http://dx.doi.org/10.3390/s21051790.

Full text

Abstract:

Realizing autonomic management control loops is pivotal for achieving self-driving networks. Some studies have recently evidence the feasibility of using Automated Planning (AP) to carry out these loops. However, in practice, the use of AP is complicated since network administrators, who are non-experts in Artificial Intelligence, need to define network management policies as AP-goals and combine them with the network status and network management tasks to obtain AP-problems. AP planners use these problems to build up autonomic solutions formed by primitive tasks that modify the initial network state to achieve management goals. Although recent approaches have investigated transforming network management policies expressed in specific languages into low-level configuration rules, transforming these policies expressed in natural language into AP-goals and, subsequently, build up AP-based autonomic management loops remains unexplored. This paper introduces a novel approach, called NORA, to automatically generate AP-problems by translating Goal Policies expressed in natural language into AP-goals and combining them with both the network status and the network management tasks. NORA uses Natural Language Processing as the translation technique and templates as the combination technique to avoid network administrators to learn policy languages or AP-notations. We used a dataset containing Goal Policies to evaluate the NORA’s prototype. The results show that NORA achieves high precision and spends a short-time on generating AP-problems, which evinces NORA aids to overcome barriers to using AP in autonomic network management scenarios.

APA, Harvard, Vancouver, ISO, and other styles

22

Angona, Tazkia Mim, A. S. M. Siamuzzaman Shaon, Kazi Tahmid Rashad Niloy, Tajbia Karim, Zarin Tasnim, S. M. Salim Reza, and Tasmima Noushiba Mahbub. "Automated Bangla sign language translation system for alphabets by means of MobileNet." TELKOMNIKA (Telecommunication Computing Electronics and Control) 18, no. 3 (June 1, 2020): 1292. http://dx.doi.org/10.12928/telkomnika.v18i3.15311.

Full text

APA, Harvard, Vancouver, ISO, and other styles

23

Rinalduzzi, Matteo, Alessio De Angelis, Francesco Santoni, Emanuele Buchicchio, Antonio Moschitta, Paolo Carbone, Paolo Bellitti, and Mauro Serpelloni. "Gesture Recognition of Sign Language Alphabet Using a Magnetic Positioning System." Applied Sciences 11, no. 12 (June 17, 2021): 5594. http://dx.doi.org/10.3390/app11125594.

Full text

Abstract:

Hand gesture recognition is a crucial task for the automated translation of sign language, which enables communication for the deaf. This work proposes the usage of a magnetic positioning system for recognizing the static gestures associated with the sign language alphabet. In particular, a magnetic positioning system, which is comprised of several wearable transmitting nodes, measures the 3D position and orientation of the fingers within an operating volume of about 30 × 30 × 30 cm, where receiving nodes are placed at known positions. Measured position data are then processed by a machine learning classification algorithm. The proposed system and classification method are validated by experimental tests. Results show that the proposed approach has good generalization properties and provides a classification accuracy of approximately 97% on 24 alphabet letters. Thus, the feasibility of the proposed gesture recognition system for the task of automated translation of the sign language alphabet for fingerspelling is proven.

APA, Harvard, Vancouver, ISO, and other styles

24

Aiken, Milam. "Transterpreting Multilingual Electronic Meetings." International Journal of Management & Information Systems (IJMIS) 13, no. 1 (July 11, 2011): 35. http://dx.doi.org/10.19030/ijmis.v13i1.4940.

Full text

Abstract:

Communicating in a non-native language during a traditional, oral meeting is difficult, but a Group Support System (GSS) along with online machine translation (MT) can increase the efficiency and effectiveness of the discussion. An experimental study shows that a group facilitator can use a Web-based translation service to support a multilingual meeting, but completely automated language support is likely to be more efficient for large groups.

APA, Harvard, Vancouver, ISO, and other styles

25

ZHUSSUPOVA, B. T., D. ZH ALIPPAYEVA, and S. A. KUDUBAYEVA. "Dictionaries of the kazakh language as the basis of semantic analysis in the system of computer sign language translation." Bulletin of the National Engineering Academy of the Republic of Kazakhstan 4, no. 78 (January 10, 2020): 71–75. http://dx.doi.org/10.47533/2020.1606-146x.34.

Full text

Abstract:

his article discusses the development of a semantic dictionary of the Kazakh language for a computer translation system from Kazakh to Kazakh sign language, which will take into account the semantics of the Kazakh language and the Kazakh sign language. The semantic dictionary of the Kazakh language serves as the basis of computer translation technology from the Kazakh language to the Kazakh sign language. In the future, it will allow semantic analysis of the source text. The authors of the article analyzed and selected the available dictionaries of the Kazakh language used in the development of the semantic dictionary database. Dictionaries of the Kazakh language provide an opportunity for computer-based sign language translation of the Kazakh sign language. The article also presents the possibility of using L. S. Dimskis notation to develop a dictionary of the structure of gestures of the Kazakh sign language. The prospect of its inclusion in the database of semantic dictionary is revealed. And also revealed the need for a dictionary of gestures in the development of automated sign language translation system as a whole, taking into account its effectiveness and the possibility of full practical use.

APA, Harvard, Vancouver, ISO, and other styles

26

ZHUSSUPOVA, B. T., D. ZH ALIPPAYEVA, and S. A. KUDUBAYEVA. "Dictionaries of the kazakh language as the basis of semantic analysis in the system of computer sign language translation." Bulletin of the National Engineering Academy of the Republic of Kazakhstan 4, no. 78 (January 10, 2020): 71–75. http://dx.doi.org/10.47533/2020.1606-146x.34.

Full text

Abstract:

his article discusses the development of a semantic dictionary of the Kazakh language for a computer translation system from Kazakh to Kazakh sign language, which will take into account the semantics of the Kazakh language and the Kazakh sign language. The semantic dictionary of the Kazakh language serves as the basis of computer translation technology from the Kazakh language to the Kazakh sign language. In the future, it will allow semantic analysis of the source text. The authors of the article analyzed and selected the available dictionaries of the Kazakh language used in the development of the semantic dictionary database. Dictionaries of the Kazakh language provide an opportunity for computer-based sign language translation of the Kazakh sign language. The article also presents the possibility of using L. S. Dimskis notation to develop a dictionary of the structure of gestures of the Kazakh sign language. The prospect of its inclusion in the database of semantic dictionary is revealed. And also revealed the need for a dictionary of gestures in the development of automated sign language translation system as a whole, taking into account its effectiveness and the possibility of full practical use.

APA, Harvard, Vancouver, ISO, and other styles

27

Pym, Anthony, and Ester Torres-Simón. "Is automation changing the translation profession?" International Journal of the Sociology of Language 2021, no. 270 (May 24, 2021): 39–57. http://dx.doi.org/10.1515/ijsl-2020-0015.

Full text

Abstract:

Abstract As a language-intensive profession, translation is of frontline interest in the era of language automation. In particular, the development of neural machine translation systems since 2016 has brought with it fears that soon there will be no more human translators. When considered in terms of the history of automation, however, any such direct effect is far from obvious: the translation industry is still growing and machine translation is only one instance of automation. At the same time, data on remuneration indicate structural wage dispersion in professional translation services, with some signs that this dispersion may increase in certain market segments as automated workflows and translation technologies are adopted more by large language-service providers more than by smaller companies and individual freelancers. An analysis of recent changes in discourses on and in the translation profession further indicates conceptual adjustments in the profession that may be attributed to growing automation, particularly with respect to expanding skills set associated with translation, the tendency to combine translation with other forms of communication, and the use of interactive communication skills to authorize and humanize the results of automation.

APA, Harvard, Vancouver, ISO, and other styles

28

Hermawan, Muhammad, Herry Sujaini, and Novi Safriadi. "Influence Analysis of Smoothing Algorithms in Language Modelling for Indonesian Statistical Machine Translation." International Journal of Engineering and Applied Science Research 1, no. 1 (July 30, 2020): 11. http://dx.doi.org/10.26418/ijeasr.v1i1.41088.

Full text

Abstract:

The diversity of languages makes the need for translation so that communication between individuals of different languages can be appropriately established. The statistical translator engine (SMT) was a translator engine based on a statistical approach to parallel corpus analysis. One crucial part of SMT was language modeling (LM). LM was the calculation of word probability from a corpus-based on n-grams. There was a smoothing algorithm in LM where this algorithm will bring up the probability of a word whose value was zero. This study compares the use of the best smoothing algorithm from each of the three LM according to the standard Moses, namely KenLM, SRILM, and IRSTLM. For SRILM using smoothing algorithm interpolation with Witten-bell and interpolation with Ristads natural discounting, for KenLM using interpolation with modified Kneser-ney smoothing algorithm, and for IRSTLM using modified Kneser-ney and Witten-bell algorithm which was referenced based on previously researched. This study uses a corpus of 10,000 sentences. Tests carried out by BLEU and testing by Melayu Sambas linguists. Based on the results of BLEU testing and linguist testing, the best smoothing algorithm was chosen, namely modified Kneser-ney in KenLM LM, where the average results of automated testing, for Indonesian-Melayu Sambas and vice versa were 41. 6925% and 46. 66%. Moreover, for testing linguists, the accuracy of the Indonesian-Melayu Sambas language and vice versa was 77. 3165% and 77. 9095%

APA, Harvard, Vancouver, ISO, and other styles

29

Krzykawski, Michał. "Towards Idiodiversity." Philosophy Today 65, no. 2 (2021): 265–87. http://dx.doi.org/10.5840/philtoday2021414398.

Full text

Abstract:

This article discusses translation as a technique of doing philosophy and introduces the concept of idiodiversity as an alternative to the current model of automated translation machines. The dominant functionalist approach to technology has made these machines the agents of linguistic homogenisation, which constitutes a threat for the diversity of idiomatic open systems this article advocates for. However, as this article argues, the challenge is not merely to accuse automated translation technologies of impoverishing the knowledge of how to translate but, rather, to determine whether these technologies can be reappropriated for the purpose of preservation and revalorisation of translation and, more generally, as a conveyor of noodiversity. This challenge also involves the need to draw attention to the political significance of translation practices and to elaborate an alternative to the mechanistic approaches to translation, typical of computational linguistics and language engineering, through a heterodox approach to cybernetics.

APA, Harvard, Vancouver, ISO, and other styles

30

O’Sullivan, Carol. "Imagined spectators." Target. International Journal of Translation Studies 28, no. 2 (August 4, 2016): 261–75. http://dx.doi.org/10.1075/target.28.2.07osu.

Full text

Abstract:

Abstract This article considers theoretical and methodological questions of language and translation policy in the dissemination of audiovisual products across languages. This is an area where scholarly research is inevitably playing catch-up with rapid change both in the language industries and in film and television production. For example, we have a general sense of ‘dubbing territories’ and ‘subtitling territories’ but in reality the picture is more complex. Norms changed in the course of the home entertainment revolution, with the arrival of the DVD format in the late 1990s ostensibly increasing viewer choice and flexibility of translation provision. The relocation of much audiovisual material to an online environment has also generated fundamental changes in the way that works circulate, with volunteer translators and automated translation processes playing a larger role. Policy developments in access translation have meant that there have also been great changes relatively recently in the availability of SDH subtitling, audio description and other modes of access translation. This is a very broad field which raises many compelling research questions. At the same time, its very breadth does not lend itself to a comprehensive overview. The article will therefore aim to provide an orientation to, rather than a summary of, the theoretical and methodological challenges of research on this topic.

APA, Harvard, Vancouver, ISO, and other styles

31

Jiang, Jinlin, Ying Qin, and Ya Sun. "Constructing Automated Scoring Model for Human Translation with Multidisciplinary Technologies." International Journal of Emerging Technologies in Learning (iJET) 11, no. 02 (February 23, 2016): 57. http://dx.doi.org/10.3991/ijet.v11i02.5137.

Full text

Abstract:

This study constructed a computer scoring model for Chinese EFL learners’ English-to-Chinese translations using multidisciplinary techniques in corpus linguistics, natural language processing, information retrieval and statistics. The proposed model, once implemented as computer software, can score English-to-Chinese translations in large-scale examinations. This study built five tentative scoring models with 50, 100, 130, 150 and 180 translations as the training set for 300 translations of an expository writing. The correlation coefficients between the computed scores of these models and human-assigned scores were above 0.8. The results further indicated that the computed scores with 130 training translations were closest to human-assigned scores. Therefore, it was concluded that the text features extracted in this research were effective and the finalized model can produce reliable scores for Chinese EFL learners’ English-to-Chinese expository translations.

APA, Harvard, Vancouver, ISO, and other styles

32

Törcsvári, Attila. "Translation memory and computer assisted translation tool for medieval texts." Acta Periodica Duellatorum 2015, no. 2 (October 1, 2015): 28–46. http://dx.doi.org/10.1515/apd-2015-0008.

Full text

Abstract:

Abstract Translation memories (TMs), as part of Computer Assisted Translation (CAT) tools, support translators reusing portions of formerly translated text. Fencing books are good candidates for using TMs due to the high number of repeated terms. Medieval texts suffer a number of drawbacks that make hard even “simple” rewording to the modern version of the same language. The analyzed difficulties are: lack of systematic spelling, unusual word orders and typos in the original. A hypothesis is made and verified that even simple modernization increases legibility and it is feasible, also it is worthwhile to apply translation memories due to the numerous and even extremely long repeated terms. Therefore, methods and algorithms are presented 1. for automated transcription of medieval texts (when a limited training set is available), and 2. collection of repeated patterns. The efficiency of the algorithms is analyzed for recall and precision.

APA, Harvard, Vancouver, ISO, and other styles

33

Warburton, Kara. "Processing terminology for the translation pipeline." Terminology 19, no. 1 (April 29, 2013): 93–111. http://dx.doi.org/10.1075/term.19.1.04war.

Full text

Abstract:

Companies must translate their content if they want to operate multinationally. Both quality and speed of translation are key factors in determining market share in the target market. Proactively managing terminology, including pre-translating key terms for a translation project, has beneficial effects on these factors. However, in commercial environments, the volumes of content and subsequently of the required terminology are typically large. Therefore, integrating terminology into the translation pipeline requires a process that is as automated as possible. Term extraction is the cornerstone of this process, but to maximize efficiency it requires a post-processing strategy that repurposes existing lexical resources. Terms extracted from corpora and subsequently translated should be channeled into the company termbase so that they can be leveraged for other purposes. These and other effective practices for processing extracted terms are discussed, based on the author’s experiences in one large company.

APA, Harvard, Vancouver, ISO, and other styles

34

Lagutina, Nadezhda Stanislavovna, Ksenia Vladimirovna Lagutina, Elena Igorevna Boychuk, Inna Alekseevna Vorontsova, and Ilya Vyacheslavovich Paramonov. "Automated Search of Rhythm Figures in a Literary Text for Comparative Analysis of Originals and Translations Based on the Material of the English and Russian Languages." Modeling and Analysis of Information Systems 26, no. 3 (September 28, 2019): 420–40. http://dx.doi.org/10.18255/1818-1015-2019-3-420-440.

Full text

Abstract:

Analysis of the functional equivalence of an original text and its translation based on the achievement of rhythm equivalence is an extremely important task of modern linguistics. Moreover, the rhythm component is an integral part of functional equivalence that cannot be achieved without communication of rhythm figures of the text. To analyze rhythm figures in an original literary text and its translation, the authors developed the ProseRhythmDetector software tool that allows to find and visualize lexical and syntactic figures in English- and Russian-language prose texts: anaphora, epiphora, symploce, anadiplosis, epanalepsis, reduplication, epistrophe, polysyndeton, and aposiopesis. The goal of this work is to present the results of ProseRhythmDetector testing on two works by English authors and their translations into Russian: Ch. Bronte “Villette” and I. Murdoch “The Black Prince”. Basing on the results of the tool, the authors compared rhythm figures in an original text and its translation both in aspects of the rhythm and their contexts. This experiment made it possible to identify how the features of the author’s style are communicated by the translator, to detect and explain cases of mismatch of rhythm figures in the original and translated texts. The application of the ProseRhythm-Detector software tool made it possible to significantly reduce the amount of linguistsexperts work by automated detection of lexical and syntactic figures with quite high precision (from 62 % to 93 %) for various rhythm figures.

APA, Harvard, Vancouver, ISO, and other styles

35

Kuzmin, Oleg. "Modern Machine Translation Systems: Trends and Prospects." Nizhny Novgorod Linguistics University Bulletin, no. 53 (March 31, 2021): 41–52. http://dx.doi.org/10.47388/2072-3490/lunn2021-53-1-41-52.

Full text

Abstract:

The modern world is moving towards global digitalization and accelerated software development with a clear tendency to replace human resources by digital services or programs that imitate the doing of similar tasks. There is no doubt that, long term, the use of such technologies has economic benefits for enterprises and companies. Despite this, however, the quality of the final result is often less than satisfactory, and machine translation systems are no exception, as editing of texts translated by using online translation services is still a demanding task. At the moment, producing high-quality translations using only machine translation systems remains impossible for multiple reasons, the main of which lies in the mysteries of natural language: the existence of sublanguages, abstract words, polysemy, etc. Since improving the quality of machine translation systems is one of the priorities of natural language processing (NLP), this article describes current trends in developing modern machine translation systems as well as the latest advances in the field of natural language processing (NLP) and gives suggestions about software innovations that would minimize the number of errors. Even though recent years have seen a significant breakthrough in the speed of information analysis, in all probability, this will not be a priority issue in the future. The main criteria for evaluating the quality of translated texts will be the semantic coherence of these texts and the semantic accuracy of the lexical material used. To improve machine translation systems, we should introduce elements of data differentiation and personalization of information for individual users and their tasks, employing the method of thematic modeling for determining the subject area of a particular text. Currently, there are algorithms based on deep learning that are able to perform these tasks. However, the process of identifying unique lexical units requires a more detailed linguistic description of their semantic features. The parsing methods that will be used in analyzing texts should also provide for the possibility of clustering by sublanguages. Creating automated electronic dictionaries for specific fields of professional knowledge will help improve the quality of machine translation systems. Notably, to date there have been no successful projects of creating dictionaries for machine translation systems for specific sub-languages. Thus, there is a need to develop such dictionaries and to integrate them into existing online translation systems.

APA, Harvard, Vancouver, ISO, and other styles

36

Kamune, Kalyani Pradiprao, and Avinash Agrawal. "Hybrid Model of Automated Anaphora Resolution." IAES International Journal of Artificial Intelligence (IJ-AI) 3, no. 3 (September 1, 2014): 105. http://dx.doi.org/10.11591/ijai.v3.i3.pp105-111.

Full text

Abstract:

Anaphora resolution has proven to be a very difficult problem of natural language processing, and it is useful in discourse analysis, language understanding and processing, information exaction, machine translation and many more. This paper represents a system that instead of using a monolithic architecture for resolving anaphora, use the hybrid model which combines the constraint-based and preferences-based architectures, each uses a different source of knowledge, and proves effective on theoretical and computational basis. An algorithm identifies both inter-sentential and intra-sentential antecedents of “Third person pronoun anaphors”, “Pleonastic it”, and “Lexical noun phrase anaphora”. The algorithm use Charniak parser (parser05Aug16) as an associated tool, and it relays on the output generated by it. Salience measures derived from parse tree, in order to find out accurate antecedents from the list of potential antecedents. We have tested the system extensively on 'Reuters Newspaper corpus'.

APA, Harvard, Vancouver, ISO, and other styles

37

Rastenis, Justinas, Simona Ramanauskaitė, Ivan Suzdalev, Kornelija Tunaitytė, Justinas Janulevičius, and Antanas Čenys. "Multi-Language Spam/Phishing Classification by Email Body Text: Toward Automated Security Incident Investigation." Electronics 10, no. 6 (March 12, 2021): 668. http://dx.doi.org/10.3390/electronics10060668.

Full text

Abstract:

Spamming and phishing are two types of emailing that are annoying and unwanted, differing by the potential threat and impact to the user. Automated classification of these categories can increase the users’ awareness as well as to be used for incident investigation prioritization or automated fact gathering. However, currently there are no scientific papers focusing on email classification concerning these two categories of spam and phishing emails. Therefore this paper presents a solution, based on email message body text automated classification into spam and phishing emails. We apply the proposed solution for email classification, written in three languages: English, Russian, and Lithuanian. As most public email datasets almost exclusively collect English emails, we investigate the suitability of automated dataset translation to adapt it to email classification, written in other languages. Experiments on public dataset usage limitations for a specific organization are executed in this paper to evaluate the need of dataset updates for more accurate classification results.

APA, Harvard, Vancouver, ISO, and other styles

38

Ramón, Noelia, and Camino Gutiérrez-Lanza. "Translation description for assessment and post-editing." Target. International Journal of Translation Studies 30, no. 1 (February 7, 2018): 112–36. http://dx.doi.org/10.1075/target.15098.ram.

Full text

Abstract:

Abstract This paper presents a corpus-based descriptive research procedure for the identification of significant divergences between original Spanish and Spanish translated from English. When considering the language pair English-Spanish, personal pronouns seem to be good markers of significant differences (anchor phenomena), since they must obligatorily occur in English, but not in Spanish. To test this hypothesis, empirical data have been extracted from a large reference corpus in Spanish (CREA) and from an English-Spanish parallel corpus (P-ACTRES), in both cases from the fiction subcorpora. Statistically significant differences have been found in some of the uses of personal pronouns, having textual and pragmatic implications in the target texts. The aim is to use the results obtained in the case of personal pronouns, together with results from other linguistic areas, to build a semi-automated tool for the post-editing of Spanish translations of texts written originally in English.

APA, Harvard, Vancouver, ISO, and other styles

39

Lavie, Alon, and Mark Przybocki. "Introduction to the special issue on “Automated Metrics for Machine Translation Evaluation”." Machine Translation 23, no. 2-3 (September 2009): 69–70. http://dx.doi.org/10.1007/s10590-010-9071-8.

Full text

APA, Harvard, Vancouver, ISO, and other styles

40

Smirnova, V. N. "Scientific and technical translation training in the conditions of electronic information and educational environment in the construction university." Open Education 23, no. 2 (May 14, 2019): 4–13. http://dx.doi.org/10.21686/1818-4243-2019-2-4-13.

Full text

Abstract:

Purpose of the study. Distance education is one of the possibilities for the successful realization of the didactic potential of information and communication technologies in teaching foreign languages. In the situation of transition of a higher school to a competence-oriented system of education, mastering scientific and technical translation by students of non-linguistic faculties within the framework of an electronic information and educational environment is of particular relevance. The ability to correctly interpret an authentic text testifies not only to sufficient translation training, but also the ability to use electronic educational resources based on the student’s remote access to information referral systems and teaching materials. In this regard, the purpose of this study was to determine the capabilities of remote technologies in teaching scientific and technical translation and assessing the quality of its implementation.Materials and methods. Standardizing the definition of types of semantic distortions on the material of educational translations prepared in the framework of the electronic informational and educational environment of a building university allowed to evaluate the effectiveness of this training format, to develop guidelines for overcoming gaps in the basic interlingua training of future engineers and served as a criterion for evaluating in the process of attestation to master the discipline “Foreign language”. In the process of the study descriptive, comparative, categorical and statistical methods were used.Results. The article describes the modular organization of the electronic information and educational environment in a building university, which implies the inclusion of several interactive platforms in its composition with an indication of their capabilities in teaching scientific and technical translation. The theoretical concepts of understanding the semantic violation in the interlingua text transformation as an unreasonable deviation from the normative requirement of meaningful correspondence of the translation to the original are presented.Comparative analysis allowed us to identify two main categories of translation violations: (I) reducing the accuracy of the semantic content of the text and (II) distorting the semantic content of the original in the form of unjustified omission, addition and replacement. Practical assessment of violations of scientific and technical translation allowed describing the quantitative ratio of categories of errors with a predominance of semantic inaccuracies over semantic distortions, which demonstrates an increased level of formation of students’ linguistic translation competence.Conclusion. The effectiveness of the use of electronic information-educational environment in the daily work of a construction institution has been proved, which allows improving the quality of student learning in the framework of ensuring remote synchronous and asynchronous interaction between participants in the educational process.It was concluded that the future engineering and construction specialists need to focus on choosing the right translation solutions in the process of preparing practical tasks in the electronic information and educational environment.The following guidelines for the effective teaching of translation in non-language faculties in higher education have been developed:denotative correspondence of the translation of engineering and construction texts to the original at the lexical level should be associated with the selection of equivalent units of the translating language reflecting the semantics and mastery of application in the professional sphere;the denotative correspondence of the translation of engineering and construction texts at the syntactic level, understood as the preservation of semantic accents, is determined by the study of various traditions of the communicative progression of the text in English and Russian languages and the establishment of logical links between the syntactic units of the original text;mandatory editing of automated translation is required.

APA, Harvard, Vancouver, ISO, and other styles

41

Atkinson, David P. "Developing psychological skill for the global language industry." Translation Spaces 3 (November 28, 2014): 1–24. http://dx.doi.org/10.1075/ts.3.01atk.

Full text

Abstract:

This article presents a teaching methodology aimed at helping trainee translators and interpreters to increase their psychological skill. Psychological skill is here defined as the effects of self-efficacy, explanatory style, and locus of control, all of which shape their decision-making and courses of action. By increasing their level of psychological skill, I aim to help translators and interpreters better cope with the challenges of an increasingly automated and highly globalized profession. In particular, this article explores three general approaches for teaching and improving psychological skill, namely theoretical discussion, modelling, and role play. I will also describe examples for practical activities that might be carried out in translator/interpreter training. The overall aim of the article is to provide a research-based foundation for teachers and trainers so that they can implement teaching methodologies and strategies to help their translation/interpreting students to graduate with better psychological skill. A particular focus will be on psychological challenges arising from the globalized nature of our profession and the changes brought about by technological developments.

APA, Harvard, Vancouver, ISO, and other styles

42

T.S., Santosh Kumar. "Word Sense Disambiguation Using Semantic Web for Tamil to English Statistical Machine Translation." IRA-International Journal of Technology & Engineering (ISSN 2455-4480) 5, no. 2 (November 26, 2016): 22. http://dx.doi.org/10.21013/jte.v5.n2.p1.

Full text

Abstract:

<div> Machine Translation has been an area of linguistic research for almost more than two decades now. But it still remains a very challenging task for devising an automated system which will deliver accurate translations of the natural languages. However, great strides have been made in this field with more success owing to the development of technologies of the web and off late there is a renewed interest in this area of research. Technological advancements in the preceding two decades have influenced Machine Translation in a considerable way. Several MT approaches including Statistical Machine Translation greatly benefitted from these advancements, basically making use of the availability of extensive corpora. Web technology web3.0 uses the semantic web technology which represents any object or resource in the web both syntactically and semantically. This type of representation is very much useful for the computing systems to search any content on the internet similar to lexical search and improve the internet based translations making it more effective and efficient. In this paper we propose a technique to improve existing statistical Machine Translation methods by making use of semantic web technology. Our focus will be on Tamil and Tamil to English MT. The proposed method could successfully integrate a semantic web technique in the process of WSD which forms part of the MT system. The integration is accomplished by using the capabilities of RDFS and OWL into the WSD component of the MT model. The contribution of this work lies in showing that integrating a Semantic web technique in the WSD system significantly improves the performance of a statistical MT system for a translation from Tamil to English.</div> In this paper we assume the availability of large corpora in Tamil language and specific domain based ontologies with Tamil semantic web technology using web3.0. We are positive on the expansion and development of Tamil semantic web and subsequently infer that Tamil to English MT will greatly improve the disambiguation concept apart from other related benefits. This method could enable the enhancement of translation quality by improving on word sense disambiguation process while text is translated from Tamil to English language. This method can also be extended to other languages such as Hindi and Indian Languages.

APA, Harvard, Vancouver, ISO, and other styles

43

Chiarcos, Christian, Ilya Khait, Émilie Pagé-Perron, Niko Schenk, Jayanth, Christian Fäth, Julius Steuer, William Mcgrath, and Jinyan Wang. "Annotating a Low-Resource Language with LLOD Technology: Sumerian Morphology and Syntax." Information 9, no. 11 (November 19, 2018): 290. http://dx.doi.org/10.3390/info9110290.

Full text

Abstract:

This paper describes work on the morphological and syntactic annotation of Sumerian cuneiform as a model for low resource languages in general. Cuneiform texts are invaluable sources for the study of history, languages, economy, and cultures of Ancient Mesopotamia and its surrounding regions. Assyriology, the discipline dedicated to their study, has vast research potential, but lacks the modern means for computational processing and analysis. Our project, Machine Translation and Automated Analysis of Cuneiform Languages, aims to fill this gap by bringing together corpus data, lexical data, linguistic annotations and object metadata. The project’s main goal is to build a pipeline for machine translation and annotation of Sumerian Ur III administrative texts. The rich and structured data is then to be made accessible in the form of (Linguistic) Linked Open Data (LLOD), which should open them to a larger research community. Our contribution is two-fold: in terms of language technology, our work represents the first attempt to develop an integrative infrastructure for the annotation of morphology and syntax on the basis of RDF technologies and LLOD resources. With respect to Assyriology, we work towards producing the first syntactically annotated corpus of Sumerian.

APA, Harvard, Vancouver, ISO, and other styles

44

Lucas, Christopher, Richard A. Nielsen, Margaret E. Roberts, Brandon M. Stewart, Alex Storer, and Dustin Tingley. "Computer-Assisted Text Analysis for Comparative Politics." Political Analysis 23, no. 2 (2015): 254–77. http://dx.doi.org/10.1093/pan/mpu019.

Full text

Abstract:

Recent advances in research tools for the systematic analysis of textual data are enabling exciting new research throughout the social sciences. For comparative politics, scholars who are often interested in non-English and possibly multilingual textual datasets, these advances may be difficult to access. This article discusses practical issues that arise in the processing, management, translation, and analysis of textual data with a particular focus on how procedures differ across languages. These procedures are combined in two applied examples of automated text analysis using the recently introduced Structural Topic Model. We also show how the model can be used to analyze data that have been translated into a single language via machine translation tools. All the methods we describe here are implemented in open-source software packages available from the authors.

APA, Harvard, Vancouver, ISO, and other styles

45

Bakaev, Ilkhom Izatovich. "The development of stemming algorithm for the Uzbek language." Кибернетика и программирование, no. 1 (January 2021): 1–12. http://dx.doi.org/10.25136/2644-5522.2021.1.35847.

Full text

Abstract:

The automatic processing of unstructured texts in natural languages is one of the relevant problems of computer analysis and text synthesis. Within this problem, the author singles out a task of text normalization, which usually suggests such processes as tokenization, stemming, and lemmatization. The existing stemming algorithms for the most part are oriented towards the synthetic languages with inflectional morphemes. The Uzbek language represents an example of agglutinative language, characterized by polysemanticity of affixal and auxiliary morphemes. Although the Uzbek language largely differs from, for example, English language, it is successfully processed by stemming algorithms. There are virtually no examples of effective implementation of stemming algorithms for the Uzbek language; therefore, this questions is the subject of scientific interest and defines the goal of this work. In the course of this research, the author solved the task of bringing the given texts in the Uzbek language to normal form, which on the preliminary stage were tokenized and cleared of stop words. To author developed the method of normalization of texts in the Uzbek language based on the stemming algorithm. The development of stemming algorithm employed hybrid approach with application of algorithmic method, lexicon of linguistic rules and database of the normal word forms of the Uzbek language. The precision of the proposed algorithm depends on the precision of tokenization algorithm. At the same time, the article did not explore the question of finding the roots of paired words separated by spaces, as this task is solved at the stage of tokenization. The algorithm can be integrated into various automated systems for machine translation, information extraction, data retrieval, etc.

APA, Harvard, Vancouver, ISO, and other styles

46

Mihailov, Mihail, and Hannu Tommola. "Compiling Parallel Text Corpora: Towards Automation of Routine Procedures." Text Corpora and Multilingual Lexicography 6, no. 3 (December 17, 2001): 67–77. http://dx.doi.org/10.1075/ijcl.6.si.07mih.

Full text

Abstract:

The aim of the research project running at the Department of Translation Studies of the University of Tampere is to collect a Russian-Finnish parallel corpus of fiction. The corpus will be equipped with efficient search and analysis tools. The texts of the corpus will be stored as ordinary text files. Each text will be registered in a Microsoft Access database and supplied with a description. Automated parallel concordancing is being developed for the corpus. The program will find the keywords in text A (Russian), then look for possible translation equivalents of the keywords in language B (Finnish), and then search for the portion of text B (Finnish) where most of the keywords in question can be found.

APA, Harvard, Vancouver, ISO, and other styles

47

Alam, Mahabubul, Salam Salloum, and Mohammad Husain. "Improving Network Management by XML to Relational Data Translation." International Journal of Interdisciplinary Telecommunications and Networking 5, no. 3 (July 2013): 34–48. http://dx.doi.org/10.4018/jitn.2013070104.

Full text

Abstract:

Network management is a critical component in both wired and wireless network. In wireless networks, the network dynamics changes rapidly and the network management information needs to be updated frequently. Due to its structured form and ease of usage in communication, eXtensible Markup Language (XML) is preferred as a configuration and logging tool in network management. However, it is convenient to use relational databases such as SQL to store and process data of wireless network management where frequent updates are necessary. In this paper, the authors show an automated method of converting XML documents to store in relational. We have implemented a proof of concept and compared performance to existing approach.

APA, Harvard, Vancouver, ISO, and other styles

48

Burgun, A., L. P. Seka, D. Delamarre, and P. Le Beux. "Automated Coding of Patient Discharge Summaries Using Conceptual Graphs." Methods of Information in Medicine 34, no. 04 (July 1995): 345–51. http://dx.doi.org/10.1055/s-0038-1634611.

Full text

Abstract:

Abstract:In medicine, as in other domains, indexing and classification is a natural human task which is used for information retrieval and representation. In the medical field, encoding of patient discharge summaries is still a manual time-consuming task. This paper describes an automated coding system of patient discharge summaries from the field of coronary diseases into the ICD-9-CM classification. The system is developed in the context of the European AIM MENELAS project, a natural-language understanding system which uses the conceptual-graph formalism. Indexing is performed by using a two-step processing scheme; a first recognition stage is implemented by a matching procedure and a secondary selection stage is made according to the coding priorities. We show the general features of the necessary translation of the classification terms in the conceptual-graph model, and for the coding rules compliance. An advantage of the system is to provide an objective evaluation and assessment procedure for natural-language understanding.

APA, Harvard, Vancouver, ISO, and other styles

49

Musthofa, Musthofa. "COMPUTATIONAL LINGUISTICS (Model Baru Kajian Linguistik dalam Perspektif Komputer)." Adabiyyāt: Jurnal Bahasa dan Sastra 9, no. 2 (December 31, 2010): 247. http://dx.doi.org/10.14421/ajbs.2010.09203.

Full text

Abstract:

This paper describes a new discipline in applied linguistics studies, computational linguistics. It’s a new model of applied linguistics which is influenced by computer technology. Computational linguistics is a discipline straddling applied linguistics and computer science that is concerned with the computer processing of natural languages on all levels of linguistic description. Traditionally, computational linguistics was usually performed by computer scientists who had specialized in the application of computers to the processing of a natural language. Computational linguists often work as members of interdisciplinary teams, including linguists (specifically trained in linguistics), language experts (persons with some level of ability in the languages relevant to a given project), and computer scientists. The several areas of computational linguistics study encompasses such practical applications as speech recognition systems, speech synthesis, automated voice response systems, web search engines, text editors, grammar checking, text to speech, corpus linguistics, machine translation, text data mining, and others. This paper presents the definition of computational linguistics, relation between language and computer, and area of computational linguistics studies.

APA, Harvard, Vancouver, ISO, and other styles

50

Hopkins-Rossabi, Theresa, Mickey Rowe, Katlyn McGrattan, Sam Rossabi, and Bonnie Martin-Harris. "Respiratory–Swallow Training Methods: Accuracy of Automated Detection of Swallow Onset, Respiratory Phase, Lung Volume at Swallow Onset, and Real-Time Performance Feedback Tested in Healthy Adults." American Journal of Speech-Language Pathology 29, no. 2S (July 10, 2020): 1012–21. http://dx.doi.org/10.1044/2020_ajslp-19-00201.

Full text

Abstract:

Background Preliminary studies have shown that respiratory–swallow training (RST) is a successful treatment for oropharyngeal head and neck cancer patients with refractory dysphagia. Refining the RST protocol with automated analysis software to provide real-time performance feedback has the potential to improve accessibility, reproducibility, and translation to diverse clinical settings. Method An automated software program for data acquisition and analysis developed to detect swallows, determine respiratory phase, calculate lung volume at the onset of the swallow, and provide real-time performance feedback was tested for feasibility in a small cohort of healthy adults. Outcome Measures Percent difference in swallow detection and accuracy of real-time performance feedback of respiratory phase and lung volume at swallowing onset between the automated software and the manual gold standard method were determined. Results The automated software program accurately detected the onset of the swallow on 91% of the swallows completed during the training trials. Feedback of respiratory phase and lung volume was accurate on 94% of the trials in which the swallow was accurately detected. Conclusions This novel, automated, and real-time RST software successfully detected the onset of the swallow, respiratory phase, and lung volume at swallow onset and provided appropriate real-time performance feedback with a high degree of accuracy in healthy adults. The software has the potential to improve the accessibility, efficiency, and translation of RST to diverse patient populations.

APA, Harvard, Vancouver, ISO, and other styles

Journal articles on the topic 'Automated language translation'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles