Dissertations / Theses on the topic 'Medical Text'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Medical Text.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Drusiani, Alberto. "Deep Learning Text Classification for Medical Diagnosis." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/17281/.
Full textSætre, Rune. "GeneTUC: Natural Language Understanding in Medical Text." Doctoral thesis, Norwegian University of Science and Technology, Department of Computer and Information Science, 2006. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-545.
Full textNatural Language Understanding (NLU) is a 50 years old research field, but its application to molecular biology literature (BioNLU) is a less than 10 years old field. After the complete human genome sequence was published by Human Genome Project and Celera in 2001, there has been an explosion of research, shifting the NLU focus from domains like news articles to the domain of molecular biology and medical literature. BioNLU is needed, since there are almost 2000 new articles published and indexed every day, and the biologists need to know about existing knowledge regarding their own research. So far, BioNLU results are not as good as in other NLU domains, so more research is needed to solve the challenges of creating useful NLU applications for the biologists.
The work in this PhD thesis is a “proof of concept”. It is the first to show that an existing Question Answering (QA) system can be successfully applied in the hard BioNLU domain, after the essential challenge of unknown entities is solved. The core contribution is a system that discovers and classifies unknown entities and relations between them automatically. The World Wide Web (through Google) is used as the main resource, and the performance is almost as good as other named entity extraction systems, but the advantage of this approach is that it is much simpler and requires less manual labor than any of the other comparable systems.
The first paper in this collection gives an overview of the field of NLU and shows how the Information Extraction (IE) problem can be formulated with Local Grammars. The second paper uses Machine Learning to automatically recognize protein name based on features from the GSearch Engine. In the third paper, GSearch is substituted with Google, and the task in this paper is to extract all unknown names belonging to one of 273 biomedical entity classes, like genes, proteins, processes etc. After getting promising results with Google, the fourth paper shows that this approach can also be used to retrieve interactions or relationships between the named entities. The fifth paper describes an online implementation of the system, and shows that the method scales well to a larger set of entities.
The final paper concludes the “proof of concept” research, and shows that the performance of the original GeneTUC NLU system has increased from handling 10% of the sentences in a large collection of abstracts in 2001, to 50% in 2006. This is still not good enough to create a commercial system, but it is believed that another 40% performance gain can be achieved by importing more verb templates into GeneTUC, just like nouns were imported during this work. Work has already begun on this, in the form of a local Masters Thesis.
Neamatullah, Ishna. "Automated de-identification of free-text medical records." Thesis, Massachusetts Institute of Technology, 2006. http://hdl.handle.net/1721.1/41622.
Full textIncludes bibliographical references (p. 62-64).
This paper presents a de-identification study at the Harvard-MIT Division of Health Science and Technology (HST) to automatically de-identify confidential patient information from text medical records used in intensive care units (ICUs). Patient records are a vital resource in medical research. Before such records can be made available for research studies, protected health information (PHI) must be thoroughly scrubbed according to HIPAA specifications to preserve patient confidentiality. Manual de-identification on large databases tends to be prohibitively expensive, time-consuming and prone to error, making a computerized algorithm an urgent need for large-scale de-identification purposes. We have developed an automated pattern-matching deidentification algorithm that uses medical and hospital-specific information. The current version of the algorithm has an overall sensitivity of around 0.87 and an approximate positive predictive value of 0.63. In terms of sensitivity, it performs significantly better than 1 person (0.81) but not quite as well as a consensus of 2 human de-identifiers (0.94). The algorithm will be published as open-source software, and the de-identified medical records will be incorporated into HST's Multi-Parameter Intelligent Monitoring for Intensive Care (MIMIC II) physiologic database.
by Ishna Neamatullah.
M.Eng.
Savkov, Aleksandar Dimitrov. "Deciphering clinical text : concept recognition in primary care text notes." Thesis, University of Sussex, 2017. http://sro.sussex.ac.uk/id/eprint/68232/.
Full textShu, Jennifer (Jennifer J. ). "Free text phrase encoding and information extraction from medical notes." Thesis, Massachusetts Institute of Technology, 2005. http://hdl.handle.net/1721.1/37064.
Full textIncludes bibliographical references (p. 87-90).
The Laboratory for Computational Physiology is collecting a large database of patient signals and clinical data from critically ill patients in hospital intensive care units (ICUs). The data will be used as a research resource to support the development of an advanced patient monitoring system for ICUs. Important pathophysiologic events in the patient data streams must be recognized and annotated by expert clinicians in order to create a "gold standard" database for training and evaluating automated monitoring systems. Annotating the database requires, among other things, analyzing and extracting important clinical information from textual patient data such as nursing admission and progress notes, and using the data to define and document important clinical events during the patient's ICU stay. Two major text-related annotation issues are addressed in this research. First, the documented clinical events must be described in a standardized vocabulary suitable for machine analysis. Second, an advanced monitoring system would need an automated way to extract meaning from the nursing notes, as part of its decision-making process. The thesis presents and evaluates methods to code significant clinical events into standardized terminology and to automatically extract significant information from free-text medical notes.
by Jennifer Shu.
M.Eng.
Civera, Saiz Jorge. "An evaluation of alternative strategies for clustering genes from medical text." Thesis, Georgia Institute of Technology, 2003. http://hdl.handle.net/1853/8664.
Full textLeong, Elaine. "Medical recipe collections in seventeenth-century England : knowledge, text and gender." Thesis, University of Oxford, 2006. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.432177.
Full textLeroy, Gondy, Hsinchun Chen, Jesse D. Martinez, Shauna Eggers, Ryan R. Falsey, Kerri L. Kislin, Zan Huang, et al. "Genescene: Biomedical Text And Data Mining." Wiley Periodicals, Inc, 2005. http://hdl.handle.net/10150/105791.
Full textTo access the content of digital texts efficiently, it is necessary to provide more sophisticated access than keyword based searching. Genescene provides biomedical researchers with research findings and background relations automatically extracted from text and experimental data. These provide a more detailed overview of the information available. The extracted relations were evaluated by qualified researchers and are precise. A qualitative ongoing evaluation of the current online interface indicates that this method to search the literature is more useful and efficient than keyword based searching.
Finch, Dezon K. "TagLine: Information Extraction for Semi-Structured Text Elements In Medical Progress Notes." Scholar Commons, 2012. http://scholarcommons.usf.edu/etd/4321.
Full textAlanazi, Saad. "A Named Entity Recognition system applied to Arabic text in the medical domain." Thesis, Staffordshire University, 2017. http://eprints.staffs.ac.uk/3129/.
Full textFrunza, Oana Magdalena. "Personalized Medicine through Automatic Extraction of Information from Medical Texts." Thèse, Université d'Ottawa / University of Ottawa, 2012. http://hdl.handle.net/10393/22724.
Full textNääs, Johanna, and Emma Thurfjell. "Språklig komplexitet och narrativ struktur i skriven text : En jämförelse mellan elever med och utan lässvårigheter." Thesis, Umeå universitet, Logopedi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-148330.
Full textLässvårigheter, språklig förmåga och skolresultat i tidiga skolår
Rios, Anthony. "Deep Neural Networks for Multi-Label Text Classification: Application to Coding Electronic Medical Records." UKnowledge, 2018. https://uknowledge.uky.edu/cs_etds/71.
Full textNicholls, Joseph Anthony. "Text search : information-seeking strategies using paper and CD-ROM versions of a medical textbook." Thesis, University of Leeds, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.343634.
Full textBhooshan, Neha. "Classification of semantic relations in different syntactic structures in medical text using the MeSH hierarchy." Thesis, Massachusetts Institute of Technology, 2005. http://hdl.handle.net/1721.1/33111.
Full textIncludes bibliographical references (leaf 38).
Two different classification algorithms are evaluated in recognizing semantic relationships of different syntactic compounds. The compounds, which include noun- noun, adjective-noun, noun-adjective, noun-verb, and verb-noun, were extracted from a set of doctors' notes using a part of speech tagger and a parser. Each compound was labeled with a semantic relationship, and each word in the compound was mapped to its corresponding entry in the MeSH hierarchy. MeSH includes only medical terminology so it was extended to include everyday, non-medical terms. The two classification algorithms, neural networks and a classification tree, were trained and tested on the data set for each type of syntactic compound. Models representing different levels of MeSH were generated and fed into the neural networks. Both algorithms performed better than random guessing, and the classification tree performed better than the neural networks in predicting the semantic relationship between phrases from their syntactic structure.
by Neha Bhooshan.
M.Eng.
Fioramonte, Amy. "A Study of Pragmatic Competence: International Medical Graduates' and Patients' Negotiation of the Treatment Phase of Medical Encounters." Scholar Commons, 2014. https://scholarcommons.usf.edu/etd/5478.
Full textSethi, Iccha. "Clinician Decision Support Dashboard: Extracting value from Electronic Medical Records." Thesis, Virginia Tech, 2012. http://hdl.handle.net/10919/41894.
Full textMaster of Science
Christiansen, Ammon J. "Finding Relevant PDF Medical Journal Articles by the Content of Their Figures as well as Their Text." BYU ScholarsArchive, 2007. https://scholarsarchive.byu.edu/etd/872.
Full textHozayen, Ghada. "A study of the discussion section of the medical research article : a genre based approach to text analysis." Thesis, University of Birmingham, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.250267.
Full textHellström, Karlsson Rebecca. "Aiding Remote Diagnosis with Text Mining." Thesis, KTH, Människa och Kommunikation, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-215760.
Full textÄmnet för detta examensarbete är hur text mining kan användas på patientrapporterade symptombeskrivningar, och hur det kan användas för att hjälpa läkare att utföra den diagnostiska processen. Sjukvården har idag svårigheter med att leverera vård till avlägsna orter, och vårdkostnader ökar i och med en åldrande population. Idag är det okänt hur text mining skulle kunna hjälpa doktorer i sitt arbete. Att undersöka om läkare blir hjälpta av att presenteras med mer information, baserat på vad patienter som skriver liknande saker som deras nuvarande patient gör, kan vara relevant för flera olika områden av sjukvården. Text mining har potential att förbättra vårdkvaliten för patienter med låg tillgänglighet till vård, till exempel på grund av avstånd. I detta arbete representerades patienttexter med en Bag-of-Words modell, och klustrades med en k-means algoritm. Den slutgiltiga klustringsmodellen använde sig av 41 kluster, och de tio viktigaste orden för klustercentroider användes för att representera respektive kluster. Därefter genomfördes ett experiment för att se om och hur läkare blev behjälpta i sin diagnostiska process, om patienters texter presenterades med de tio orden från de kluster som texterna hörde till. Resultaten från experimentet var att orden hjälpte läkarna i de mer komplicerade patientfallen, och att klustringsalgoritmen skulle kunna användas för att ställa specifika följdfrågor till patienter.
Ellis, Olga. "Voice vs. Text Chats: Their Efficacy for Learning Probing Questions by Non-Native Speaking Medical Professionals in Online Courses." Diss., The University of Arizona, 2012. http://hdl.handle.net/10150/228635.
Full textJabbour, Georgette. "Corpus linguistics, contextual collocation and ESP syllabus creation : a text-analysis approach to the study of medical research articles." Thesis, University of Birmingham, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.342109.
Full textIsenius, Niklas, Sumithra Velupillai, and Maria Kvist. "Initial Results in the Development of SCAN : a Swedish Clinical Abbreviation Normalizer." Stockholms universitet, Institutionen för data- och systemvetenskap, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-82245.
Full textHåkansson, Susanne. "Health and Place : Terminology, proper nouns and titles of cited publications in the translation of a text on medical geology." Thesis, Linnaeus University, School of Language and Literature, 2010. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-8227.
Full textThis essay deals with some of the difficulties that translation of a technical text may present, more specifically the handling of terminology, proper nouns and titles of cited publications. For this purpose, a text dealing with medical geology, taken from Essentials of Medical Geology (Selinus et al., 2005), was translated and analysed.
Medical geology is an interdisciplinary science and hence contains terminology from several different scientific areas. The present study includes terminology within the field of medicine and geochemistry in the analysis. The preferred and predominant translation procedure was literal translation (Munday, 2001:57). Many source text terms have synonyms in the target language. With the intention to preserve and transfer the level of technical style into the target text, terms were analysed and classified as belonging to one of three levels of technical style: academic, professional and popular (Newmark, 1988:151). The handling of proper nouns connected to medicine and geology was also included in the analysis. One common procedure is to use a translation which is established in the target language. The present study discusses the strategies used when no such established translation was found. The procedure of using a recognised translation was discussed in connection to the handling of titles of cited publications referred to in the source text.
Bustos, Aurelia. "Extraction of medical knowledge from clinical reports and chest x-rays using machine learning techniques." Doctoral thesis, Universidad de Alicante, 2019. http://hdl.handle.net/10045/102193.
Full textDaidoji, Keiko. "What a household with sick persons should know : expressions of body and illness in a medical text of early nineteenth-century Japan." Thesis, SOAS, University of London, 2009. http://eprints.soas.ac.uk/29267/.
Full textWilliams, Laura Elizabeth. "Painful transformations : a medical approach to experience, life cycle and text in British Library, Additional MS 61823, 'The Book of Margery Kempe'." Thesis, University of Exeter, 2016. http://hdl.handle.net/10871/24288.
Full textAssefa, Shimelis G. "Human concept cognition and semantic relations in the unified medical language system: A coherence analysis." Thesis, University of North Texas, 2007. https://digital.library.unt.edu/ark:/67531/metadc4008/.
Full textWalker, Briana Shanise. "Rethinking Document Classification: A Pilot for the Application of Text Mining Techniques To Enhance Standardized Assessment Protocols for Critical Care Medical Team Transfer of Care." Case Western Reserve University School of Graduate Studies / OhioLINK, 2017. http://rave.ohiolink.edu/etdc/view?acc_num=case1496760037827537.
Full textStrallhofer, Daniel, and Jonatan Ahlqvist. "Classifying Urgency : A Study in Machine Learning for Classifying the Level of Medical Emergency of an Animal’s Situation." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-231476.
Full textDenna studie utforskar användandet av Naive Bayes samt Linear Support Vector Machines för att klassificera en text på en medicinsk skala. Den huvudsakliga datamängden som kommer att användas för att göra detta är kundinformation från en online veterinär. Aspekter som utforskas är om en enda text kan innehålla tillräckligt med information för att göra ett medicinskt beslut och om det finns alternativa metoder för att samla in mer anpassade datamängder i framtiden. Tidigare studier har bevisat att både Naive Bayes och SVMs ofta kan nå väldigt bra resultat. Vi visar hur man kan optimera resultat för att främja framtida studier. Optimala metoder för att samla in datamängder diskuteras som en del av optimeringsprocessen. Slutligen utforskas även de affärsmässiga aspekterna utigenom implementationen av ett datalogiskt system och hur detta kommer påverka kundflödet, goodwill, intäkter/kostnader och konkurrenskraft.
Schnackenberg, Andrew K. "Symbolizing Institutional Change: Media Representations and Legality in the Payday Loan and Medical Marijuana Industries." Case Western Reserve University School of Graduate Studies / OhioLINK, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=case1405090956.
Full textLindell, Klara. "A standard case of subtitling. : A comparative analysis of the subtitling of Scrubs and House M.D. with a focus on medical terminology." Thesis, Stockholms universitet, Engelska institutionen, 2010. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-78325.
Full textSamuel, Jarvie John. "Elicitation of Protein-Protein Interactions from Biomedical Literature Using Association Rule Discovery." Thesis, University of North Texas, 2010. https://digital.library.unt.edu/ark:/67531/metadc30508/.
Full textColepícolo, Eliane [UNIFESP]. "Epistemologia da Informática em Saúde: entre a teoria e a prática." Universidade Federal de São Paulo (UNIFESP), 2008. http://repositorio.unifesp.br/handle/11600/9468.
Full textEpistemologia da Informática em Saúde: entre a teoria e a prática. Eliane Colepí-colo. 2008. CONTEXTO. O objetivo dessa pesquisa é compreender a epistemologia da área de Informática em Saúde (IS) por meio de um estudo comparativo entre aspectos teóricos e práticos desta disciplina. MATERIAIS E MÉTODOS. O estudo foi dividido em 3 eta-pas: estudo estatístico, estudo terminológico e estudo epistemológico. O estudo esta-tístico envolveu o desenvolvimento e uso de robô para extração de metadados de arti-gos científicos da base PubMed, assim como a mineração de textos destes resumos de artigos, utilizados para estatísticas e análise posterior. O estudo terminológico visou o desenvolvimento de um tesauro especializado em IS, aqui denominado EpistemIS, que, integrado ao MeSH, serviu como base ao estudo estatístico. O estudo epistemo-lógico começou com o estudo dos metaconceitos da ação e pensamento humanos (MAPHs), que são arte, técnica, ciência, tecnologia e tecnociência. A seguir, realizou-se o desenvolvimento de um método epistemológico, baseado nas obras de Mário Bunge, para classificação epistemológica de conceitos da área provenientes do tesau-ro EpistemIS. Uma pesquisa de opinião com a comunidade científica da área foi reali-zada por meio de questionário na web. RESULTADOS. Obteve-se: uma caracteriza-ção dos MAPHs, mapas de sistematização do conhecimento em IS, classificações epistemológica e em MAPHs da IS, um mapa do conhecimento em IS e o consenso da comunidade sobre a epistemologia da IS. Por fim, foram calculadas estatísticas relati-vas: às classificações epistemológica e em MAPHs em IS, à integração entre o corpus de análise (437.289 artigos PubMed) e o tesauro EpistemIS. CONCLUSÃO. A partir de argumentos teóricos e práticos concluiu-se que a Informática em Saúde é uma tecno-ciência que se ocupa de solucionar problemas relativos aos domínios das Ciências da Vida, Ciências da Saúde e do Cuidado em Saúde, por meio da pesquisa científica in-terdisciplinar e do desenvolvimento de tecnologia para uso na sociedade.
TEDE
Andersson, Kronlid Maja, and Hanna Björklund. "Narrativ förmåga i återberättande hos elever med svag textförståelse i åk 2." Thesis, Umeå universitet, Logopedi, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-136475.
Full textBackground. Much of children’s spontaneous communication is in the form of storytelling. By examining micro- and macrostructures in retelling, different aspects of linguistic ability can be mapped. Children with poor text comprehension show weakness in many linguistic areas and an appropriate analysis can serve as a basis for identifying individuals who need support, as well as guidance for intervention. Aim. The first part of this study investigates micro- and macrostructural differences in narrative retelling between students with and without poor text comprehension. The second part investigates narrative retelling by students identified with poor narrative quality. Methods. In the first part of the study, retellings from second grade students were analyzed at micro- and macrostructural levels. In the second part, a qualitative analysis of the retellings based on these levels was performed. Results. In part 1, no significant differences between the groups were observed. However, students with poor text comprehension tended to have a higher linguistic productivity in their retellings and students with good text comprehension had more complex structures. Part 2, it was found that the students with poor narrative quality tended to summarize the story and exclude parts. Conclusions. Students with language difficulties are not a homogeneous group. Hence, an intervention needs to be adapted to the needs of the student. Further, the results indicate that a short intervention has the potential to equalize possible differences in narrative retelling between students with and without poor text comprehension.
Tidig intensivsatsning i avkodning och läsförståelse
Miller, P. W. "Vocabulary and reading medical texts." Thesis, Swansea University, 2003. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.638194.
Full textCederblad, Gustav. "Finding Synonyms in Medical Texts : Creating a system for automatic synonym extraction from medical texts." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-149643.
Full textNelsson, Mikael. "Deep learning for medical report texts." Thesis, Uppsala universitet, Avdelningen för systemteknik, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-356140.
Full textBigeard, Elise. "Détection et analyse de la non-adhérence médicamenteuse dans les réseaux sociaux." Thesis, Lille 3, 2019. http://www.theses.fr/2019LIL3H026.
Full textDrug non-compliance refers to situations where the patient does not follow instructions from medical authorities when taking medications. Such situations include taking too much (overuse) or too little (underuse) of medications, drinking contraindicated alcohol, or making a suicide attempt using medication. According to [HAYNES 2002] increasing drug compliance may have a bigger impact on public health than any other medical improvements. However non-compliance data are difficult to obtain since non-adherent patients are unlikely to report their behaviour to their healthcare providers. This is why we use data from social media to study drug non-compliance. Our study is applied to French-speaking forums.First we collect a corpus of messages written by users from medical forums. We build vocabularies of medication and disorder names such as used by patients. We use these vocabularies to index medications and disorders in the corpus. Then we use supervised learning and information retrieval methods to detect messages talking about non-compliance. With machine learning, we obtain 0.433 F-mesure, with up to 0.421 precision or 0.610 recall. With information retrieval, we reach 0.8 precision on the first ten results.After that, we study the content of the non-compliance messages. We identify various non-compliance situations and patient's motivations. We identify 3 main motivations: self-medication, seeking an effect besides the effect the medication was prescribed for, or being in addiction or habituation situation. Self-medication is an umbrella for several situations: avoiding an adverse effect, adjusting the medication's effect, underuse a medication seen as useless, taking decisions without a doctor's advice. Non-compliance can also happen thanks to errors or carelessness, without any particular motivation.Our work provides several kinds of result: annotated corpus with non-compliance messages, classifier for the detection of non-compliance messages, typology of non-compliance situations and analysis of the causes of non-compliance
Barbosa, Alexandre Nunes. "Descoberta de conhecimento aplicado à base de dados textual de saúde." Universidade do Vale do Rio dos Sinos, 2012. http://www.repositorio.jesuita.org.br/handle/UNISINOS/4559.
Full textMade available in DSpace on 2015-07-18T12:21:33Z (GMT). No. of bitstreams: 1 42c.pdf: 1016491 bytes, checksum: 407619e0114b592531ee5a68ca0fd0f9 (MD5) Previous issue date: 2012
UNISINOS - Universidade do Vale do Rio dos Sinos
Este trabalho propõe um processo de investigação do conteúdo de uma base de dados, composta por dados descritivos e pré-estruturados do domínio da saúde, mais especificamente da área da Reumatologia. Para a investigação da base de dados, foram compostos 3 conjuntos de interesse. O primeiro composto por uma classe com conteúdo descritivo relativo somente a área da Reumatologia em geral, e outra cujo seu conteúdo pertence a outras áreas da medicina. O segundo e o terceiro conjunto, foram constituídos após análises estatísticas na base de dados. Um formado pelo conteúdo descritivo associado as 5 maiores frequências de códigos CID, e outro formado por conteúdo descritivo associado as 3 maiores frequências de códigos CID relacionados exclusivamente à área da Reumatologia. Estes conjuntos foram pré-processados com técnicas clássicas de Pré-processamento tais como remoção de Stopwords e Stemmer. Com o objetivo de extrair padrões que através de sua interpretação resultem na produção de conhecimento, foram aplicados aos conjuntos de interesse técnicas de classificação e associação, visando à relação entre o conteúdo textual que descreve sintomas de doenças com o conteúdo pré-estruturado, que define o diagnóstico destas doenças. A execução destas técnicas foi realizada através da aplicação do algoritmo de classificação Support Vector Machines e do algoritmo para extração de Regras de Associação Apriori. Para o desenvolvimento deste processo foi pesquisado referencial teórico relativo à mineração de dados, bem como levantamento e estudo de trabalhos científicos produzidos no domínio da mineração textual e relacionados a Prontuário Médico Eletrônico, focando o conteúdo das bases de dados utilizadas, técnicas de pré-processamento e mineração empregados na literatura, bem como os resultados relatados. A técnica de classificação empregada neste trabalho obteve resultados acima de 80% de Acurácia, demonstrando capacidade do algoritmo de rotular dados da saúde relacionados ao domínio de interesse corretamente. Também foram descobertas associações entre conteúdo textual e conteúdo pré-estruturado, que segundo a análise de especialistas, podem conduzir a questionamentos quanto à utilização de determinados CIDs no local de origem dos dados.
This study suggests a process of investigation of the content of a database, comprising descriptive and pre-structured data related to the health domain, more particularly in the area of Rheumatology. For the investigation of the database, three sets of interest were composed. The first one formed by a class of descriptive content related only to the area of Rheumatology in general, and another whose content belongs to other areas of medicine. The second and third sets were constituted after statistical analysis in the database. One of them formed by the descriptive content associated to the five highest frequencies of ICD codes, and another formed by descriptive content associated with the three highest frequencies of ICD codes related exclusively to the area of Rheumatology. These sets were pre-processed with classic Pre-processing techniques such as Stopword Removal and Stemming. In order to extract patterns that, through their interpretation, result in knowledge production, association and classification techniques were applied to the sets of interest, aiming at to relate the textual content that describes symptoms of diseases with pre-structured content, which defines the diagnosis of these diseases. The implementation of these techniques was carried out by applying the classification algorithm Support Vector Machines and the Association Rules Apriori Algorithm. For the development of this process, theoretical references concerning data mining were researched, including selection and review of scientific publications produced on text mining and related to Electronic Medical Record, focusing on the content of the databases used, techniques for pre-processing and mining used in the literature, as well as the reported results. The classification technique used in this study reached over 80% accurate results, demonstrating the capacity the algorithm has to correctly label health data related to the field of interest. Associations between text content and pre-structured content were also found, which, according to expert analysis, may be questioned as for the use of certain ICDs in the place of origin of the data.
Weidow, Rebecka, and Jennie Olsén. "Framtagning av förnyat material för Hörtröskel för tal-test." Thesis, Örebro universitet, Institutionen för hälsovetenskaper, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:oru:diva-85794.
Full textAlbitar, Shereen. "De l'usage de la sémantique dans la classification supervisée de textes : application au domaine médical." Thesis, Aix-Marseille, 2013. http://www.theses.fr/2013AIXM4343/document.
Full textThe main interest of this research is the effect of using semantics in the process of supervised text classification. This effect is evaluated through an experimental study on documents related to the medical domain using the UMLS (Unified Medical Language System) as a semantic resource. This evaluation follows four scenarios involving semantics at different steps of the classification process: the first scenario incorporates the conceptualization step where text is enriched with corresponding concepts from UMLS; both the second and the third scenarios concern enriching vectors that represent text as Bag of Concepts (BOC) with similar concepts; the last scenario considers using semantics during class prediction, where concepts as well as the relations between them are involved in decision making. We test the first scenario using three popular classification techniques: Rocchio, NB and SVM. We choose Rocchio for the other scenarios for its extendibility with semantics. According to experiment, results demonstrated significant improvement in classification performance using conceptualization before indexing. Moderate improvements are reported using conceptualized text representation with semantic enrichment after indexing or with semantic text-to-text semantic similarity measures for prediction
Keightley, Sofia. "Changements syntaxiques, modulations et adaptations dans un texte medical." Thesis, Linnéuniversitetet, Institutionen för språk (SPR), 2013. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-26779.
Full textBlackman, Nicole Jill-Marie. "Criteria for demonstrating the efficacy of a medical test." Thesis, De Montfort University, 2002. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.246528.
Full textChen, Michelle W. M. Eng Massachusetts Institute of Technology. "Comparison of natural language processing algorithms for medical texts." Thesis, Massachusetts Institute of Technology, 2015. http://hdl.handle.net/1721.1/100298.
Full textThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Title as it appears in MIT Commencement Exercises program, June 5, 2015: Comparison of NLP systems for medical text. Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 57-58).
With the large corpora of clinical texts, natural language processing (NLP) is growing to be a field that people are exploring to extract useful patient information. NLP applications in clinical medicine are especially important in domains where the clinical observations are crucial to define and diagnose the disease. There are a variety of different systems that attempt to match words and word phrases to medical terminologies. Because of the differences in annotation datasets and lack of common conventions, many of the systems yield conflicting results. The purpose of this thesis project is (1) to create a visual representation of how different concepts compare to each other when using various annotators and (2) to improve upon the NLP methods to yield terms with better fidelity to what the clinicians are trying to express.
by Michelle W. Chen.
M. Eng.
Canlas, Joel. "Creating software libraries to improve medical device testing of the Pacing System Analyzer (PSA) at St. Jude Medical." DigitalCommons@CalPoly, 2011. https://digitalcommons.calpoly.edu/theses/599.
Full textMays, Patricia Faye. "Seal strength models for medical device trays." [College Station, Tex. : Texas A&M University, 2008. http://hdl.handle.net/1969.1/ETD-TAMU-2756.
Full textRadovanovic, Aleksandar. "Concept Based Knowledge Discovery from Biomedical Literature." Thesis, Online access, 2009. http://etd.uwc.ac.za/usrfiles/modules/etd/docs/etd_gen8Srv25Nme4_9861_1272229462.pdf.
Full textAl-Muhammad, Muhammad. "Patterns of textual cohesion in medical textbook discourse in English and Arabic." Thesis, University of Surrey, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.240160.
Full textKarlsson, Pernilla, and Maria Rytiniemi. "En normeringsstudie av ”The timed water swallow test” (TWST) för personer över 60 år." Thesis, Umeå universitet, Logopedi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-161231.
Full text