To see the other types of publications on this topic, follow the link: Big data storage.

Dissertations / Theses on the topic 'Big data storage'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 dissertations / theses for your research on the topic 'Big data storage.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Grohsschmiedt, Steffen. "Making Big Data Smaller : Reducing the storage requirements for big data with erasure coding for Hadoop." Thesis, KTH, Skolan för informations- och kommunikationsteknik (ICT), 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-177201.

Full text
Abstract:
The amount of data stored in modern data centres is growing rapidly nowadays. Large-scale distributed file systems, that maintain the massive data sets in data centres, are designed to work with commodity hardware. Due to the quality and quantity of the hardware components in such systems, failures are considered normal events and, as such, distributed file systems are designed to be highly fault-tolerant. A common approach to achieve fault tolerance is using redundancy by storing three copies of a file across different storage nodes, thereby increasing the storage requirements by a factor of
APA, Harvard, Vancouver, ISO, and other styles
2

Gong, Yifu. "Intelligent Energy-Efficient Storage System for Big-Data Applications." Diss., North Dakota State University, 2020. https://hdl.handle.net/10365/31752.

Full text
Abstract:
Static Random Access Memory (SRAM) is a critical component in mobile video processing systems. Because of the large video data size, the memory is frequently accessed, which dominates the power consumption and limits battery life. In energy-efficient SRAM design, a substantial amount of research is presented to discuss the mechanisms of approximate storage, but the content and environment adaptations were never a part of the consideration in memory design. This dissertation focuses on optimization methods for the SRAM system, specifically addressing three areas of Intelligent Energy-Efficient
APA, Harvard, Vancouver, ISO, and other styles
3

Jun, Sang-Woo. "Big data analytics made affordable using hardware-accelerated flash storage." Thesis, Massachusetts Institute of Technology, 2018. http://hdl.handle.net/1721.1/118088.

Full text
Abstract:
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.<br>Cataloged from PDF version of thesis.<br>Includes bibliographical references (pages 175-192).<br>Vast amount of data is continuously being collected from sources including social networks, web pages, and sensor networks, and their economic value is dependent on our ability to analyze them in a timely and affordable manner. High performance analytics have traditionally required a machine or a cluster of machines with enough DRAM to accommodate the entire working set, due to
APA, Harvard, Vancouver, ISO, and other styles
4

Stjerna, Albin. "Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System." Thesis, Uppsala universitet, Institutionen för informationsteknologi, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-337638.

Full text
Abstract:
I describe in this report an experimental system for using classification and regression trees to generate predictions of disk failures in a NetApp-based storage system at the European Organisation for Nuclear Research (CERN) based on a mixture of SMART data, system logs, and low-level system performance dataparticular to NetApp's storage solutions. Additionally, I make an attempt at profiling the system's built-in failure prediction method, and compiling statistics on historical complete-disk failures as well as bad blocks developed. Finally, I experiment with various parameters for producing
APA, Harvard, Vancouver, ISO, and other styles
5

Kroll, Lars. "Load Balancing in a Distributed Storage System for Big and Small Data." Thesis, KTH, Skolan för informations- och kommunikationsteknik (ICT), 2013. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-129304.

Full text
Abstract:
Distributed storage services form the backbone of modern large-scale applications and data processing solutions. In this integral role they have to provide a scalable, reliable and performant service. One of the major challenges any distributed storage system has to address is skew in the data load, which can either be in the distribution of data items or data access over the nodes in the system. One widespread approach to deal with skewed load is data assignment based on uniform consistent hashing. However, there is an opposing desire to optimise and exploit data-locality. That is to say, it
APA, Harvard, Vancouver, ISO, and other styles
6

Xu, Yiqi. "Storage Management of Data-intensive Computing Systems." FIU Digital Commons, 2016. http://digitalcommons.fiu.edu/etd/2474.

Full text
Abstract:
Computing systems are becoming increasingly data-intensive because of the explosion of data and the needs for processing the data, and storage management is critical to application performance in such data-intensive computing systems. However, existing resource management frameworks in these systems lack the support for storage management, which causes unpredictable performance degradations when applications are under I/O contention. Storage management of data-intensive systems is a challenging problem because I/O resources cannot be easily partitioned and distributed storage systems require s
APA, Harvard, Vancouver, ISO, and other styles
7

Ikken, Sonia. "Efficient placement design and storage cost saving for big data workflow in cloud datacenters." Thesis, Evry, Institut national des télécommunications, 2017. http://www.theses.fr/2017TELE0020/document.

Full text
Abstract:
Les workflows sont des systèmes typiques traitant le big data. Ces systèmes sont déployés sur des sites géo-distribués pour exploiter des infrastructures cloud existantes et réaliser des expériences à grande échelle. Les données générées par de telles expériences sont considérables et stockées à plusieurs endroits pour être réutilisées. En effet, les systèmes workflow sont composés de tâches collaboratives, présentant de nouveaux besoins en terme de dépendance et d'échange de données intermédiaires pour leur traitement. Cela entraîne de nouveaux problèmes lors de la sélection de données distri
APA, Harvard, Vancouver, ISO, and other styles
8

Ikken, Sonia. "Efficient placement design and storage cost saving for big data workflow in cloud datacenters." Electronic Thesis or Diss., Evry, Institut national des télécommunications, 2017. http://www.theses.fr/2017TELE0020.

Full text
Abstract:
Les workflows sont des systèmes typiques traitant le big data. Ces systèmes sont déployés sur des sites géo-distribués pour exploiter des infrastructures cloud existantes et réaliser des expériences à grande échelle. Les données générées par de telles expériences sont considérables et stockées à plusieurs endroits pour être réutilisées. En effet, les systèmes workflow sont composés de tâches collaboratives, présentant de nouveaux besoins en terme de dépendance et d'échange de données intermédiaires pour leur traitement. Cela entraîne de nouveaux problèmes lors de la sélection de données distri
APA, Harvard, Vancouver, ISO, and other styles
9

Chihoub, Houssem Eddine. "Managing consistency for big data applications : tradeoffs and self-adaptiveness." Thesis, Cachan, Ecole normale supérieure, 2013. http://www.theses.fr/2013DENS0059/document.

Full text
Abstract:
Dans l’ère de Big Data, les applications intensives en données gèrent des volumes de données extrêmement grand. De plus, ils ont besoin de temps de traitement rapide. Une grande partie de ces applications sont déployées sur des infrastructures cloud. Ceci est afin de bénéficier de l’élasticité des clouds, les déploiements sur demande et les coûts réduits strictement relatifs à l’usage. Dans ce contexte, la réplication est un moyen essentiel dans le cloud afin de surmonter les défis de Big Data. En effet, la réplication fournit les moyens pour assurer la disponibilité des données à travers de n
APA, Harvard, Vancouver, ISO, and other styles
10

Marcu, Ovidiu-Cristian. "KerA : Un Système Unifié d'Ingestion et de Stockage pour le Traitement Efficace du Big Data : Un Système Unifié d'Ingestion et de Stockage pour le Traitement Efficace du Big Data." Thesis, Rennes, INSA, 2018. http://www.theses.fr/2018ISAR0028/document.

Full text
Abstract:
Le Big Data est maintenant la nouvelle ressource naturelle. Les architectures actuelles des environnements d'analyse des données massives sont constituées de trois couches: les flux de données sont acquis par la couche d’ingestion (e.g., Kafka) pour ensuite circuler à travers la couche de traitement (e.g., Flink) qui s’appuie sur la couche de stockage (e.g., HDFS) pour stocker des données agrégées ou pour archiver les flux pour un traitement ultérieur. Malheureusement, malgré les bénéfices potentiels apportés par les couches spécialisées (e.g., une mise en oeuvre simplifiée), déplacer des quan
APA, Harvard, Vancouver, ISO, and other styles
11

Giannini, Andrea. "Social Network Analysis: Architettura Streaming Big Data di Raccolta e Analisi Dati da Twitter." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2022. http://amslaurea.unibo.it/25378/.

Full text
Abstract:
Negli ultimi anni i social media, come ad esempio Facebook, Twitter, WhatsApp, YouTube, si sono diffusi a macchia d'olio. Ormai quasi tutti accedono giornalmente su almeno uno di questi per informarsi, esprimere opinioni e interagire con altri utenti. Per questa ragione sono diventati fondamentali per i reparti marketing delle aziende essendo non solo un ottimo canale di comunicazione, ma anche una fonte di informazioni sui clienti e potenziali tali. La tesi si focalizza proprio su quest'ultimo aspetto. Il progetto Social Network Analysis (SNA) vuole essere infatti uno strumento attravers
APA, Harvard, Vancouver, ISO, and other styles
12

Schintler, Laurie A., and Manfred M. Fischer. "The Analysis of Big Data on Cites and Regions - Some Computational and Statistical Challenges." WU Vienna University of Economics and Business, 2018. http://epub.wu.ac.at/6637/1/2018%2D10%2D28_Big_Data_on_cities_and_regions_untrack_changes.pdf.

Full text
Abstract:
Big Data on cities and regions bring new opportunities and challenges to data analysts and city planners. On the one side, they hold great promise to combine increasingly detailed data for each citizen with critical infrastructures to plan, govern and manage cities and regions, improve their sustainability, optimize processes and maximize the provision of public and private services. On the other side, the massive sample size and high-dimensionality of Big Data and their geo-temporal character introduce unique computational and statistical challenges. This chapter provides overviews on the sal
APA, Harvard, Vancouver, ISO, and other styles
13

Sivasubramaniam, Ravishankar. "Performance Evaluation of LINQ to HPC and Hadoop for Big Data." UNF Digital Commons, 2013. http://digitalcommons.unf.edu/etd/463.

Full text
Abstract:
There is currently considerable enthusiasm around the MapReduce paradigm, and the distributed computing paradigm for analysis of large volumes of data. The Apache Hadoop is the most popular open source implementation of MapReduce model and LINQ to HPC is Microsoft's alternative to open source Hadoop. In this thesis, the performance of LINQ to HPC and Hadoop are compared using different benchmarks. To this end, we identified four benchmarks (Grep, Word Count, Read and Write) that we have run on LINQ to HPC as well as on Hadoop. For each benchmark, we measured each system’s performance metrics (
APA, Harvard, Vancouver, ISO, and other styles
14

Zhang, Yi-Fan. "Data distribution and task scheduling for distributed computing of all-to-all comparison problems." Thesis, Queensland University of Technology, 2016. https://eprints.qut.edu.au/92604/1/Yi-fan_Zhang_Thesis.pdf.

Full text
Abstract:
This research studied distributed computing of all-to-all comparison problems with big data sets. The thesis formalised the problem, and developed a high-performance and scalable computing framework with a programming model, data distribution strategies and task scheduling policies to solve the problem. The study considered storage usage, data locality and load balancing for performance improvement in solving the problem. The research outcomes can be applied in bioinformatics, biometrics and data mining and other domains in which all-to-all comparisons are a typical computing pattern.
APA, Harvard, Vancouver, ISO, and other styles
15

Munir, Rana Faisal. "Storage format selection and optimization for materialized intermediate results in data-intensive flows." Doctoral thesis, Universitat Politècnica de Catalunya, 2019. http://hdl.handle.net/10803/668476.

Full text
Abstract:
Modern organizations produce and collect large volumes of data, that need to be processed repeatedly and quickly for gaining business insights. For such processing, typically, Data-intensive Flows (DIFs) are deployed on distributed processing frameworks. The DIFs of different users have many computation overlaps (i.e., parts of the processing are duplicated), thus wasting computational resources and increasing the overall cost. The output of these computation overlaps (known as intermediate results) can be materialized for reuse, which helps in reducing the cost and saves computational resourc
APA, Harvard, Vancouver, ISO, and other styles
16

Demirsoy, Delil, and Erik Holm. "En studie om Big data och personlig integritet : Vad vet studenter om lagring av deras personliga uppgifter?" Thesis, Högskolan Väst, Avd för informatik, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:hv:diva-15395.

Full text
Abstract:
Denna studie handlar om studenters kännedom om de personliga uppgifter som lagras av institutioner inom högre utbildning, och om det finns skillnader mellan kön gällande kännedomen och hanteringen av dessa uppgifter. Då det i samband med den expanderande lagringen av data och användningen av den genom Big data inom organisationen, visat sig ha påverkan på den personliga integriteten. Tidigare forskning indikerar på att det finns en brist i kännedomen och hanteringen hos människor om vad som lagras av organisationer. Tidigare forskning har även indikerat på att det finns skillnader mellan kön i
APA, Harvard, Vancouver, ISO, and other styles
17

Caetano, André Francisco Morielo [UNESP]. "Griddler: uma estratégia configurável para armazenamento distribuído de objetos peer-to-peer que combina replicação e erasure coding com sistema de cache." Universidade Estadual Paulista (UNESP), 2017. http://hdl.handle.net/11449/151383.

Full text
Abstract:
Submitted by André Francisco Morielo Caetano null (andremorielo@hotmail.com) on 2017-08-18T20:54:09Z No. of bitstreams: 1 Dissertacao_Andre_Morielo-Principal.pdf: 2084639 bytes, checksum: d77158373f8168fc0224d407bb07aa99 (MD5)<br>Approved for entry into archive by Luiz Galeffi (luizgaleffi@gmail.com) on 2017-08-23T19:42:08Z (GMT) No. of bitstreams: 1 caetano_afm_me_sjrp.pdf: 2084639 bytes, checksum: d77158373f8168fc0224d407bb07aa99 (MD5)<br>Made available in DSpace on 2017-08-23T19:42:08Z (GMT). No. of bitstreams: 1 caetano_afm_me_sjrp.pdf: 2084639 bytes, checksum: d77158373f8168fc0224d407
APA, Harvard, Vancouver, ISO, and other styles
18

Navarro, Martín Joan. "From cluster databases to cloud storage: Providing transactional support on the cloud." Doctoral thesis, Universitat Ramon Llull, 2015. http://hdl.handle.net/10803/285655.

Full text
Abstract:
Durant les últimes tres dècades, les limitacions tecnològiques (com per exemple la capacitat dels dispositius d'emmagatzematge o l'ample de banda de les xarxes de comunicació) i les creixents demandes dels usuaris (estructures d'informació, volums de dades) han conduït l'evolució de les bases de dades distribuïdes. Des dels primers repositoris de dades per arxius plans que es van desenvolupar en la dècada dels vuitanta, s'han produït importants avenços en els algoritmes de control de concurrència, protocols de replicació i en la gestió de transaccions. No obstant això, els reptes moderns d'emm
APA, Harvard, Vancouver, ISO, and other styles
19

Megler, Veronika Margaret. "Ranked Similarity Search of Scientific Datasets| An Information Retrieval Approach." Thesis, Portland State University, 2014. http://pqdtopen.proquest.com/#viewpdf?dispub=3629331.

Full text
Abstract:
<p>In the past decade, the amount of scientific data collected and generated by scientists has grown dramatically. This growth has intensified an existing problem: in large archives consisting of datasets stored in many files, formats and locations, how can scientists find data relevant to their research interests? We approach this problem in a new way: by adapting Information Retrieval techniques, developed for searching text documents, into the world of (primarily numeric) scientific data. We propose an approach that uses a blend of automated and curated methods to extract metadata from larg
APA, Harvard, Vancouver, ISO, and other styles
20

Nguyen, Cong-Danh. "Workload- and Data-based Automated Design for a Hybrid Row-Column Storage Model and Bloom Filter-Based Query Processing for Large-Scale DICOM Data Management." Thesis, Université Clermont Auvergne‎ (2017-2020), 2018. http://www.theses.fr/2018CLFAC019/document.

Full text
Abstract:
Dans le secteur des soins de santé, les données d'images médicales toujours croissantes, le développement de technologies d'imagerie, la conservation à long terme des données médicales et l'augmentation de la résolution des images entraînent une croissance considérable du volume de données. En outre, la variété des dispositifs d'acquisition et la différence de préférences des médecins ou d'autres professionnels de la santé ont conduit à une grande variété de données. Bien que la norme DICOM (Digital Imaging et Communication in Medicine) soit aujourd'hui largement adoptée pour stocker et transf
APA, Harvard, Vancouver, ISO, and other styles
21

Pettersson, Emeli, and Albin Carlson. "Att hitta en nål i en höstack: Metoder och tekniker för att sålla och gradera stora mängder ostrukturerad textdata." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-20105.

Full text
Abstract:
Big Data är i dagsläget ett populärt ämne som kan användas för en mängd olika syften. Bland annat kan det användas för att analysera data på webben i hopp om att identifiera brott mot mänskliga rättigheter. Genom att tillämpa tekniker inom områden som Artificiell Intelligens (AI), Information Retrieval (IR) samt data- visualisering, hoppas företaget Globalworks AB kunna identifiera röster vilka uttrycker sig om förtryck och kränkningar i social media. Artificiell intelligens och informationshämtning är dock breda områden och forskning som behandlar dem kan finnas långt ti
APA, Harvard, Vancouver, ISO, and other styles
22

Sodhi, Bir Apaar Singh. "DATA MINING: TRACKING SUSPICIOUS LOGGING ACTIVITY USING HADOOP." CSUSB ScholarWorks, 2016. https://scholarworks.lib.csusb.edu/etd/271.

Full text
Abstract:
In this modern rather interconnected era, an organization’s top priority is to protect itself from major security breaches occurring frequently within a communicational environment. But, it seems, as if they quite fail in doing so. Every week there are new headlines relating to information being forged, funds being stolen and corrupt usage of credit card and so on. Personal computers are turned into “zombie machines” by hackers to steal confidential and financial information from sources without disclosing hacker’s true identity. These identity thieves rob private data and ruin the very purpos
APA, Harvard, Vancouver, ISO, and other styles
23

Homem, Irvin. "LEIA: The Live Evidence Information Aggregator : A Scalable Distributed Hypervisor‐based Peer‐2‐Peer Aggregator of Information for Cyber‐Law Enforcement I." Thesis, KTH, Skolan för informations- och kommunikationsteknik (ICT), 2013. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-177902.

Full text
Abstract:
The Internet in its most basic form is a complex information sharing organism. There are billions of interconnected elements with varying capabilities that work together supporting numerous activities (services) through this information sharing. In recent times, these elements have become portable, mobile, highly computationally capable and more than ever intertwined with human controllers and their activities. They are also rapidly being embedded into other everyday objects and sharing more and more information in order to facilitate automation, signaling that the rise of the Internet of Thin
APA, Harvard, Vancouver, ISO, and other styles
24

Jun, Sang-Woo. "Scalable multi-access flash store for Big Data analytics." Thesis, Massachusetts Institute of Technology, 2014. http://hdl.handle.net/1721.1/87947.

Full text
Abstract:
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2014.<br>Cataloged from PDF version of thesis.<br>Includes bibliographical references (pages 47-49).<br>For many "Big Data" applications, the limiting factor in performance is often the transportation of large amount of data from hard disks to where it can be processed, i.e. DRAM. In this work we examine an architecture for a scalable distributed flash store which aims to overcome this limitation in two ways. First, the architecture provides a high-performance, high-capacity, scalabl
APA, Harvard, Vancouver, ISO, and other styles
25

Nishibe, Caio Arce. "Central de confrontos para um sistema automático de identificação biométrica: uma abordagem de implementação escalável." Universidade Tecnológica Federal do Paraná, 2017. http://repositorio.utfpr.edu.br/jspui/handle/1/3142.

Full text
Abstract:
Com a popularização do uso da biometria, determinar a identidade de um indivíduo é uma atividade cada vez mais comum em diversos contextos: controle de acesso físico e lógico, controle de fronteiras, identificações criminais e forenses, pagamentos. Sendo assim, existe uma demanda crescente por Sistemas Automáticos de Identificação Biométrica (ABIS) cada vez mais rápidos, com elevada acurácia e que possam operar com um grande volume de dados. Este trabalho apresenta uma abordagem de implementação de uma central de confrontos para um ABIS de grande escala utilizando um framework de computação em
APA, Harvard, Vancouver, ISO, and other styles
26

Chebbi, Imen. "Modèles de stockage et d’analyse des données massives appliquées à l’imagerie satellitaire." Electronic Thesis or Diss., Paris 8, 2021. http://www.theses.fr/2021PA080106.

Full text
Abstract:
Notre thèse s’inscrit dans le cadre spatiotemporel des images satellitaires, l’analyse du gros volume d'images devient de plus en plus difficile avec l'apparition des capteurs à très hautes résolutions spatiales, spectrales et temporelles. Afin de pouvoir situer notre thèse en rapport avec la littérature, nous avons étudié les principales étapes du pipeline de grand volume de données et nous avons travaillé sur deux contributions principales qui sont le stockage et le traitement des données. Parmi les objectifs de notre thèse est de développer une architecture adaptée pour notre système du poi
APA, Harvard, Vancouver, ISO, and other styles
27

Fellenberg, Kurt. "Storage and analysis of microarray data." [S.l.] : [s.n.], 2002. http://deposit.ddb.de/cgi-bin/dokserv?idn=964718839.

Full text
APA, Harvard, Vancouver, ISO, and other styles
28

Salzwedel, Kay A. "Data distribution algorithms for storage networks." [S.l. : s.n.], 2004. http://deposit.ddb.de/cgi-bin/dokserv?idn=972387013.

Full text
APA, Harvard, Vancouver, ISO, and other styles
29

Kalezhi, Josephat. "Modelling data storage in nano-island magnetic materials." Thesis, University of Manchester, 2011. https://www.research.manchester.ac.uk/portal/en/theses/modelling-data-storage-in-nanoisland-magnetic-materials(9b449925-1a39-4711-8d55-82e6d8ac215c).html.

Full text
Abstract:
Data storage in current hard disk drives is limited by three factors. These are thermal stability of recorded data, the ability to store data, and the ability to read back the stored data. An attempt to alleviate one factor can affect others. This ultimately limits magnetic recording densities that can be achieved using traditional forms of data storage. In order to advance magnetic recording and postpone these inhibiting factors, new approaches are required. One approach is recording on Bit Patterned Media (BPM) where the medium is patterned into nanometer-sized magnetic islands where each st
APA, Harvard, Vancouver, ISO, and other styles
30

Dousa, Robin, and Alexander Pers. "Business Intelligence - det stora kartläggningspusslet : En studie om insamling och analys av konsumentinformation i livsmedelsbranschen." Thesis, Södertörns högskola, Institutionen för samhällsvetenskaper, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:sh:diva-26241.

Full text
Abstract:
Syfte: Syftet med uppsatsen är att undersöka hur företag, med hjälp av den moderna alltmer avancerade och utvecklade teknologin, systematiskt kartlägger konsumenternas köpbeteenden, genom s.k. business intelligence. Uppsatsen ämnar ta reda på hur teknologin appliceras hos företag samt hur och i vilken mån den data som samlas in används för att få konsumenter till önskade köpbeslut. Teori: Arbetets teoretiska kärna utgörs dels av ett teoretiskt ramverk, i vilket redogörs för business intelligence, samt ett avsnitt där teorier om konsumenternas köpbeteende presenteras. Metod: Arbetet har sin met
APA, Harvard, Vancouver, ISO, and other styles
31

Suthakar, Uthayanath. "A scalable data store and analytic platform for real-time monitoring of data-intensive scientific infrastructure." Thesis, Brunel University, 2017. http://bura.brunel.ac.uk/handle/2438/15788.

Full text
Abstract:
Monitoring data-intensive scientific infrastructures in real-time such as jobs, data transfers, and hardware failures is vital for efficient operation. Due to the high volume and velocity of events that are produced, traditional methods are no longer optimal. Several techniques, as well as enabling architectures, are available to support the Big Data issue. In this respect, this thesis complements existing survey work by contributing an extensive literature review of both traditional and emerging Big Data architecture. Scalability, low-latency, fault-tolerance, and intelligence are key challen
APA, Harvard, Vancouver, ISO, and other styles
32

Кварамба, Рувімбо Рона, and Ruvimbo Ronah Kwaramba. "Methods of Big Data Analysis and Process in Creating a System of Recommendation for an online store." Master's thesis, Тернопільський національний технічний університет імені Івана Пулюя, 2021. http://elartu.tntu.edu.ua/handle/lib/36744.

Full text
Abstract:
Метою дослідження є обґрунтування математичного підходу та відповідного програмного забезпечення для рекомендаційної системи для рекомендації житла для клієнтів. Для досягнення цієї мети необхідно: проаналізувати характеристики вхідних даних та завдання, яке необхідно вирішити. Проаналізувати та обґрунтувати математичний підхід до побудови системи рекомендацій.  .Аналіз та обґрунтування програмних технологій для впровадження системи. Вибір та обґрунтування середовища виконання рекомендаційної системи та впровадження прототипу рекомендаційної системи<br>The purpose of the work is to develop a s
APA, Harvard, Vancouver, ISO, and other styles
33

鄧興汎 and Hing-fan Anthony Tang. "A hybrid relational data structure for virtual reality modelling." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2001. http://hub.hku.hk/bib/B31225184.

Full text
APA, Harvard, Vancouver, ISO, and other styles
34

Manikowski, Adam. "The impact of product, service and in-store environment perceptions on customer satisfaction and behaviour." Thesis, Cranfield University, 2016. http://dspace.lib.cranfield.ac.uk/handle/1826/12309.

Full text
Abstract:
Much previous research concerning the effects of the in-store experience on customers’ decision-making has been laboratory-based. There is a need for empirical research in a real store context to determine the impact of product, service and in-store environment perceptions on customer satisfaction and behaviour. This study is based on a literature review (Project 1) and a large scale empirical study (Projects 2/3) combining two sources of secondary data from the largest retailer in the UK, Tesco, and their loyalty ‘Clubcard’ provider, Dunnhumby. Data includes customer responses to an online se
APA, Harvard, Vancouver, ISO, and other styles
35

Videla, Cavieres Iván Fernando. "Improvement of recommendation system for a wholesale store chain using advanced data mining techniques." Tesis, Universidad de Chile, 2015. http://repositorio.uchile.cl/handle/2250/133522.

Full text
Abstract:
Magíster en Gestión de Operaciones<br>Ingeniero Civil Industrial<br>En las empresas de Retail, las áreas de Customer Intelligence tienen muchas oportunidades de mejorar sus decisiones estratégicas a partir de la información que podrían obtener de los registros de interacciones con sus clientes. Sin embargo se ha convertido en un desafío poder procesar estos grandes volúmenes de datos. Uno de los problemas que se enfrentan día a día es segmentar o agrupar clientes. La mayoría de las empresas generan agrupaciones según nivel de gasto, no por similitud en sus canastas de compra, como propone la
APA, Harvard, Vancouver, ISO, and other styles
36

Girgis, Emad Azmy Sultan. "Development of ferromagnetic, insulator, ferromagnetic devices for digital magnetic data storage and magnetic field sensors." [S.l. : s.n.], 2000. http://deposit.ddb.de/cgi-bin/dokserv?idn=960301429.

Full text
APA, Harvard, Vancouver, ISO, and other styles
37

Bittner, Reinhard. "Basic investigations on PVK based photorefractive polymers focussing on their applicability as mass data storage media." [S.l. : s.n.], 2003. http://deposit.ddb.de/cgi-bin/dokserv?idn=972908420.

Full text
APA, Harvard, Vancouver, ISO, and other styles
38

Todorovic, Ljubisa, and Timi Hoxha. "Digitala verktyg i revisionsprocessen : En kvalitativ jämförelse mellan stora och små byråer." Thesis, Högskolan Kristianstad, Fakulteten för ekonomi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hkr:diva-22504.

Full text
Abstract:
Den digitala utvecklingen pågår i samhället i stort. Revisionsbranschen är en bransch som är i förändring till följd av digitaliseringen. Digitaliseringen tar bland annat uttryck i form av olika digitala verktyg som kan användas i revisionsprocessen. Syftet med digitala verktyg är att förenkla revisionsprocessen och effektivisera. Studiens syfte är att göra en jämförelse mellan hur stora och små byråer använder sig av digitala verktyg i revisionsprocessen. I syfte att göra en jämförelse valdes en kvalitativ metod. Studiens empiri samlades in genom intervjuer av revisorer från stora och små byr
APA, Harvard, Vancouver, ISO, and other styles
39

劉少華 and Siu-wah Lau. "A novel approach to deadlock prevention in store-and-forward networks." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1991. http://hub.hku.hk/bib/B31209798.

Full text
APA, Harvard, Vancouver, ISO, and other styles
40

Clemensson, Lisa. "Utan data är HR bara en funktion med en åsikt? : En kvalitativ studie om datadriven HR inom offentlig sektor." Thesis, Luleå tekniska universitet, Institutionen för ekonomi, teknik och samhälle, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-74720.

Full text
Abstract:
HR-funktionen har genomgått en del förändringar under de senaste åren. Förändringarna har främst skett genom att HR har gått från att vara en personaladministrativ funktion till en allt mer strategisk funktion. Detta har ställt nya krav på HR-funktionens roll och dess arbete. Bland annat har HR behövt bli mer datadrivna. Störst utmaning har detta inneburit för HR-funktioner inom den offentliga sektorn, som till skillnad från privat sektor, fortfarande ligger efter i den datadrivna utvecklingen, och lite forskning har gjorts i den offentliga kontexten. Syftet med denna masteruppsats är därför a
APA, Harvard, Vancouver, ISO, and other styles
41

Camerlengo, Terry Luke. "Techniques for Storing and Processing Next-Generation DNA Sequencing Data." The Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1388502159.

Full text
APA, Harvard, Vancouver, ISO, and other styles
42

Anglesjö, Alice. "Interaktiv visualisering av stora dataset med webbtekniker : En jämförelse mellan JavaScript-biblioteken Leaflet och OpenLayers." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-18774.

Full text
Abstract:
Visualisering är ett kraftfullt sätt att ge människor en förståelse för data med hjälp av mönster. Fördelarna med att kunna visualisera data på rätt kan resultera i förbättrat beslutsfattande, bättre inriktad dataanalys och bättre samarbete och informationsdelning. Visualisering av stora datamängder medför en rad olika utmaningar tack vare dess storlek och struktur. I och med att dagens webbteknologier utvecklats så pass mycket att de kan mäta sig med desktop-applikationer är webbtekniker ett utmärkt hjälpmedel vid visualisering av data för att också göra det mer tillgängligt. I arbetet skapas
APA, Harvard, Vancouver, ISO, and other styles
43

Smedley, Mark, and Gary Simpson. "SHOCK & VIBRATION TESTING OF AN AIRBORNE INSTRUMENTATION DIGITAL RECORDER." International Foundation for Telemetering, 2000. http://hdl.handle.net/10150/606747.

Full text
Abstract:
International Telemetering Conference Proceedings / October 23-26, 2000 / Town & Country Hotel and Conference Center, San Diego, California<br>Shock and vibration testing was performed on the Metrum-Datatape Inc. 32HE recorder to determine its viability as an airborne instrumentation recorder. A secondary goal of the testing was to characterize the recorder operational shock and vibration envelope. Both flight testing and laboratory environmental testing of the recorder was performed to make these determinations. This paper addresses the laboratory portion of the shock and vibration testin
APA, Harvard, Vancouver, ISO, and other styles
44

Ho, Lai-ming, and 何禮明. "Evaluation of the development and impact of clinical information systems." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1998. http://hub.hku.hk/bib/B31236984.

Full text
APA, Harvard, Vancouver, ISO, and other styles
45

Karlsson, Benjamin, and Emil Johansson. "Visualisering av styrdiagram : En fallstudie av fallgropar inom dashboard design." Thesis, Umeå universitet, Institutionen för informatik, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-138724.

Full text
Abstract:
Visualization in process-oriented organizations is becoming increasingly more important for better decision making. This case study explores the design process of an IT company's dashboard software and the software requirements of its client. Based on our results and literature review we propose a control chart design module to incorporate in the dashboard that visualizes organizational variation in an interactive way. We found that user participation was considered very important by the developers but not implemented during early stages of development and that fictional clients were developed
APA, Harvard, Vancouver, ISO, and other styles
46

Settelmeier, Jens. "Theoretical Fundamentals of Computational Proteomics and Deep Learning- Based Identification of Chimeric Mass Spectrometry Data." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-294322.

Full text
Abstract:
A complicating factor for peptide identification by MS/MS experiments is the presence of “chimeric” spectra where at least two precursor ions with similar retention time and mass co- elute in the mass spectrometer. This results in a spectrum that is a superposition of the spectra of the individual peptides. These chimeric spectra make peptide identification more difficult, so chimeric detection tools are needed to improve peptide identification rates. GLEAMS is a learned embedding algorithm for efficient joint analysis of millions of mass spectra. In this work, we first simulate chimeric spect
APA, Harvard, Vancouver, ISO, and other styles
47

von, Wenckstern Michael. "Web applications using the Google Web Toolkit." Master's thesis, Technische Universitaet Bergakademie Freiberg Universitaetsbibliothek "Georgius Agricola", 2013. http://nbn-resolving.de/urn:nbn:de:bsz:105-qucosa-115009.

Full text
Abstract:
This diploma thesis describes how to create or convert traditional Java programs to desktop-like rich internet applications with the Google Web Toolkit. The Google Web Toolkit is an open source development environment, which translates Java code to browser and device independent HTML and JavaScript. Most of the GWT framework parts, including the Java to JavaScript compiler as well as important security issues of websites will be introduced. The famous Agricola board game will be implemented in the Model-View-Presenter pattern to show that complex user interfaces can be created with the Google
APA, Harvard, Vancouver, ISO, and other styles
48

"Evaluation of Storage Systems for Big Data Analytics." Master's thesis, 2017. http://hdl.handle.net/2286/R.I.46221.

Full text
Abstract:
abstract: Recent trends in big data storage systems show a shift from disk centric models to memory centric models. The primary challenges faced by these systems are speed, scalability, and fault tolerance. It is interesting to investigate the performance of these two models with respect to some big data applications. This thesis studies the performance of Ceph (a disk centric model) and Alluxio (a memory centric model) and evaluates whether a hybrid model provides any performance benefits with respect to big data applications. To this end, an application TechTalk is created that uses Ceph to
APA, Harvard, Vancouver, ISO, and other styles
49

Che=Wei, Chuang, and 莊哲偉. "A Scalable Storage System for Big Data Analysis." Thesis, 2015. http://ndltd.ncl.edu.tw/handle/35852997098724933407.

Full text
Abstract:
碩士<br>國立交通大學<br>電子工程學系 電子研究所<br>104<br>Recently, Machine learning has been widely used in various areas. Since Machine Learning is basically for Big Data Analysis which requires large amount of computation loads and storage, Machine learning will be efficiently accelerated if only if computation ability and storage equipment are both properly optimized through some methodologies. We tried to explore a hardware/software co-design platform for big data analysis with machine learning capability and storage scalability to solve the two major problems in Machine learning that is power, and Speed. T
APA, Harvard, Vancouver, ISO, and other styles
50

Nachiappan, Rekha. "Efficient data reliability management of cloud storage systems for big data applications." Thesis, 2020. http://hdl.handle.net/1959.7/uws:57792.

Full text
Abstract:
Cloud service providers are consistently striving to provide efficient and reliable service, to their client's Big Data storage need. Replication is a simple and flexible method to ensure reliability and availability of data. However, it is not an efficient solution for Big Data since it always scales in terabytes and petabytes. Hence erasure coding is gaining traction despite its shortcomings. Deploying erasure coding in cloud storage confronts several challenges like encoding/decoding complexity, load balancing, exponential resource consumption due to data repair and read latency. This thesi
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!