Relevant bibliographies by topics / Data Driven Inference

Journal articles
Dissertations / Theses
Books
Book chapters
Conference papers

Academic literature on the topic 'Data Driven Inference'

Author: Grafiati

Published: 4 June 2021

Last updated: 5 February 2022

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Data Driven Inference.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Data Driven Inference"

Padhi, Saswat, Rahul Sharma, and Todd Millstein. "Data-driven precondition inference with learned features." ACM SIGPLAN Notices 51, no. 6 (August 2016): 42–56. http://dx.doi.org/10.1145/2980983.2908099.

Full text

APA, Harvard, Vancouver, ISO, and other styles

GREGOR, J. "DATA-DRIVEN INDUCTIVE INFERENCE OF FINITE-STATE AUTOMATA." International Journal of Pattern Recognition and Artificial Intelligence 08, no. 01 (February 1994): 305–22. http://dx.doi.org/10.1142/s0218001494000140.

Full text

Abstract:

Within the field of structural pattern analysis, algorithms for inference of discrete mathematical models from samples are an important area of research. This paper gives an extensive survey of state-of-the-art methods for data-driven inductive inference of finite-state automata. In addition to providing notationally consistent descriptions of the methods’ fundamental mode of operation, aspects such as sequential learning, advantages and disadvantages, and the extension to stochastic automata are also addressed.

APA, Harvard, Vancouver, ISO, and other styles

Lapalme, Ervig, Jean-Marc Lina, and Jérémie Mattout. "Data-driven parceling and entropic inference in MEG." NeuroImage 30, no. 1 (March 2006): 160–71. http://dx.doi.org/10.1016/j.neuroimage.2005.08.067.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Gile, Krista J., Isabelle S. Beaudry, Mark S. Handcock, and Miles Q. Ott. "Methods for Inference from Respondent-Driven Sampling Data." Annual Review of Statistics and Its Application 5, no. 1 (March 7, 2018): 65–93. http://dx.doi.org/10.1146/annurev-statistics-031017-100704.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Wang, Dan, Shiguang Shan, Hongming Zhang, Wei Zeng, and Xilin Chen. "Data-driven hair segmentation with isomorphic manifold inference." Image and Vision Computing 32, no. 10 (October 2014): 739–50. http://dx.doi.org/10.1016/j.imavis.2014.02.011.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Almeida, Alexandre CL, Anderson R. Duarte, Luiz H. Duczmal, Fernando LP Oliveira, and Ricardo HC Takahashi. "Data-driven inference for the spatial scan statistic." International Journal of Health Geographics 10, no. 1 (2011): 47. http://dx.doi.org/10.1186/1476-072x-10-47.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Gile, Krista J., and Mark S. Handcock. "Network model-assisted inference from respondent-driven sampling data." Journal of the Royal Statistical Society: Series A (Statistics in Society) 178, no. 3 (January 27, 2015): 619–39. http://dx.doi.org/10.1111/rssa.12091.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Hansen, Sofie Therese, Søren Hauberg, and Lars Kai Hansen. "Data-driven forward model inference for EEG brain imaging." NeuroImage 139 (October 2016): 249–58. http://dx.doi.org/10.1016/j.neuroimage.2016.06.017.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Buscemi, Francesco, and Michele Dall’Arno. "Data-driven inference of physical devices: theory and implementation." New Journal of Physics 21, no. 11 (November 14, 2019): 113029. http://dx.doi.org/10.1088/1367-2630/ab5003.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Cattaneo, Matias D., Richard K. Crump, and Michael Jansson. "Robust Data-Driven Inference for Density-Weighted Average Derivatives." Journal of the American Statistical Association 105, no. 491 (September 1, 2010): 1070–83. http://dx.doi.org/10.1198/jasa.2010.tm09590.

Full text

APA, Harvard, Vancouver, ISO, and other styles

More sources

Dissertations / Theses on the topic "Data Driven Inference"

Park, June Young. "Data-driven Building Metadata Inference." Research Showcase @ CMU, 2016. http://repository.cmu.edu/theses/127.

Full text

Abstract:

Building technology has been developed due to the improvement of information technology. Specifically, a human can control and monitor the building operation by a number of sensors and actuators. The sensors and actuators are installed on every single element in a building. Thus, the large stream of building data allows us to implement both quantitative and qualitative improvements. However, there are still limitations to mapping between the physical building element and cyber system. To solve this mapping issue, last summer, a text mining methodology was developed as part of a project conducted by the Consortium for Building Energy Innovation. Building data was extracted from building 661, in Philadelphia, PA. The ground truth of the building data point with semantic information was labeled by manual inspection. And a Support Vector Machine was implemented to investigate the relationship between the data point name and the semantic information. This algorithm achieves 93% accuracy with unseen building 661 data points. Techniques and lessons were gained from this project, and this knowledge was used to develop the framework for analyzing the building data from the Gates Hillman Center (GHC) building, Pittsburgh PA. This new framework consists of two stages. In the first stage, we initially tried to cluster the data points by similar semantic information, using the hierarchical clustering method. However, the effectiveness and accuracy of the clustering method is not adequate for this framework. Thus, the filtering and classification model is developed to identify the semantic information of the data points. From the filtering and classification method, it correctly identifies the damper position and supply air duct pressure data point with 90% accuracy by daily statistical features. Having the semantic information from the first stage, the second stage figures out the relationship between Variable Air Volume (VAV) terminal units and Air Handling Units (AHU). The intuitive thermal and flow relationship between VAVs and AHUs are investigated at the beginning, and the statistical features clustering method is applied from the VAV discharge temperature data. However, the control strategy of this building makes this relationship invisible. Alternatively we then compared the similarity between damper position at VAVs and supply air duct pressure at AHUs by calculating the cross correlation. Finally, this similarity scoring method achieved 80% accuracy to map the relationship between VAVs and AHUs. The suggested framework will guide the user to find the desired information such as the VAVs – AHUs relationship from the problem generated by a large number of heterogeneous sensor networks by using data-driven methodology.

APA, Harvard, Vancouver, ISO, and other styles

Spoon, Steven Alexander. "Demand-Driven Type Inference with Subgoal Pruning." Diss., Georgia Institute of Technology, 2005. http://hdl.handle.net/1853/7486.

Full text

Abstract:

Highly dynamic languages like Smalltalk do not have much static type information immediately available before the program runs. Static types can still be inferred by analysis tools, but historically, such analysis is only effective on smaller programs of at most a few tens of thousands of lines of code. This dissertation presents a new type inference algorithm, DDP, that is effective on larger programs with hundreds of thousands of lines of code. The approach of the algorithm borrows from the field of knowledge-based systems: it is a demand-driven algorithm that sometimes prunes subgoals. The algorithm is formally described, proven correct, and implemented. Experimental results show that the inferred types are usefully precise. A complete program understanding application, Chuck, has been developed that uses DDP type inferences. This work contributes the DDP algorithm itself, the most thorough semantics of Smalltalk to date, a new general approach for analysis algorithms, and experimental analysis of DDP including determination of useful parameter settings. It also contributes an implementation of DDP, a general analysis framework for Smalltalk, and a complete end-user application that uses DDP.

APA, Harvard, Vancouver, ISO, and other styles

Michelen, Strofer Carlos Alejandro. "Machine Learning and Field Inversion approaches to Data-Driven Turbulence Modeling." Diss., Virginia Tech, 2021. http://hdl.handle.net/10919/103155.

Full text

Abstract:

There still is a practical need for improved closure models for the Reynolds-averaged Navier-Stokes (RANS) equations. This dissertation explores two different approaches for using experimental data to provide improved closure for the Reynolds stress tensor field. The first approach uses machine learning to learn a general closure model from data. A novel framework is developed to train deep neural networks using experimental velocity and pressure measurements. The sensitivity of the RANS equations to the Reynolds stress, required for gradient-based training, is obtained by means of both variational and ensemble methods. The second approach is to infer the Reynolds stress field for a flow of interest from limited velocity or pressure measurements of the same flow. Here, this field inversion is done using a Monte Carlo Bayesian procedure and the focus is on improving the inference by enforcing known physical constraints on the inferred Reynolds stress field. To this end, a method for enforcing boundary conditions on the inferred field is presented. The two data-driven approaches explored and improved upon here demonstrate the potential for improved practical RANS predictions.
Doctor of Philosophy
The Reynolds-averaged Navier-Stokes (RANS) equations are widely used to simulate fluid flows in engineering applications despite their known inaccuracy in many flows of practical interest. The uncertainty in the RANS equations is known to stem from the Reynolds stress tensor for which no universally applicable turbulence model exists. The computational cost of more accurate methods for fluid flow simulation, however, means RANS simulations will likely continue to be a major tool in engineering applications and there is still a need for improved RANS turbulence modeling. This dissertation explores two different approaches to use available experimental data to improve RANS predictions by improving the uncertain Reynolds stress tensor field. The first approach is using machine learning to learn a data-driven turbulence model from a set of training data. This model can then be applied to predict new flows in place of traditional turbulence models. To this end, this dissertation presents a novel framework for training deep neural networks using experimental measurements of velocity and pressure. When using velocity and pressure data, gradient-based training of the neural network requires the sensitivity of the RANS equations to the learned Reynolds stress. Two different methods, the continuous adjoint and ensemble approximation, are used to obtain the required sensitivity. The second approach explored in this dissertation is field inversion, whereby available data for a flow of interest is used to infer a Reynolds stress field that leads to improved RANS solutions for that same flow. Here, the field inversion is done via the ensemble Kalman inversion (EKI), a Monte Carlo Bayesian procedure, and the focus is on improving the inference by enforcing known physical constraints on the inferred Reynolds stress field. To this end, a method for enforcing boundary conditions on the inferred field is presented. While further development is needed, the two data-driven approaches explored and improved upon here demonstrate the potential for improved practical RANS predictions.

APA, Harvard, Vancouver, ISO, and other styles

Marcou, Quentin. "Probabilistic approaches to the adaptive immune repertoire : a data-driven approach." Thesis, Sorbonne Paris Cité, 2017. http://www.theses.fr/2017USPCB029/document.

Full text

Abstract:

Le système immunitaire de chaque individu doit faire face à des agressions répétées d'un environnement en constante évolution, constituant ainsi un nombre de menaces virtuellement infini. Afin de mener ce rôle à bien, le système immunitaire adaptatif s'appuie sur une énorme diversité de lymphocytes T et B. Chacune de ces cellules exhibe à sa surface un récepteur unique, créé aléatoirement via le processus de recombinaison V(D)J, et spécifique à un petit nombre de pathogènes seulement. La diversité initiale générée lors de ce processus de recombinaison est ensuite réduite par une étape de sélection fonctionnelle basée sur les propriétés de repliement du récepteur ainsi que ses capacités à interagir avec des protéines du soi. Pour les cellules B, cette diversité peut être à nouveau étendue après rencontre d'un pathogène lors du processus de maturation d'affinité durant lequel le récepteur subit des cycles successifs d'hypermutation et sélection. Ces travaux présentent des approches probabilistes visant à inférer les distributions de probabilités sous-tendant les processus de recombinaison et d'hypermutation à partir de données de séquençage haut débit. Ces approches ont donné naissance à IGoR, un logiciel polyvalent dont les performances dépassent celles des outils existants. En utilisant les modèles obtenus comme base, je présenterai comment ces derniers peuvent être utilisés afin d'étudier le vieillissement et évolution du répertoire immunitaire, la présence d'emprunte parentale lors de la recombinaison V(D)J ou encore pour démontrer que les jumeaux échangent des lymphocytes au cours de la vie fœtale
An individual’s adaptive immune system needs to face repeated challenges of a constantly evolving environment with a virtually infinite number of threats. To achieve this task, the adaptive immune system relies on large diversity of B-cells and T-cells, each carrying a unique receptor specific to a small number of pathogens. These receptors are initially randomly built through the process of V(D)J recombination. This initial generated diversity is then narrowed down by a step of functional selection based on the receptors' folding properties and their ability to recognize self antigens. Upon recognition of a pathogen the B-cell will divide and its offsprings will undergo several rounds of successive somatic hypermutations and selection in an evolutionary process called affinity maturation. This work presents principled probabilistic approaches to infer the probability distribution underlying the recombination and somatic hypermutation processes from high throughput sequencing data using IGoR - a flexible software developed throughout the course of this PhD. IGoR has been developed as a versatile research tool and can encode a variety of models of different biological complexity to allow researchers in the field to characterize evermore precisely immune receptor repertoires. To motivate this data-driven approach we demonstrate that IGoR outperforms existing tools in accuracy and estimate the sample sizes needed for reliable repertoire characterization. Finally, using obtained model predictions, we show potential applications of these methods by demonstrating that homozygous twins share T-cells through cord blood, that the public core of the T cell repertoire is formed in the pre-natal period and finally estimate naive T cell clone lifetimes in human

APA, Harvard, Vancouver, ISO, and other styles

Das, Debasish. "Bayesian Sparse Regression with Application to Data-driven Understanding of Climate." Diss., Temple University Libraries, 2015. http://cdm16002.contentdm.oclc.org/cdm/ref/collection/p245801coll10/id/313587.

Full text

Abstract:

Computer and Information Science
Ph.D.
Sparse regressions based on constraining the L1-norm of the coefficients became popular due to their ability to handle high dimensional data unlike the regular regressions which suffer from overfitting and model identifiability issues especially when sample size is small. They are often the method of choice in many fields of science and engineering for simultaneously selecting covariates and fitting parsimonious linear models that are better generalizable and easily interpretable. However, significant challenges may be posed by the need to accommodate extremes and other domain constraints such as dynamical relations among variables, spatial and temporal constraints, need to provide uncertainty estimates and feature correlations, among others. We adopted a hierarchical Bayesian version of the sparse regression framework and exploited its inherent flexibility to accommodate the constraints. We applied sparse regression for the feature selection problem of statistical downscaling of the climate variables with particular focus on their extremes. This is important for many impact studies where the climate change information is required at a spatial scale much finer than that provided by the global or regional climate models. Characterizing the dependence of extremes on covariates can help in identification of plausible causal drivers and inform extremes downscaling. We propose a general-purpose sparse Bayesian framework for covariate discovery that accommodates the non-Gaussian distribution of extremes within a hierarchical Bayesian sparse regression model. We obtain posteriors over regression coefficients, which indicate dependence of extremes on the corresponding covariates and provide uncertainty estimates, using a variational Bayes approximation. The method is applied for selecting informative atmospheric covariates at multiple spatial scales as well as indices of large scale circulation and global warming related to frequency of precipitation extremes over continental United States. Our results confirm the dependence relations that may be expected from known precipitation physics and generates novel insights which can inform physical understanding. We plan to extend our model to discover covariates for extreme intensity in future. We further extend our framework to handle the dynamic relationship among the climate variables using a nonparametric Bayesian mixture of sparse regression models based on Dirichlet Process (DP). The extended model can achieve simultaneous clustering and discovery of covariates within each cluster. Moreover, the a priori knowledge about association between pairs of data-points is incorporated in the model through must-link constraints on a Markov Random Field (MRF) prior. A scalable and efficient variational Bayes approach is developed to infer posteriors on regression coefficients and cluster variables.
Temple University--Theses

APA, Harvard, Vancouver, ISO, and other styles

Wu, Jinlong. "Predictive Turbulence Modeling with Bayesian Inference and Physics-Informed Machine Learning." Diss., Virginia Tech, 2018. http://hdl.handle.net/10919/85129.

Full text

Abstract:

Reynolds-Averaged Navier-Stokes (RANS) simulations are widely used for engineering design and analysis involving turbulent flows. In RANS simulations, the Reynolds stress needs closure models and the existing models have large model-form uncertainties. Therefore, the RANS simulations are known to be unreliable in many flows of engineering relevance, including flows with three-dimensional structures, swirl, pressure gradients, or curvature. This lack of accuracy in complex flows has diminished the utility of RANS simulations as a predictive tool for engineering design, analysis, optimization, and reliability assessments. Recently, data-driven methods have emerged as a promising alternative to develop the model of Reynolds stress for RANS simulations. In this dissertation I explore two physics-informed, data-driven frameworks to improve RANS modeled Reynolds stresses. First, a Bayesian inference framework is proposed to quantify and reduce the model-form uncertainty of RANS modeled Reynolds stress by leveraging online sparse measurement data with empirical prior knowledge. Second, a machine-learning-assisted framework is proposed to utilize offline high-fidelity simulation databases. Numerical results show that the data-driven RANS models have better prediction of Reynolds stress and other quantities of interest for several canonical flows. Two metrics are also presented for an a priori assessment of the prediction confidence for the machine-learning-assisted RANS model. The proposed data-driven methods are also applicable to the computational study of other physical systems whose governing equations have some unresolved physics to be modeled.
Ph. D.
Reynolds-Averaged Navier–Stokes (RANS) simulations are widely used for engineering design and analysis involving turbulent flows. In RANS simulations, the Reynolds stress needs closure models and the existing models have large model-form uncertainties. Therefore, the RANS simulations are known to be unreliable in many flows of engineering relevance, including flows with three-dimensional structures, swirl, pressure gradients, or curvature. This lack of accuracy in complex flows has diminished the utility of RANS simulations as a predictive tool for engineering design, analysis, optimization, and reliability assessments. Recently, data-driven methods have emerged as a promising alternative to develop the model of Reynolds stress for RANS simulations. In this dissertation I explore two physics-informed, data-driven frameworks to improve RANS modeled Reynolds stresses. First, a Bayesian inference framework is proposed to quantify and reduce the model-form uncertainty of RANS modeled Reynolds stress by leveraging online sparse measurement data with empirical prior knowledge. Second, a machine-learning-assisted framework is proposed to utilize offline high fidelity simulation databases. Numerical results show that the data-driven RANS models have better prediction of Reynolds stress and other quantities of interest for several canonical flows. Two metrics are also presented for an a priori assessment of the prediction confidence for the machine-learning-assisted RANS model. The proposed data-driven methods are also applicable to the computational study of other physical systems whose governing equations have some unresolved physics to be modeled.

APA, Harvard, Vancouver, ISO, and other styles

Koseler, Kaan Tamer. "Realization of Model-Driven Engineering for Big Data: A Baseball Analytics Use Case." Miami University / OhioLINK, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=miami1524832924255132.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Sušak, Hana 1985. "The Hunt of cancer genes : statistical inference of cancer risk and driver genes using next generation sequencuing data." Doctoral thesis, Universitat Pompeu Fabra, 2017. http://hdl.handle.net/10803/668447.

Full text

Abstract:

International cancer sequencing projects have generated comprehensive catalogs of alterations found in tumor genomes, as well as germline variant data for thousands of individuals. In this thesis, we describe two statistical methods exploiting these rich datasets in order to better understand tumor initiation, tumor progression and the contribution of genetic variants to the lifetime risk of developing cancer. The first method, a Bayesian inference model named cDriver, utilizes multiple signatures of positive selection acting on tumor genomes to predict cancer driver genes. Cancer cell fraction is introduced as a novel signature of positive selection on a cellular level, based on the hypothesis that cells obtaining additional advantageous driver mutations will undergo rapid proliferation and clonal expansion. We benchmarked cDriver against state of the art driver prediction methods on three cancer datasets demonstrating equal or better performance than the best competing tool. The second method, termed REWAS is a comprehensive framework for rare-variant association studies (RVAS) aiming at improving identification of cancer predisposition genes. Nonetheless, REWAS is readily applicable to any case-control study of complex diseases. Besides integrating well-established RVAS methods, we developed a novel Bayesian inference RVAS method (BATI) based on Integrated Nested Laplace Approximation (INLA). We demonstrate that BATI outperforms other methods on realistic simulated datasets, especially when meaningful biological context (e.g. functional impact of variants) is available or when risk variants in sum explain low phenotypic variance. Both methods developed during my thesis have the potential to facilitate personalized medicine and oncology through identification of novel therapeutic targets and identification of genetic predisposition facilitating prevention and early diagnosis of cancer.
Els distints projectes internacionals de seqüenciació de càncer duts a terme en els últims anys han generat catàlegs complets d’alteracions trobades en els genomes tumorals, així com informació de variants germinals per a milers d'individus. En aquesta tesi descrivim dos mètodes estadístics aprofitant aquestes bases de dades per tal d’entendre millor la iniciació i la progressió dels tumors, i la contribució de variants genètiques al risc de desenvolupar càncer al llarg de la vida. El primer mètode, anomenat cDriver, es basa en un model d’inferència Bayesià que utilitza múltiples senyals de la selecció positiva que ocorre en els genomes tumorals per tal de predir els gens driver del càncer. En aquest mètode, hem inclòs la fracció de cèl·lules tumorals com a nova senyal de la selecció positiva a nivell cel·lular. Aquesta es basa en la hipòtesi que les cèl·lules que adquireixen mutacions ventajoses proliferaran i s’expandiran clonalment més ràpidament. Per avaluar cDriver, aquest es va comparar amb els mètodes més utilitzats per a la predicció de gens driver actuals. L’anàlisi es va dur a terme amb conjunts de dades de tres càncer diferents i els resultats van ser iguals o millors que els obtinguts per les eines més competitives en el tema. El segon mètode, anomenat REWAS, és un marc de treball per l’estudi d’associació de variants rares (RVAS) amb l'objectiu de millorar la identificació dels gens de predisposició al càncer. Tot i això, REWAS es pot aplicar a qualsevol estudi cas-control de malalties complexes. Per una altra part, a més d'integrar mètodes RVAS ben establerts, hem desenvolupat un nou mètode d'inferència Bayesiana RVAS basat en Integrated Nested Laplace Approximation (BATI). També demostrem que BATI mostra millors resultats que altres mètodes en dades simulades amb soroll de fons real, especialment quan el context biològic (p.e. variants amb impacte funcional) està disponible or quan les variants de risc expliquen en total poca variància fenotípica.

APA, Harvard, Vancouver, ISO, and other styles

Sušak, Hana 1985. "The Hunt of cancer genes : statistical inference of cancer risk and driver genes using next generation sequencing data." Doctoral thesis, Universitat Pompeu Fabra, 2017. http://hdl.handle.net/10803/664504.

Full text

Abstract:

Els distints projectes internacionals de seqüenciació de càncer duts a terme en els últims anys han generat catàlegs complets d’alteracions trobades en els genomes tumorals, així com informació de variants germinals per a milers d'individus. En aquesta tesi descrivim dos mètodes estadístics aprofitant aquestes bases de dades per tal d’entendre millor la iniciació i la progressió dels tumors, i la contribució de variants genètiques al risc de desenvolupar càncer al llarg de la vida. El primer mètode, anomenat cDriver, es basa en un model d’inferència Bayesià que utilitza múltiples senyals de la selecció positiva que ocorre en els genomes tumorals per tal de predir els gens driver del càncer. En aquest mètode, hem inclòs la fracció de cèl·lules tumorals com a nova senyal de la selecció positiva a nivell cel·lular. Aquesta es basa en la hipòtesi que les cèl·lules que adquireixen mutacions ventajoses proliferaran i s’expandiran clonalment més ràpidament. Per avaluar cDriver, aquest es va comparar amb els mètodes més utilitzats per a la predicció de gens driver actuals. L’anàlisi es va dur a terme amb conjunts de dades de tres càncer diferents i els resultats van ser iguals o millors que els obtinguts per les eines més competitives en el tema. El segon mètode, anomenat REWAS, és un marc de treball per l’estudi d’associació de variants rares (RVAS) amb l'objectiu de millorar la identificació dels gens de predisposició al càncer. Tot i això, REWAS es pot aplicar a qualsevol estudi cas-control de malalties complexes. Per una altra part, a més d'integrar mètodes RVAS ben establerts, hem desenvolupat un nou mètode d'inferència Bayesiana RVAS basat en Integrated Nested Laplace Approximation (BATI). També demostrem que BATI mostra millors resultats que altres mètodes en dades simulades amb soroll de fons real, especialment quan el context biològic (p.e. variants amb impacte funcional) està disponible or quan les variants de risc expliquen en total poca variància fenotípica.
International cancer sequencing projects have generated comprehensive catalogs of alterations found in tumor genomes, as well as germline variant data for thousands of individuals. In this thesis, we describe two statistical methods exploiting these rich datasets in order to better understand tumor initiation, tumor progression and the contribution of genetic variants to the lifetime risk of developing cancer. The first method, a Bayesian inference model named cDriver, utilizes multiple signatures of positive selection acting on tumor genomes to predict cancer driver genes. Cancer cell fraction is introduced as a novel signature of positive selection on a cellular level, based on the hypothesis that cells obtaining additional advantageous driver mutations will undergo rapid proliferation and clonal expansion. We benchmarked cDriver against state of the art driver prediction methods on three cancer datasets demonstrating equal or better performance than the best competing tool. The second method, termed REWAS is a comprehensive framework for rare-variant association studies (RVAS) aiming at improving identification of cancer predisposition genes. Nonetheless, REWAS is readily applicable to any case-control study of complex diseases. Besides integrating well-established RVAS methods, we developed a novel Bayesian inference RVAS method (BATI) based on Integrated Nested Laplace Approximation (INLA). We demonstrate that BATI outperforms other methods on realistic simulated datasets, especially when meaningful biological context (e.g. functional impact of variants) is available or when risk variants in sum explain low phenotypic variance. Both methods developed during my thesis have the potential to facilitate personalized medicine and oncology through identification of novel therapeutic targets and identification of genetic predisposition facilitating prevention and early diagnosis of cancer.

APA, Harvard, Vancouver, ISO, and other styles

Silva, Sanchez Rosa Elvira. "Contribution au pronostic de durée de vie des systèmes piles à combustible PEMFC." Thesis, Besançon, 2015. http://www.theses.fr/2015BESA2005/document.

Full text

Abstract:

Les travaux de cette thèse visent à apporter des éléments de solutions au problème de la durée de vie des systèmes pile à combustible (FCS – Fuel Cell System) de type à « membrane échangeuse de protons » (PEM – Proton Exchange Membrane) et se décline sur deux champs disciplinaires complémentaires :Une première approche vise à augmenter la durée de vie de celle-ci par la conception et la mise en œuvre d'une architecture de pronostic et de gestion de l'état de santé (PHM – Prognostics & Health Management). Les PEM-FCS, de par leur technologie, sont par essence des systèmes multi-physiques (électriques, fluidiques, électrochimiques, thermiques, mécaniques, etc.) et multi-échelles (de temps et d'espace) dont les comportements sont difficilement appréhendables. La nature non linéaire des phénomènes, le caractère réversible ou non des dégradations, et les interactions entre composants rendent effectivement difficile une étape de modélisation des défaillances. De plus, le manque d'homogénéité (actuel) dans le processus de fabrication rend difficile la caractérisation statistique de leur comportement. Le déploiement d'une solution PHM permettrait en effet d'anticiper et d'éviter les défaillances, d'évaluer l'état de santé, d'estimer le temps de vie résiduel du système, et finalement, d'envisager des actions de maîtrise (contrôle et/ou maintenance) pour assurer la continuité de fonctionnement. Une deuxième approche propose d'avoir recours à une hybridation passive de la PEMFC avec des super-condensateurs (UC – Ultra Capacitor) de façon à faire fonctionner la pile au plus proche de ses conditions opératoires optimales et ainsi, à minimiser l'impact du vieillissement. Les UCs apparaissent comme une source complémentaire à la PEMFC en raison de leur forte densité de puissance, de leur capacité de charge/décharge rapide, de leur réversibilité et de leur grande durée de vie. Si l'on prend l'exemple des véhicules à pile à combustible, l'association entre une PEMFC et des UCs peut être réalisée en utilisant un système hybride de type actif ou passif. Le comportement global du système dépend à la fois du choix de l'architecture et du positionnement de ces éléments en lien avec la charge électrique. Aujourd'hui, les recherches dans ce domaine se focalisent essentiellement sur la gestion d'énergie entre les sources et stockeurs embarqués ; et sur la définition et l'optimisation d'une interface électronique de puissance destinée à conditionner le flux d'énergie entre eux. Cependant, la présence de convertisseurs statiques augmente les sources de défaillances et pannes (défaillance des interrupteurs du convertisseur statique lui-même, impact des oscillations de courant haute fréquence sur le vieillissement de la pile), et augmente également les pertes énergétiques du système complet (même si le rendement du convertisseur statique est élevé, il dégrade néanmoins le bilan global)
This thesis work aims to provide solutions for the limited lifetime of Proton Exchange Membrane Fuel Cell Systems (PEM-FCS) based on two complementary disciplines:A first approach consists in increasing the lifetime of the PEM-FCS by designing and implementing a Prognostics & Health Management (PHM) architecture. The PEM-FCS are essentially multi-physical systems (electrical, fluid, electrochemical, thermal, mechanical, etc.) and multi-scale (time and space), thus its behaviors are hardly understandable. The nonlinear nature of phenomena, the reversibility or not of degradations and the interactions between components makes it quite difficult to have a failure modeling stage. Moreover, the lack of homogeneity (actual) in the manufacturing process makes it difficult for statistical characterization of their behavior. The deployment of a PHM solution would indeed anticipate and avoid failures, assess the state of health, estimate the Remaining Useful Lifetime (RUL) of the system and finally consider control actions (control and/or maintenance) to ensure operation continuity.A second approach proposes to use a passive hybridization of the PEMFC with Ultra Capacitors (UC) to operate the fuel cell closer to its optimum operating conditions and thereby minimize the impact of aging. The UC appear as an additional source to the PEMFC due to their high power density, their capacity to charge/discharge rapidly, their reversibility and their long life. If we take the example of fuel cell hybrid electrical vehicles, the association between a PEMFC and UC can be performed using a hybrid of active or passive type system. The overall behavior of the system depends on both, the choice of the architecture and the positioning of these elements in connection with the electric charge. Today, research in this area focuses mainly on energy management between the sources and embedded storage and the definition and optimization of a power electronic interface designated to adjust the flow of energy between them. However, the presence of power converters increases the source of faults and failures (failure of the switches of the power converter and the impact of high frequency current oscillations on the aging of the PEMFC), and also increases the energy losses of the entire system (even if the performance of the power converter is high, it nevertheless degrades the overall system)

APA, Harvard, Vancouver, ISO, and other styles

More sources

Books on the topic "Data Driven Inference"

Sullivan, Ryan. Th e dangers of data-driven inference: The case of calendar effects in stock returns. London: London School of Economics, Financial Markets Group, 1998.

Find full text

APA, Harvard, Vancouver, ISO, and other styles

Li, Quan. Using R for Data Analysis in Social Sciences. Oxford University Press, 2018. http://dx.doi.org/10.1093/oso/9780190656218.001.0001.

Full text

Abstract:

This book seeks to teach undergraduate and graduate students in social sciences how to use R to manage, visualize, and analyze data in order to answer substantive questions and replicate published findings. This book distinguishes itself from other introductory R or statistics books in three ways. First, targeting an audience rarely exposed to statistical programming, it adopts a minimalist approach and covers only the most important functions and skills in R that one will need for conducting reproducible research projects. Second, it emphasizes meeting the practical needs of students using R in research projects. Specifically, it teaches students how to import, inspect, and manage data; understand the logic of statistical inference; visualize data and findings via histograms, boxplots, scatterplots, and diagnostic plots; and analyze data using one-sample t-test, difference-of-means test, covariance, correlation, ordinary least squares (OLS) regression, and model assumption diagnostics. Third, it teaches students how to replicate the findings in published journal articles and diagnose model assumption violations. The principle behind this book is to teach students to learn as little R as possible but to do as much reproducible, substance-driven data analysis at the beginner or intermediate level as possible. The minimalist approach dramatically reduces the learning cost but still proves adequate information for meeting the practical research needs of senior undergraduate and beginning graduate students. Having completed this book, students can use R and statistical analysis to answer questions regarding some substantively interesting continuous outcome variable in a cross-sectional design.

APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "Data Driven Inference"

Nickel, Soeren, and Martin Nöllenburg. "Towards Data-Driven Multilinear Metro Maps." In Diagrammatic Representation and Inference, 153–61. Cham: Springer International Publishing, 2020. http://dx.doi.org/10.1007/978-3-030-54249-8_12.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Oncina, José. "The data driven approach applied to the OSTIA algorithm." In Grammatical Inference, 50–56. Berlin, Heidelberg: Springer Berlin Heidelberg, 1998. http://dx.doi.org/10.1007/bfb0054063.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Ravela, Sai. "Statistical Inference for Coherent Fluids." In Dynamic Data-Driven Environmental Systems Science, 121–33. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-25138-7_12.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Wandelt, Benjamin D., Jens Jasche, and Guilhem Lavaux. "Robust, Data-Driven Inference in Non-linear Cosmostatistics." In Lecture Notes in Statistics, 27–40. New York, NY: Springer New York, 2012. http://dx.doi.org/10.1007/978-1-4614-3520-4_3.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Nikolopoulos, Spiros, Georgios Th Papadopoulos, Ioannis Kompatsiaris, and Ioannis Patras. "An Evidence-Driven Probabilistic Inference Framework for Semantic Image Understanding." In Machine Learning and Data Mining in Pattern Recognition, 525–39. Berlin, Heidelberg: Springer Berlin Heidelberg, 2009. http://dx.doi.org/10.1007/978-3-642-03070-3_40.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Markov, Zdravko, and Christo Dichev. "The net-clause language — A tool for data-driven inference." In Lecture Notes in Computer Science, 366–85. Berlin, Heidelberg: Springer Berlin Heidelberg, 1991. http://dx.doi.org/10.1007/bfb0018453.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Shang, Chao. "Nonlinear Dynamic Soft Sensing Based on Bayesian Inference." In Dynamic Modeling of Complex Industrial Processes: Data-driven Methods and Application Research, 125–40. Singapore: Springer Singapore, 2018. http://dx.doi.org/10.1007/978-981-10-6677-1_7.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Miliauskaitė, Jolanta, and Diana Kalibatiene. "Complexity Issues in Data-Driven Fuzzy Inference Systems: Systematic Literature Review." In Communications in Computer and Information Science, 190–204. Cham: Springer International Publishing, 2020. http://dx.doi.org/10.1007/978-3-030-57672-1_15.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Salehnejad, Reza. "‘Homo economicus’ as an intuitive statistician (3): Data-driven causal inference." In Rationality, bounded rationality and microfoundations, 165–203. London: Palgrave Macmillan UK, 2007. http://dx.doi.org/10.1057/9780230625150_6.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Rentz, Niklas, Steven Smyth, Lewe Andersen, and Reinhard von Hanxleden. "Extracting Interactive Actor-Based Dataflow Models from Legacy C Code." In Diagrammatic Representation and Inference, 361–77. Cham: Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-86062-2_37.

Full text

Abstract:

AbstractGraphical actor-based models provide an abstract overview of the flow of data in a system. They are well-established for the model-driven engineering (MDE) of complex software systems and are supported by numerous commercial and academic tools, such as Simulink, LabVIEW or Ptolemy. In MDE, engineers concentrate on constructing and simulating such models, before application code (or at least a large fraction thereof) is synthesized automatically. However, a significant fraction of today’s legacy system has been coded directly, often using the C language. High-level models that give a quick, accurate overview of how components interact are often out of date or do not exist. This makes it challenging to maintain or extend legacy software, in particular for new team members.To address this problem, we here propose to reverse the classic synthesis path of MDE and to synthesize actor-based dataflow models automatically from source code. Here functions in the code get synthesized into nodes that represent actors manipulating data. Second, we propose to harness the modeling-pragmatic approach, which considers visual models not as static artefacts, but allows interactive, flexible views that also link back to textual descriptions. Thus we propose to synthesize actor models that can vary in level of detail and that allow navigation in the source code. To validate and evaluate our proposals, we implemented these concepts for C analysis in the open source, Eclipse-based KIELER project and conducted a small survey.

APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Data Driven Inference"

Gokhale, Amruta, Daeyoung Kim, and Vinod Ganapathy. "Data-Driven Inference of API Mappings." In the 2nd Workshop. New York, New York, USA: ACM Press, 2014. http://dx.doi.org/10.1145/2688471.2688480.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Miltner, Anders, Saswat Padhi, Todd Millstein, and David Walker. "Data-driven inference of representation invariants." In PLDI '20: 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation. New York, NY, USA: ACM, 2020. http://dx.doi.org/10.1145/3385412.3385967.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Padhi, Saswat, Rahul Sharma, and Todd Millstein. "Data-driven precondition inference with learned features." In PLDI '16: ACM SIGPLAN Conference on Programming Language Design and Implementation. New York, NY, USA: ACM, 2016. http://dx.doi.org/10.1145/2908080.2908099.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Tan, Fei, Zhi Wei, Abhishek Pani, and Zhenyu Yan. "User Response Driven Content Understanding with Causal Inference." In 2019 IEEE International Conference on Data Mining (ICDM). IEEE, 2019. http://dx.doi.org/10.1109/icdm.2019.00168.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Mani, Venugopal, Ramasubramanian Balasubramanian, Sushant Kumar, Abhinav Mathur, and Kannan Achan. "On Variational Inference for User Modeling in Attribute-Driven Collaborative Filtering." In 2020 IEEE International Conference on Big Data (Big Data). IEEE, 2020. http://dx.doi.org/10.1109/bigdata50022.2020.9377927.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Bayramoglu, Neslihan, and A. Aydin Alatan. "Segmentation driven semantic information inference from 2.5D data." In 2009 IEEE 17th Signal Processing and Communications Applications Conference (SIU). IEEE, 2009. http://dx.doi.org/10.1109/siu.2009.5136468.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Gao, Jingkun, Joern Ploennigs, and Mario Berges. "A Data-driven Meta-data Inference Framework for Building Automation Systems." In BuildSys '15: The 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments. New York, NY, USA: ACM, 2015. http://dx.doi.org/10.1145/2821650.2821670.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Romer, Anne, Jan Maximilian Montenbruck, and Frank Allgower. "Sampling strategies for data-driven inference of passivity properties." In 2017 IEEE 56th Annual Conference on Decision and Control (CDC). IEEE, 2017. http://dx.doi.org/10.1109/cdc.2017.8264623.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Sharma, Arun K., Dhanjeet Singh, and Nishchal K. Verma. "Data Driven Aerodynamic Modeling Using Mamdani Fuzzy Inference Systems." In 2018 International Conference on Sensing,Diagnostics, Prognostics, and Control (SDPC). IEEE, 2018. http://dx.doi.org/10.1109/sdpc.2018.8664870.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Taibi, Imane, Yassine Hadjadj-Aoul, and Chadi Barakat. "Data Driven Network Performance Inference From Within The Browser." In 2020 IEEE Symposium on Computers and Communications (ISCC). IEEE, 2020. http://dx.doi.org/10.1109/iscc50000.2020.9219573.

Full text

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

Contents

Academic literature on the topic 'Data Driven Inference'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Journal articles on the topic "Data Driven Inference"

Dissertations / Theses on the topic "Data Driven Inference"

Books on the topic "Data Driven Inference"

Book chapters on the topic "Data Driven Inference"

Conference papers on the topic "Data Driven Inference"