Academic literature on the topic 'Genomics bioinformatics variant discovery sequence analysis'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Genomics bioinformatics variant discovery sequence analysis.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Genomics bioinformatics variant discovery sequence analysis"

1

Ahmed, Zeeshan, Eduard Gibert Renart, and Saman Zeeshan. "Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping." PeerJ 9 (July 26, 2021): e11724. http://dx.doi.org/10.7717/peerj.11724.

Full text
Abstract:
Over the last few decades, genomics is leading toward audacious future, and has been changing our views about conducting biomedical research, studying diseases, and understanding diversity in our society across the human species. The whole genome and exome sequencing (WGS/WES) are two of the most popular next-generation sequencing (NGS) methodologies that are currently being used to detect genetic variations of clinical significance. Investigating WGS/WES data for the variant discovery and genotyping is based on the nexus of different data analytic applications. Although several bioinformatics applications have been developed, and many of those are freely available and published. Timely finding and interpreting genetic variants are still challenging tasks among diagnostic laboratories and clinicians. In this study, we are interested in understanding, evaluating, and reporting the current state of solutions available to process the NGS data of variable lengths and types for the identification of variants, alleles, and haplotypes. Residing within the scope, we consulted high quality peer reviewed literature published in last 10 years. We were focused on the standalone and networked bioinformatics applications proposed to efficiently process WGS and WES data, and support downstream analysis for gene-variant discovery, annotation, prediction, and interpretation. We have discussed our findings in this manuscript, which include but not are limited to the set of operations, workflow, data handling, involved tools, technologies and algorithms and limitations of the assessed applications.
APA, Harvard, Vancouver, ISO, and other styles
2

Wiggans, G. R., D. J. Null, J. B. Cole, and H. D. Norman. "256 GENOMIC EVALUATION OF FERTILITY TRAITS AND DISCOVERY OF HAPLOTYPES THAT AFFECT FERTILITY OF US DAIRY CATTLE." Reproduction, Fertility and Development 28, no. 2 (2016): 260. http://dx.doi.org/10.1071/rdv28n2ab256.

Full text
Abstract:
Genomic evaluations of dairy cattle became official in the United States in January 2009 for Holsteins and Jerseys, and later for Brown Swiss, Ayrshires, and Guernseys. Up to 33 yield, fitness, calving, and conformation traits are evaluated, and the fertility traits included daughter pregnancy rate and heifer and cow conception rates. Additional fertility traits, such as age at first calving and days from calving to first insemination, also are being studied. Male fertility (sire conception rate) is evaluated phenotypically rather than through genomics. Over 1 million animals have genotypes in the national database, which reflects collaboration with Canada and Europe. Most of the genotypes are from females and are from genotyping chips with <30 000 single nucleotide polymorphisms (SNP). To combine data across chips, genotypes are imputed to a set of >77 000 SNP. The imputation process involves dividing the chromosome into segments of approximately equal length and determining the paternal or maternal origin of the alleles. Because some segments were never homozygous, they were assumed to contain an abnormality that resulted in early embryonic death. If a decrease in sire conception rate could be associated with a bull that was a carrier of such a chromosomal segment, the haplotype was designated as affecting fertility. Once the region was identified, bioinformatic analysis was used to discover the causative variant for many of those haplotypes. Accuracy of genomic evaluations is determined by size of the reference population and heritability of the trait. The reference population for Holsteins includes >180 000 bulls and cows. Because fertility traits have low heritabilities, genomic information is particularly useful in improving evaluation accuracy. Accuracy of fertility evaluations is expected to increase further by discovering causative variants for various aspects of conception and gestation through investigation of sequence data.
APA, Harvard, Vancouver, ISO, and other styles
3

Smith, Frances, David Brawand, Laura Steedman, et al. "A Comprehensive Next Generation Sequencing Gene Panel Focused on Unexplained Anemia." Blood 126, no. 23 (2015): 946. http://dx.doi.org/10.1182/blood.v126.23.946.946.

Full text
Abstract:
Abstract Congenital anemia is difficult to diagnose once common causes have been excluded; for example 80% cases of congenital non-spherocytic hemolytic anemia are undiagnosed once pyruvate kinase and G6PD deficiencies have been excluded using phenotypic analysis. We describe a next generation sequencing strategy, targeting 147 genes, to facilitate the diagnosis of these conditions. The coding regions, splice sites and 200 bp into the untranslated regions were examined in each gene. All clinically significant variants were confirmed by Sanger sequencing, including confirmation in any appropriate family members. Illumina MiSeq data was analysed using a bespoke bioinformatics pipeline, which has been validated to a UK certified standard. The pipeline implements detection of genetic variants using multiple base callers and discovery of copy number variants based on sequencing depth. Variants are annotated with information from ClinVar, and population frequency data from ExAC and 1000 genomes project. All genes are sequenced in every individual but data analysis can easily be restricted to virtual subpanels, excluding analysis of genes not requested. Here we present three cases, highlighting the diagnostic utility of the panel as well as the underlying bioinformatics analysis. Case 1. A male Caucasian child of <1 year, presented with haemolysis (LDH 539 IU/L, total bilirubin 39 umol/L), haematology (Hb 92g/L, MCV 84.4, MCH 28.9, absolute retic count 313.8x109/L); his film showed marked anisopoikilocytes, microspherocytes and polychromasia. He had frontal bossing and a palpable spleen and had suffered several infections, the child was transfused once. His father's film showed elliptocytes, FBC (Hb 127g/L, MCV 89.6, MCH 30.6, absolute retic count 230.5x109/L) but he had never been transfused. The mother's FBC was normal (Hb 113g/L, MCV 87.0, MCH 29.2, absolute retic count 48.4x109/L) but her film also showed elliptocytes. Analysis using the red cell panel found the child to be compound heterozygous for c.83G>A; p.Arg28His and c.[5572C>G; 6531-12C>T]; p.[Leu1858Val;?] in the SPTA1 gene, suggesting the diagnosis of hereditary pyropoikilocytosis. The c.83G>A; p.Arg28His mutation was inherited from the father and the c.[5572C>G; 6531-12C>T]; p.[Leu1858Val;?] low expression allele was inherited from the mother, who was homozygous. Case 2. The post mortem report from a hydropic still birth (36/40) showed extensive extramedullary hematopoiesis and severe anemia. A DNA sample was sent to the laboratory accompanied by blood samples from both parents whose hematology was normal. The DNA sample from the proband was relatively small so only the parental samples were analyzed using the red cell panel. Sequence analysis identified the mother to carry the c.3173dupG; p.Gln1659fs pathogenic variant and the father carried the c.2867_2868+1dupCCG pathogenic variant in the CDAN1 gene. Sanger Sequencing showed that the child had inherited both mutations from the parents. Variants in CDAN1 are associated with CDA type 1 which is documented to be a rare form of anemia which can be lethal. Case 3. An Italian girl carrying a paternally inherited c.118C>T β0 thalassemia variant presented with a severe form of microcytic anemia (FBC, Hb 86g/L, RBC 4.87 x1012/L, MCV 55.2, MCH 17.7 and HbA2=5%). The severity of her anemia (not transfused) and palpable spleen suggested she had an additional pathogenic variant that had not been identified. Her mother had normal hematology FBC, Hb 133g/L, RBC 4.82x1012/L, MCV 79.9, MCH 27.6. After sequencing, Exome Depth analysis of the proband's LCR identified a novel deletion which removed the 5' HS1 and HS2 sites but left HS3-5 intact (confirmed by MLPA in the mother and proband). The combination of this mild down regulation of the beta globin locus in combination with the c.118C>T β0 thalassemia variant caused her phenotype to be more severe than just a beta thalassemia carrier. Identifying pathogenic variants in these families is important as it facilitates prognosis and treatment, and allows prenatal diagnosis to be offered in future. To date the panel has assessed 10 cases of anemia with unknown cause and has made a definitive diagnosis in 8 (80%). Of the two undiagnosed, one was a child that died at 3 weeks and received multiple intrauterine and neonatal transfusions and had severe anemia and the other was a suspected case of CDA with little associated phenotype. Disclosures No relevant conflicts of interest to declare.
APA, Harvard, Vancouver, ISO, and other styles
4

Bao, Riyue, Lei Huang, Jorge Andrade, et al. "Review of Current Methods, Applications, and Data Management for the Bioinformatics Analysis of Whole Exome Sequencing." Cancer Informatics 13s2 (January 2014): CIN.S13779. http://dx.doi.org/10.4137/cin.s13779.

Full text
Abstract:
The advent of next-generation sequencing technologies has greatly promoted advances in the study of human diseases at the genomic, transcriptomic, and epigenetic levels. Exome sequencing, where the coding region of the genome is captured and sequenced at a deep level, has proven to be a cost-effective method to detect disease-causing variants and discover gene targets. In this review, we outline the general framework of whole exome sequence data analysis. We focus on established bioinformatics tools and applications that support five analytical steps: raw data quality assessment, preprocessing, alignment, post-processing, and variant analysis (detection, annotation, and prioritization). We evaluate the performance of open-source alignment programs and variant calling tools using simulated and benchmark datasets, and highlight the challenges posed by the lack of concordance among variant detection tools. Based on these results, we recommend adopting multiple tools and resources to reduce false positives and increase the sensitivity of variant calling. In addition, we briefly discuss the current status and solutions for big data management, analysis, and summarization in the field of bioinformatics.
APA, Harvard, Vancouver, ISO, and other styles
5

Yang, Junmeng, Anna Liu, Isabella He, and Yongsheng Bai. "Bioinformatics Analysis Revealed Novel 3′UTR Variants Associated with Intellectual Disability." Genes 11, no. 9 (2020): 998. http://dx.doi.org/10.3390/genes11090998.

Full text
Abstract:
MicroRNAs (or miRNAs) are short nucleotide sequences (~17–22 bp long) that play important roles in gene regulation through targeting genes in the 3′untranslated regions (UTRs). Variants located in genomic regions might have different biological consequences in changing gene expression. Exonic variants (e.g., coding variant and 3′UTR variant) are often causative of diseases due to their influence on gene product. Variants harbored in the 3′UTR region where miRNAs perform their targeting function could potentially alter the binding relationships for target pairs, which could relate to disease causation. We gathered miRNA–mRNA targeting pairs from published studies and then employed the database of microRNA Target Site single nucleotide variants (SNVs) (dbMTS) to discover novel SNVs within the selected pairs. We identified a total of 183 SNVs for the 114 pairs of accurate miRNA–mRNA targeting pairs selected. Detailed bioinformatics analysis of the three genes with identified variants that were exclusively located in the 3′UTR section indicated their association with intellectual disability (ID). Our result showed an exceptionally high expression of GPR88 in brain tissues based on GTEx gene expression data, while WNT7A expression data were relatively high in brain tissues when compared to other tissues. Motif analysis for the 3′UTR region of WNT7A showed that five identified variants were well-conserved across three species (human, mouse, and rat); the motif that contains the variant identified in GPR88 is significant at the level of the 3′UTR of the human genome. Studies of pathways, protein–protein interactions, and relations to diseases further suggest potential association with intellectual disability of our discovered SNVs. Our results demonstrated that 3′UTR variants could change target interactions of miRNA–mRNA pairs in the context of their association with ID. We plan to automate the methods through developing a bioinformatics pipeline for identifying novel 3′UTR SNVs harbored by miRNA-targeted genes in the future.
APA, Harvard, Vancouver, ISO, and other styles
6

Tremblay, Olivier, Zachary Thow, and A. Rod Merrill. "Several New Putative Bacterial ADP-Ribosyltransferase Toxins Are Revealed from In Silico Data Mining, Including the Novel Toxin Vorin, Encoded by the Fire Blight Pathogen Erwinia amylovora." Toxins 12, no. 12 (2020): 792. http://dx.doi.org/10.3390/toxins12120792.

Full text
Abstract:
Mono-ADP-ribosyltransferase (mART) toxins are secreted by several pathogenic bacteria that disrupt vital host cell processes in deadly diseases like cholera and whooping cough. In the last two decades, the discovery of mART toxins has helped uncover the mechanisms of disease employed by pathogens impacting agriculture, aquaculture, and human health. Due to the current abundance of mARTs in bacterial genomes, and an unprecedented availability of genomic sequence data, mART toxins are amenable to discovery using an in silico strategy involving a series of sequence pattern filters and structural predictions. In this work, a bioinformatics approach was used to discover six bacterial mART sequences, one of which was a functional mART toxin encoded by the plant pathogen, Erwinia amylovora, called Vorin. Using a yeast growth-deficiency assay, we show that wild-type Vorin inhibited yeast cell growth, while catalytic variants reversed the growth-defective phenotype. Quantitative mass spectrometry analysis revealed that Vorin may cause eukaryotic host cell death by suppressing the initiation of autophagic processes. The genomic neighbourhood of Vorin indicated that it is a Type-VI-secreted effector, and co-expression experiments showed that Vorin is neutralized by binding of a cognate immunity protein, VorinI. We demonstrate that Vorin may also act as an antibacterial effector, since bacterial expression of Vorin was not achieved in the absence of VorinI. Vorin is the newest member of the mART family; further characterization of the Vorin/VorinI complex may help refine inhibitor design for mART toxins from other deadly pathogens.
APA, Harvard, Vancouver, ISO, and other styles
7

Alsamman, Alsamman M., Shafik D. Ibrahim, and Aladdin Hamwieh. "KASPspoon: an in vitro and in silico PCR analysis tool for high-throughput SNP genotyping." Bioinformatics 35, no. 17 (2019): 3187–90. http://dx.doi.org/10.1093/bioinformatics/btz004.

Full text
Abstract:
Abstract Motivation Fine mapping becomes a routine trial following quantitative trait loci (QTL) mapping studies to shrink the size of genomic segments underlying causal variants. The availability of whole genome sequences can facilitate the development of high marker density and predict gene content in genomic segments of interest. Correlations between genetic and physical positions of these loci require handling of different experimental genetic data types, and ultimately converting them into positioning markers using a routine and efficient tool. Results To convert classical QTL markers into KASP assay primers, KASPspoon simulates a PCR by running an approximate-match searching analysis on user-entered primer pairs against the provided sequences, and then comparing in vitro and in silico PCR results. KASPspoon reports amplimers close to or adjoining genes/SNPs/simple sequence repeats and those that are shared between in vitro and in silico PCR results to select the most appropriate amplimers for gene discovery. KASPspoon compares physical and genetic maps, and reports the primer set genome coverage for PCR-walking. KASPspoon could be used to design KASP assay primers to convert QTL acquired by classical molecular markers into high-throughput genotyping assays and to provide major SNP resource for the dissection of genotypic and phenotypic variation. In addition to human-readable output files, KASPspoon creates Circos configurations that illustrate different in silico and in vitro results. Availability and implementation Code available under GNU GPL at (http://www.ageri.sci.eg/index.php/facilities-services/ageri-softwares/kaspspoon). Supplementary information Supplementary data are available at Bioinformatics online.
APA, Harvard, Vancouver, ISO, and other styles
8

BLAXTER, M., M. ASLETT, D. GUILIANO, J. DAUB, and THE FILARIAL GENOME PROJECT. "Parasitic helminth genomics." Parasitology 118, no. 7 (1999): 39–51. http://dx.doi.org/10.1017/s0031182099004060.

Full text
Abstract:
The initiation of genome projects on helminths of medical importance promises to yield new drug targets and vaccine candidates in unprecedented numbers. In order to exploit this emerging data it is essential that the user community is aware of the scope and quality of data available, and that the genome projects provide analyses of the raw data to highlight potential genes of interest. Core bioinformatics support for the parasite genome projects has promoted these approaches. In the Brugia genome project, a combination of expressed sequence tag sequencing from multiple cDNA libraries representing the complete filarial nematode lifecycle, and comparative analysis of the sequence dataset, particularly using the complete genome sequence of the model nematode C. elegans, has proved very effective in gene discovery.
APA, Harvard, Vancouver, ISO, and other styles
9

Karabayev, Daniyar, Askhat Molkenov, Kaiyrgali Yerulanuly, et al. "re-Searcher: GUI-based bioinformatics tool for simplified genomics data mining of VCF files." PeerJ 9 (May 3, 2021): e11333. http://dx.doi.org/10.7717/peerj.11333.

Full text
Abstract:
Background High-throughput sequencing platforms generate a massive amount of high-dimensional genomic datasets that are available for analysis. Modern and user-friendly bioinformatics tools for analysis and interpretation of genomics data becomes essential during the analysis of sequencing data. Different standard data types and file formats have been developed to store and analyze sequence and genomics data. Variant Call Format (VCF) is the most widespread genomics file type and standard format containing genomic information and variants of sequenced samples. Results Existing tools for processing VCF files don’t usually have an intuitive graphical interface, but instead have just a command-line interface that may be challenging to use for the broader biomedical community interested in genomics data analysis. re-Searcher solves this problem by pre-processing VCF files by chunks to not load RAM of computer. The tool can be used as standalone user-friendly multiplatform GUI application as well as web application (https://nla-lbsb.nu.edu.kz). The software including source code as well as tested VCF files and additional information are publicly available on the GitHub repository (https://github.com/LabBandSB/re-Searcher).
APA, Harvard, Vancouver, ISO, and other styles
10

Knight, Samantha JL, Ruth Clifford, Pauline Robbe, et al. "The Identification of Further Minimal Regions of Overlap in Chronic Lymphocytic Leukemia Using High-Resolution SNP Arrays." Blood 124, no. 21 (2014): 3315. http://dx.doi.org/10.1182/blood.v124.21.3315.3315.

Full text
Abstract:
Abstract Background:Historically, the identification of minimal deleted regions (MDRs) has been a useful approach for pinpointing genes involved in the pathogenesis of human malignancies and constitutional disorders. Microarray technology has offered increased capability for newly identifying or refining existing MDRs and minimal overlapping regions (MORs) in cancer. Despite this, in chronic lymphocytic leukemia (CLL), published MORs that pinpoint only a few candidate genes have been limited and with the advent of NGS, the utility of high resolution array work as a discovery tool has become uncertain. Here, we show that profiling copy number abnormalities (CNAs) and cnLOH using arrays in a large patient series can still be a valuable approach for the identification of genes that are disrupted or mutated in CLL and have a role in CLL development and/or progression. Methods: 250 CLL patient DNAs from individuals enrolled in two UK-based Phase II randomised controlled trials (AdMIRe and ARCTIC trials) were tested using Infinium HumanOmni2.5-8 v1.1 according to manufacturer’s guidelines (Illumina Inc, San Diego, CA). Data were processed using GenomeStudioV2009.2 (Illumina Inc.) and analysed using Nexus Discovery Edition v6.1 (BioDiscovery, Hawthorne, CA). All Nexus plots were inspected visually to verify calls made, identify uncalled events and exclude likely false positives. To exclude common germline CNVs, the Database of Genomic Variants (DGV), a comprehensive catalog of structural variation in control data, was used. Copy number (CN) changes that encompassed fully changes noted in the DGV were excluded from further analysis. Regions of copy neutral loss of heterozygosity (cnLOH) were recorded if >1Mb in size, but were not used to define or refine MORs. Data from 1275 age-appropriate control samples minimised the reporting of common cnLOH events. All genomic coordinates were noted with reference to the GRCh37, hg19 assembly. MORs were investigated using Microsoft Excel filtering functions. A subset of genes (n=91) selected from MORs mainly on the basis of event frequency and/or number of genes within the MOR and/or literature interest were taken forward for targeted sequencing (exons only) of appropriate samples with/without CN Losses or cnLOH (Set 1 n=124; Set 2 n=126). These were tested using custom designed TruSeq Custom Amplicon panels (Illumina Inc) and processed according to manufacturer’s instructions. SAMHD1 was excluded from these panels since it had been studied separately within our laboratory. The data were analysed using an in-house bioinformatics pipeline that uses the sequence aligners MSR and Stampy and the variant callers GATK and Platypus, followed by stringent filtering. Results: Using our datasets we have identified >50 MORs previously unreported in the literature. Six of these showed copy number (CN) losses in >3% of patients studied. Furthermore, we have refined 14 MORs that overlapped with regions described previously and that had also a CN loss frequency of >3%. Thirteen MORs involved only a single reference gene, often a gene implicated previously in cancer (eg. SAMHD1, MTSS1, DCC and RFC1). Of the 91 genes taken forward for targeted sequencing, stringent data filtering led to a subset of 19 genes of interest harbouring exonic mutations. Genes with mutations identified include DCC, BAP1 and FBXW7, also implicated previously in cancer. Conclusion: We have generated high resolution CNA and cnLOH profiles for 250 first-line chemo-immunotherapy treated CLL patients and used this information to document newly identified MORs, to refine MORs reported previously and to identify mutation harbouring genes using targeted NGS. Functional knowledge supports our hypothesis that these genes may have a contributory role in CLL. For two genes, SAMHD1 and FBXW7, relevance in CLL has been established already. Taken together, our data validate the utility of high resolution arrays studies for the identification of candidate genes that may be involved in CLL development or progression when disrupted. Further studies are required to confirm a role for these genes in CLL and to elucidate the nature of the underlying biological mechanisms. Disclosures No relevant conflicts of interest to declare.
APA, Harvard, Vancouver, ISO, and other styles
More sources
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography