Spondyloarthritis encompasses a group of common inflammatory diseases thought to be driven by IL-17A-secreting type-17 lymphocytes. Here we show increased numbers of GM-CSF-producing CD4 and CD8 lymphocytes in the blood and joints of patients with spondyloarthritis, and increased numbers of IL-17A(+)GM-CSF(+) double-producing CD4, CD8, γδ and NK cells. GM-CSF production in CD4 T cells occurs both independently and in combination with classical Th1 and Th17 cytokines. Type 3 innate lymphoid cells producing predominantly GM-CSF are expanded in synovial tissues from patients with spondyloarthritis. GM-CSF(+)CD4(+) cells, isolated using a triple cytokine capture approach, have a specific transcriptional signature. Both GM-CSF(+) and IL-17A(+)GM-CSF(+) double-producing CD4 T cells express increased levels of GPR65, a proton-sensing receptor associated with spondyloarthritis in genome-wide association studies and pathogenicity in murine inflammatory disease models. Silencing GPR65 in primary CD4 T cells reduces GM-CSF production. GM-CSF and GPR65 may thus serve as targets for therapeutic intervention of spondyloarthritis.
Recent advances in cryo-electron microscopy (cryoEM) have dramatically improved the resolutions at which vitrified biological specimens can be studied, revealing new structural and mechanistic insights over a broad range of spatial scales. Bolstered by these advances, much effort has been directed toward the development of hybrid modeling methodologies for the construction and refinement of high-fidelity atomistic models from cryoEM data. In this brief review, we will survey the key elements of cryoEM-based hybrid modeling, providing an overview of available computational tools and strategies as well as several recent applications.
Homozygous deletions are rare in cancers and often target tumour suppressor genes. Here, we build a compendium of 2218 primary tumours across 12 human cancer types and systematically screen for homozygous deletions, aiming to identify rare tumour suppressors. Our analysis defines 96 genomic regions recurrently targeted by homozygous deletions. These recurrent homozygous deletions occur either over tumour suppressors or over fragile sites, regions of increased genomic instability. We construct a statistical model that separates fragile sites from regions showing signatures of positive selection for homozygous deletions and identify candidate tumour suppressors within those regions. We find 16 established tumour suppressors and propose 27 candidate tumour suppressors. Several of these genes (including MGMT, RAD17, and USP44) show prior evidence of a tumour suppressive function. Other candidate tumour suppressors, such as MAFTRR, KIAA1551, and IGF2BP2, are novel. Our study demonstrates how rare tumour suppressors can be identified through copy number meta-analysis.
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD.
PRDM9 binding localizes almost all meiotic recombination sites in humans and mice. However, most PRDM9-bound loci do not become recombination hotspots. To explore factors that affect binding and subsequent recombination outcomes, we mapped human PRDM9 binding sites in a transfected human cell line and measured PRDM9-induced histone modifications. These data reveal varied DNA-binding modalities of PRDM9. We also find that human PRDM9 frequently binds promoters, despite their low recombination rates, and it can activate expression of a small number of genes including CTCFL and VCX. Furthermore, we identify specific sequence motifs that predict consistent, localized meiotic recombination suppression around a subset of PRDM9 binding sites. These motifs strongly associate with KRAB-ZNF protein binding, TRIM28 recruitment, and specific histone modifications. Finally, we demonstrate that, in addition to binding DNA, PRDM9's zinc fingers also mediate its multimerization, and we show that a pair of highly diverged alleles preferentially form homo-multimers.
Genetics in Medicine, | Read more2017. “Not pathogenic until proven otherwise”: perspectives of UK clinical genomics professionals toward secondary findings in context of a Genomic Medicine Multidisciplinary Team and the 100,000 Genomes Project
The discovery of genetic variants influencing sleep patterns can shed light on the physiological processes underlying sleep. As part of a large clinical sequencing project, WGS500, we sequenced a family in which the two male children had severe developmental delay and a dramatically disturbed sleep-wake cycle, with very long wake and sleep durations, reaching up to 106-h awake and 48-h asleep. The most likely causal variant identified was a novel missense variant in the X-linked GRIA3 gene, which has been implicated in intellectual disability. GRIA3 encodes GluA3, a subunit of AMPA-type ionotropic glutamate receptors (AMPARs). The mutation (A653T) falls within the highly conserved transmembrane domain of the ion channel gate, immediately adjacent to the analogous residue in the Grid2 (glutamate receptor) gene, which is mutated in the mouse neurobehavioral mutant, Lurcher. In vitro, the GRIA3(A653T) mutation stabilizes the channel in a closed conformation, in contrast to Lurcher. We introduced the orthologous mutation into a mouse strain by CRISPR-Cas9 mutagenesis and found that hemizygous mutants displayed significant differences in the structure of their activity and sleep compared to wild-type littermates. Typically, mice are polyphasic, exhibiting multiple sleep bouts of sleep several minutes long within a 24-h period. The Gria3A653T mouse showed significantly fewer brief bouts of activity and sleep than the wild-types. Furthermore, Gria3A653T mice showed enhanced period lengthening under constant light compared to wild-type mice, suggesting an increased sensitivity to light. Our results suggest a role for GluA3 channel activity in the regulation of sleep behavior in both mice and humans.
Mutations in FUS are causative for amyotrophic lateral sclerosis with a dominant mode of inheritance. In trying to model FUS-amyotrophic lateral sclerosis (ALS) in mouse it is clear that FUS is dosage-sensitive and effects arise from overexpression per se in transgenic strains. Novel models are required that maintain physiological levels of FUS expression and that recapitulate the human disease-with progressive loss of motor neurons in heterozygous animals. Here, we describe a new humanized FUS-ALS mouse with a frameshift mutation, which fulfils both criteria: the FUS Delta14 mouse. Heterozygous animals express mutant humanized FUS protein at physiological levels and have adult onset progressive motor neuron loss and denervation of neuromuscular junctions. Additionally, we generated a novel antibody to the unique human frameshift peptide epitope, allowing specific identification of mutant FUS only. Using our new FUSDelta14 ALS mouse-antibody system we show that neurodegeneration occurs in the absence of FUS protein aggregation. FUS mislocalization increases as disease progresses, and mutant FUS accumulates at the rough endoplasmic reticulum. Further, transcriptomic analyses show progressive changes in ribosomal protein levels and mitochondrial function as early disease stages are initiated. Thus, our new physiological mouse model has provided novel insight into the early pathogenesis of FUS-ALS.
AIMS/HYPOTHESIS: The genetic risk of type 1 diabetes has been extensively studied. However, the genetic determinants of age at diagnosis (AAD) of type 1 diabetes remain relatively unexplained. Identification of AAD genes and pathways could provide insight into the earliest events in the disease process. METHODS: Using ImmunoChip data from 15,696 cases, we aimed to identify regions in the genome associated with AAD. RESULTS: Two regions were convincingly associated with AAD (p < 5 × 10(-8)): the MHC on 6p21, and 6q22.33. Fine-mapping of 6q22.33 identified two AAD-associated haplotypes in the region nearest to the genes encoding protein tyrosine phosphatase receptor kappa (PTPRK) and thymocyte-expressed molecule involved in selection (THEMIS). We examined the susceptibility to type 1 diabetes at these SNPs by performing a meta-analysis including 19,510 control participants. Although these SNPs were not associated with type 1 diabetes overall (p > 0.001), the SNP most associated with AAD, rs72975913, was associated with susceptibility to type 1 diabetes in those individuals diagnosed at less than 5 years old (p = 2.3 × 10(-9)). CONCLUSION/INTERPRETATION: PTPRK and its neighbour THEMIS are required for early development of the thymus, which we can assume influences the initiation of autoimmunity. Non-HLA genes may only be detectable as risk factors for the disease in individuals diagnosed under the age 5 years because, after that period of immune development, their role in disease susceptibility has become redundant.
BACKGROUND: Genome-wide association studies have recently identified >400 loci that harbor DNA sequence variants that influence blood pressure (BP). Our earlier studies identified and validated 56 single nucleotide variants (SNVs) associated with BP from meta-analyses of exome chip genotype data. An additional 100 variants yielded suggestive evidence of association. METHODS AND RESULTS: Here, we augment the sample with 140 886 European individuals from the UK Biobank, in whom 77 of the 100 suggestive SNVs were available for association analysis with systolic BP or diastolic BP or pulse pressure. We performed 2 meta-analyses, one in individuals of European, South Asian, African, and Hispanic descent (pan-ancestry, ≈475 000), and the other in the subset of individuals of European descent (≈423 000). Twenty-one SNVs were genome-wide significant (P<5×10(-)(8)) for BP, of which 4 are new BP loci: rs9678851 (missense, SLC4A1AP), rs7437940 (AFAP1), rs13303 (missense, STAB1), and rs1055144 (7p15.2). In addition, we identified a potentially independent novel BP-associated SNV, rs3416322 (missense, SYNPO2L) at a known locus, uncorrelated with the previously reported SNVs. Two SNVs are associated with expression levels of nearby genes, and SNVs at 3 loci are associated with other traits. One SNV with a minor allele frequency <0.01, (rs3025380 at DBH) was genome-wide significant. CONCLUSIONS: We report 4 novel loci associated with BP regulation, and 1 independent variant at an established BP locus. This analysis highlights several candidate genes with variation that alter protein function or gene expression for potential follow-up.
BACKGROUND AND AIMS: Increased proinsulin relative to insulin levels have been associated with subclinical atherosclerosis (measured by carotid intima-media thickness (cIMT)) and are predictive of future cardiovascular disease (CVD), independently of established risk factors. The mechanisms linking proinsulin to atherosclerosis and CVD are unclear. A genome-wide meta-analysis has identified nine loci associated with circulating proinsulin levels. Using proinsulin-associated SNPs, we set out to use a Mendelian randomisation approach to test the hypothesis that proinsulin plays a causal role in subclinical vascular remodelling. METHODS: We studied the high CVD-risk IMPROVE cohort (n = 3345), which has detailed biochemical phenotyping and repeated, state-of-the-art, high-resolution carotid ultrasound examinations. Genotyping was performed using Illumina Cardio-Metabo and Immuno arrays, which include reported proinsulin-associated loci. Participants with type 2 diabetes (n = 904) were omitted from the analysis. Linear regression was used to identify proinsulin-associated genetic variants. RESULTS: We identified a proinsulin locus on chromosome 15 (rs8029765) and replicated it in data from 20,003 additional individuals. An 11-SNP score, including the previously identified and the chromosome 15 proinsulin-associated loci, was significantly and negatively associated with baseline IMTmean and IMTmax (the primary cIMT phenotypes) but not with progression measures. However, MR-Eggers refuted any significant effect of the proinsulin-associated 11-SNP score, and a non-pleiotropic SNP score of three variants (including rs8029765) demonstrated no effect on baseline or progression cIMT measures. CONCLUSIONS: We identified a novel proinsulin-associated locus and demonstrated that whilst proinsulin levels are associated with cIMT measures, proinsulin per se is unlikely to have a causative effect on cIMT.
A variety of models have been proposed to explain regions of recurrent somatic copy number alteration (SCNA) in human cancer. Our study employs Whole Genome DNA Sequence (WGS) data from tumor samples (n = 103) to comprehensively assess the role of the Knudson two hit genetic model in SCNA generation in prostate cancer. 64 recurrent regions of loss and gain were detected, of which 28 were novel, including regions of loss with more than 15% frequency at Chr4p15.2-p15.1 (15.53%), Chr6q27 (16.50%) and Chr18q12.3 (17.48%). Comprehensive mutation screens of genes, lincRNA encoding sequences, control regions and conserved domains within SCNAs demonstrated that a two-hit genetic model was supported in only a minor proportion of recurrent SCNA losses examined (15/40). We found that recurrent breakpoints and regions of inversion often occur within Knudson model SCNAs, leading to the identification of ZNF292 as a target gene for the deletion at 6q14.3-q15 and NKX3.1 as a two-hit target at 8p21.3-p21.2. The importance of alterations of lincRNA sequences was illustrated by the identification of a novel mutational hotspot at the KCCAT42, FENDRR, CAT1886 and STCAT2 loci at the 16q23.1-q24.3 loss. Our data confirm that the burden of SCNAs is predictive of biochemical recurrence, define nine individual regions that are associated with relapse, and highlight the possible importance of ion channel and G-protein coupled-receptor (GPCR) pathways in cancer development. We concluded that a two-hit genetic model accounts for about one third of SCNA indicating that mechanisms, such haploinsufficiency and epigenetic inactivation, account for the remaining SCNA losses.
RATIONALE: Microvesicles (MV) act as a non-soluble means of inter-cellular communication, with effector roles in disease pathogenesis and potentially as biomarkers. Previously, we reported that neutrophil MV expressing alpha-2-macroglobulin (A2MG) are protective in experimental sepsis and associate with survival in a small cohort of patients with sepsis due to community acquired pneumonia (CAP). OBJECTIVES: To characterise MV profiles in sepsis due to CAP or faecal peritonitis (FP) and determine their relation to outcome. To investigate the effects of novel sepsis treatments (granulocyte-macrophage colony stimulating factor (GM-CSF) and interferon-υ (IFN-γ)) on MV production and functions in-vitro. METHODS: Flow cytometry analysis of MV identified the cell of origin and the proportion of A2MG expression in the plasma of patients with sepsis secondary to CAP (n = 60) or FP (n = 40) and compared to healthy volunteers (HV, n = 10). The association between MV subsets and outcome was examined. The ability of GM-CSF and IFN-γ on A2MG MV production from whole blood was examined together with the assessment of their effect on neutrophil and endothelial functions. RESULTS: Circulating cell-derived and A2MG MV were higher in CAP compared to FP and HV. A2MG MV were higher in survivors of CAP, but not in FP. GM-CSF and IFN-γ enhanced A2MG MV production, with these MV eliciting pathogen clearance in vitro. CONCLUSIONS: Plasma MV profiles vary according to the source of infection. A2MG MV are associated with survival in CAP but not FP. We propose specific MV subsets as novel biomarkers in sepsis and potential effector for some of the actions of experimental therapeutic interventions.
Multiple sclerosis (MS) afflicts about 2.5 million people globally and poses a major personal and socioeconomic burden. The recognition of MS as an inflammatory disease, characterized by infiltration of immune cells into the central nervous system, has spurred research into the autoimmune etiology of the condition and has provided the rationale for its treatment through immunomodulation. Experience with immunotherapies in MS to date has suggested a disparity between the observed immune cell infiltration and the progressive loss of neurons. However, recent clinical efforts are providing new insights into progressive MS that once again place the immune system at center stage. This article reviews the main mechanisms of MS immunopathogenesis, and the benefits, risks and challenges of immunomodulatory treatments for the disease.
Trans-ethnic meta-analysis of genome-wide association studies (GWAS) across diverse populations can increase power to detect complex trait loci when the underlying causal variants are shared between ancestry groups. However, heterogeneity in allelic effects between GWAS at these loci can occur that is correlated with ancestry. Here, a novel approach is presented to detect SNP association and quantify the extent of heterogeneity in allelic effects that is correlated with ancestry. We employ trans-ethnic meta-regression to model allelic effects as a function of axes of genetic variation, derived from a matrix of mean pairwise allele frequency differences between GWAS, and implemented in the MR-MEGA software. Through detailed simulations, we demonstrate increased power to detect association for MR-MEGA over fixed- and random-effects meta-analysis across a range of scenarios of heterogeneity in allelic effects between ethnic groups. We also demonstrate improved fine-mapping resolution, in loci containing a single causal variant, compared to these meta-analysis approaches and PAINTOR, and equivalent performance to MANTRA at reduced computational cost. Application of MR-MEGA to trans-ethnic GWAS of kidney function in 71,461 individuals indicates stronger signals of association than fixed-effects meta-analysis when heterogeneity in allelic effects is correlated with ancestry. Application of MR-MEGA to fine-mapping four type 2 diabetes susceptibility loci in 22,086 cases and 42,539 controls highlights: (i) strong evidence for heterogeneity in allelic effects that is correlated with ancestry only at the index SNP for the association signal at the CDKAL1 locus; and (ii) 99% credible sets with six or fewer variants for five distinct association signals.
Human dynamin-like, interferon-induced myxovirus resistance 2 (Mx2 or MxB) is a potent HIV-1 inhibitor. Antiviral activity requires both the amino-terminal region of MxB and protein oligomerization, each of which has eluded structural determination due to difficulties in protein preparation. We report that maltose binding protein-fused, full-length wild-type MxB purifies as oligomers and further self-assembles into helical arrays in physiological salt. Guanosine triphosphate (GTP), but not guanosine diphosphate, binding results in array disassembly, whereas subsequent GTP hydrolysis allows its reformation. Using cryo-electron microscopy (cryoEM), we determined the MxB assembly structure at 4.6 Å resolution, representing the first near-atomic resolution structure in the mammalian dynamin superfamily. The structure revealed previously described and novel MxB assembly interfaces. Mutational analyses demonstrated a critical role for one of the novel interfaces in HIV-1 restriction.
BACKGROUND: Autoimmune disease-associated variants are preferentially found in regulatory regions in immune cells, particularly CD4(+) T cells. Linking such regulatory regions to gene promoters in disease-relevant cell contexts facilitates identification of candidate disease genes. RESULTS: Within 4 h, activation of CD4(+) T cells invokes changes in histone modifications and enhancer RNA transcription that correspond to altered expression of the interacting genes identified by promoter capture Hi-C. By integrating promoter capture Hi-C data with genetic associations for five autoimmune diseases, we prioritised 245 candidate genes with a median distance from peak signal to prioritised gene of 153 kb. Just under half (108/245) prioritised genes related to activation-sensitive interactions. This included IL2RA, where allele-specific expression analyses were consistent with its interaction-mediated regulation, illustrating the utility of the approach. CONCLUSIONS: Our systematic experimental framework offers an alternative approach to candidate causal gene identification for variants with cell state-specific functional effects, with achievable sample sizes.
Cancer is characterised by complex somatically acquired genetic aberrations that manifest as intra-tumour and inter-tumour genetic heterogeneity and can lead to treatment resistance. In this case study, we characterise the genome-wide somatic mutation dynamics in a metastatic melanoma patient during therapy using low-input (50 ng) PCR-free whole genome sequencing of cell-free DNA from pre-treatment and post-relapse blood samples. We identify de novo tumour-specific somatic mutations from cell-free DNA, while the sequence context of single nucleotide variants showed the characteristic UV-damage mutation signature of melanoma. To investigate the behaviour of individual somatic mutations during proto-oncogene B-Raf -targeted and immune checkpoint inhibition, amplicon-based deep sequencing was used to verify and track frequencies of 212 single nucleotide variants at 10 distinct time points over 13 months of treatment. Under checkpoint inhibition therapy, we observed an increase in mutant allele frequencies indicating progression on therapy 88 days before clinical determination of non-response positron emission tomogrophy-computed tomography. We also revealed mutations from whole genome sequencing of cell-free DNA that were not present in the tissue biopsy, but that later contributed to relapse. Our findings have potential clinical applications where high quality tumour-tissue derived DNA is not available.
PURPOSE OF REVIEW: Genome-wide association studies (GWAS) for type 2 diabetes (T2D) risk have identified a large number of genetic loci associated with disease susceptibility. However, progress moving from association signals through causal genes to functional understanding has so far been slow, hindering clinical translation. This review discusses the benefits and limitations of emerging, unbiased approaches for prioritising causal genes at T2D risk loci. RECENT FINDINGS: Candidate causal genes can be identified by a number of different strategies that rely on genetic data, genomic annotations, and functional screening of selected genes. To overcome the limitations of each particular method, integration of multiple data sets is proving essential for establishing confidence in the prioritised genes. Previous studies have also highlighted the need to support these efforts through identification of causal variants and disease-relevant tissues. Prioritisation of causal genes at T2D risk loci by integrating complementary lines of evidence promises to accelerate our understanding of disease pathology and promote translation into new therapeutics.
BACKGROUND: Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely biological action as erythrocytic (also associated with erythrocyte traits) or glycemic (associated with other glucose-related traits). In this study, we tested the hypotheses that, in a very large scale GWAS, we would identify more genetic variants associated with HbA1c and that HbA1c variants implicated in erythrocytic biology would affect the diagnostic accuracy of HbA1c. We therefore expanded the number of HbA1c-associated loci and tested the effect of genetic risk-scores comprised of erythrocytic or glycemic variants on incident diabetes prediction and on prevalent diabetes screening performance. Throughout this multiancestry study, we kept a focus on interancestry differences in HbA1c genetics performance that might influence race-ancestry differences in health outcomes. METHODS & FINDINGS: Using genome-wide association meta-analyses in up to 159,940 individuals from 82 cohorts of European, African, East Asian, and South Asian ancestry, we identified 60 common genetic variants associated with HbA1c. We classified variants as implicated in glycemic, erythrocytic, or unclassified biology and tested whether additive genetic scores of erythrocytic variants (GS-E) or glycemic variants (GS-G) were associated with higher T2D incidence in multiethnic longitudinal cohorts (N = 33,241). Nineteen glycemic and 22 erythrocytic variants were associated with HbA1c at genome-wide significance. GS-G was associated with higher T2D risk (incidence OR = 1.05, 95% CI 1.04-1.06, per HbA1c-raising allele, p = 3 × 10-29); whereas GS-E was not (OR = 1.00, 95% CI 0.99-1.01, p = 0.60). In Europeans and Asians, erythrocytic variants in aggregate had only modest effects on the diagnostic accuracy of HbA1c. Yet, in African Americans, the X-linked G6PD G202A variant (T-allele frequency 11%) was associated with an absolute decrease in HbA1c of 0.81%-units (95% CI 0.66-0.96) per allele in hemizygous men, and 0.68%-units (95% CI 0.38-0.97) in homozygous women. The G6PD variant may cause approximately 2% (N = 0.65 million, 95% CI 0.55-0.74) of African American adults with T2D to remain undiagnosed when screened with HbA1c. Limitations include the smaller sample sizes for non-European ancestries and the inability to classify approximately one-third of the variants. Further studies in large multiethnic cohorts with HbA1c, glycemic, and erythrocytic traits are required to better determine the biological action of the unclassified variants. CONCLUSIONS: As G6PD deficiency can be clinically silent until illness strikes, we recommend investigation of the possible benefits of screening for the G6PD genotype along with using HbA1c to diagnose T2D in populations of African ancestry or groups where G6PD deficiency is common. Screening with direct glucose measurements, or genetically-informed HbA1c diagnostic thresholds in people with G6PD deficiency, may be required to avoid missed or delayed diagnoses.
Genome-wide sequencing technologies are beginning to be used in projects that have both clinical diagnostic and research components. The clinical application of this technology, which generates a huge amount of information of varying diagnostic certainty, involves addressing a number of challenges to establish appropriate standards. In this article, we explore the way that UK law may respond to three of these key challenges and could establish new legal duties in relation to feedback of findings that are unrelated to the presenting condition (secondary, additional or incidental findings); duties towards genetic relatives as well as the patient and duties on the part of researchers and professionals who do not have direct contact with patients. When considering these issues, the courts will take account of European and international comparisons, developing guidance and relevant ethical, social and policy factors. The UK courts will also be strongly influenced by precedent set in case law.
BACKGROUND: Host responses during sepsis are highly heterogeneous, which hampers the identification of patients at high risk of mortality and their selection for targeted therapies. In this study, we aimed to identify biologically relevant molecular endotypes in patients with sepsis. METHODS: This was a prospective observational cohort study that included consecutive patients admitted for sepsis to two intensive care units (ICUs) in the Netherlands between Jan 1, 2011, and July 20, 2012 (discovery and first validation cohorts) and patients admitted with sepsis due to community-acquired pneumonia to 29 ICUs in the UK (second validation cohort). We generated genome-wide blood gene expression profiles from admission samples and analysed them by unsupervised consensus clustering and machine learning. The primary objective of this study was to establish endotypes for patients with sepsis, and assess the association of these endotypes with clinical traits and survival outcomes. We also established candidate biomarkers for the endotypes to allow identification of patient endotypes in clinical practice. FINDINGS: The discovery cohort had 306 patients, the first validation cohort had 216, and the second validation cohort had 265 patients. Four molecular endotypes for sepsis, designated Mars1-4, were identified in the discovery cohort, and were associated with 28-day mortality (log-rank p=0·022). In the discovery cohort, the worst outcome was found for patients classified as having a Mars1 endotype, and at 28 days, 35 (39%) of 90 people with a Mars1 endotype had died (hazard ratio [HR] vs all other endotypes 1·86 [95% CI 1·21-2·86]; p=0·0045), compared with 23 (22%) of 105 people with a Mars2 endotype (HR 0·64 [0·40-1·04]; p=0·061), 16 (23%) of 71 people with a Mars3 endotype (HR 0·71 [0·41-1·22]; p=0·19), and 13 (33%) of 40 patients with a Mars4 endotype (HR 1·13 [0·63-2·04]; p=0·69). Analysis of the net reclassification improvement using a combined clinical and endotype model significantly improved risk prediction to 0·33 (0·09-0·58; p=0·008). A 140-gene expression signature reliably stratified patients with sepsis to the four endotypes in both the first and second validation cohorts. Only Mars1 was consistently significantly associated with 28-day mortality across the cohorts. To facilitate possible clinical use, a biomarker was derived for each endotype; BPGM and TAP2 reliably identified patients with a Mars1 endotype. INTERPRETATION: This study provides a method for the molecular classification of patients with sepsis to four different endotypes upon ICU admission. Detection of sepsis endotypes might assist in providing personalised patient management and in selection for trials. FUNDING: Center for Translational Molecular Medicine, Netherlands.
Endometriosis is a benign gynecologic disorder, affecting up to 10% of women, characterized by the presence of functional endometrial tissue at ectopic positions generally within the peritoneum. It is a heritable condition influenced by multiple genetic and environmental factors, with an overall heritability estimated at approximately 50%. In this study, we investigated whether single nucleotide polymorphisms (SNPs) rs7521902, rs10859871 and rs11031006, mapping to WNT4, VEZT and FSHB genetic loci, respectively, are associated with risk for endometriosis in a Greek population. This study included 166 women with histologically confirmed endometriosis diagnosed through surgery and 150 normal controls. Genotyping of the rs7521902, rs10859871 and rs11031006 SNPs was performed with Taqman primer/probe sets. A significant association was detected with the AC genotype of rs7521902 (WNT4) in patients with stage III and IV disease only. Evidence for association with endometriosis was also found for the AC genotype of the rs10859871 of VEZT. Notably, a significant difference in the distribution of the AG genotype and the minor allele A of FSHB rs11031006 SNP was found between the endometriosis patients and controls. We found a genetic association between rs11031006 (FSHB) SNP and endometriosis. WNT4 and VEZT genes constitute the most consistently associated genes with endometriosis. In the present study, an association of rs7521902 (WNT4) and rs10859871 (VEZT) was confirmed in women with endometriosis at the genotypic but not the allelic level.
The maintenance of peripheral naive T lymphocytes in humans is dependent on their homeostatic division, not continuing emigration from the thymus, which undergoes involution with age. However, postthymic maintenance of naive T cells is still poorly understood. Previously we reported that recent thymic emigrants (RTEs) are contained in CD31+CD25- naive T cells as defined by their levels of signal joint T cell receptor rearrangement excision circles (sjTRECs). Here, by differential gene expression analysis followed by protein expression and functional studies, we define that the naive T cells having divided the least since thymic emigration express complement receptors (CR1 and CR2) known to bind complement C3b- and C3d-decorated microbial products and, following activation, produce IL-8 (CXCL8), a major chemoattractant for neutrophils in bacterial defense. We also observed an IL-8-producing memory T cell subpopulation coexpressing CR1 and CR2 and with a gene expression signature resembling that of RTEs. The functions of CR1 and CR2 on T cells remain to be determined, but we note that CR2 is the receptor for Epstein-Barr virus, which is a cause of T cell lymphomas and a candidate environmental factor in autoimmune disease.
The majority of routinely given vaccines require two or three immunisations for full protective efficacy. Single dose vaccination has long been considered a key solution to improving the global immunisation coverage. Recent infectious disease outbreaks have further highlighted the need for vaccines that can achieve full efficacy after a single administration. Viral vectors are a potent immunisation platform, benefiting from intrinsic immuno-stimulatory features while retaining excellent safety profile through the use of non-replicating viruses. We investigated the scope for enhancing the protective efficacy of a single dose adenovirus-vectored malaria vaccine in a mouse model of malaria by co-administering it with vaccine adjuvants. Out of 11 adjuvants, only two, Abisco(®)-100 and CoVaccineHT(TM), enhanced vaccine efficacy and sterile protection following malaria challenge. The CoVaccineHT(TM) adjuvanted vaccine induced significantly higher proportion of antigen specific central memory CD8(+) cells, and both adjuvants resulted in increased proportion of CD8(+) T cells expressing the CD107a degranulation marker in the absence of IFNγ, TNFα and IL2 production. Our results show that the efficacy of vaccines designed to induce protective T cell responses can be positively modulated with chemical adjuvants and open the possibility of achieving full protection with a single dose immunisation.
We have just witnessed the largest and most devastating outbreak of Ebola virus disease, which highlighted the urgent need for development of an efficacious vaccine that could be used to curtail future outbreaks. Prior to 2014, there had been limited impetus worldwide to develop a vaccine since the virus was first discovered in 1976. Though too many lives were lost during this outbreak, it resulted in the significantly accelerated clinical development of a number of candidate vaccines through an extraordinary collaborative global effort coordinated by the World Health Organisation (WHO) and involving a number of companies, trial centres, funders, global stakeholders and agencies. We have acquired substantial safety and immunogenicity data on a number of vaccines in Caucasian and African populations. The rapid pace of events led to the initiation of the landmark efficacy trial testing the rVSV-vectored vaccine, which showed high level efficacy in an outbreak setting when deployed using an innovative ring vaccination strategy. Though the Public Health Emergency of International Concern (PHEIC) declared by the WHO has now been lifted, the global scientific community faces numerous challenges ahead to ensure that there is a licensed, deployable vaccine available for use in future outbreaks for at least the Zaire and Sudan strains of Ebola virus. There remain several unanswered questions on the durability of protection, mechanistic immunological correlates and preferred deployment strategies. This review outlines a brief history of the development of Ebola vaccines, the significant progress made since the scale of the outbreak became apparent, some lessons learnt and how they could shape future development of vaccines and the management of similar outbreaks.
BACKGROUND: The incidence of type 1 diabetes (T1D) is increasing globally. One hypothesis is that increasing childhood obesity rates may explain part of this increase, but, as T1D is rare, intervention studies are challenging to perform. The aim of this study was to assess this hypothesis with a Mendelian randomization approach that uses genetic variants as instrumental variables to test for causal associations. METHODS AND FINDINGS: We created a genetic instrument of 23 single nucleotide polymorphisms (SNPs) associated with childhood adiposity in children aged 2-10 years. Summary-level association results for these 23 SNPs with childhood-onset (<17 years) T1D were extracted from a meta-analysis of genome-wide association study with 5,913 T1D cases and 8,828 reference samples. Using inverse-variance weighted Mendelian randomization analysis, we found support for an effect of childhood adiposity on T1D risk (odds ratio 1.32, 95% CI 1.06-1.64 per standard deviation score in body mass index [SDS-BMI]). A sensitivity analysis provided evidence of horizontal pleiotropy bias (p = 0.04) diluting the estimates towards the null. We therefore applied Egger regression and multivariable Mendelian randomization methods to control for this type of bias and found evidence in support of a role of childhood adiposity in T1D (odds ratio in Egger regression, 2.76, 95% CI 1.40-5.44). Limitations of our study include that underlying genes and their mechanisms for most of the genetic variants included in the score are not known. Mendelian randomization requires large sample sizes, and power was limited to provide precise estimates. This research has been conducted using data from the Early Growth Genetics (EGG) Consortium, the Genetic Investigation of Anthropometric Traits (GIANT) Consortium, the Tobacco and Genetics (TAG) Consortium, and the Social Science Genetic Association Consortium (SSGAC), as well as meta-analysis results from a T1D genome-wide association study. CONCLUSIONS: This study provides genetic support for a link between childhood adiposity and T1D risk. Together with evidence from observational studies, our findings further emphasize the importance of measures to reduce the global epidemic of childhood obesity and encourage mechanistic studies.
Patterns of genomic evolution between primary and metastatic breast cancer have not been studied in large numbers, despite patients with metastatic breast cancer having dismal survival. We sequenced whole genomes or a panel of 365 genes on 299 samples from 170 patients with locally relapsed or metastatic breast cancer. Several lines of analysis indicate that clones seeding metastasis or relapse disseminate late from primary tumors, but continue to acquire mutations, mostly accessing the same mutational processes active in the primary tumor. Most distant metastases acquired driver mutations not seen in the primary tumor, drawing from a wider repertoire of cancer genes than early drivers. These include a number of clinically actionable alterations and mutations inactivating SWI-SNF and JAK2-STAT3 pathways.
BACKGROUND: Genomic tests may improve the stratification of patients to receive new therapies in several disease areas. However, the use of expensive targeted therapies can impact on the cost effectiveness of these tests. This study presents an economic evaluation of genomic testing in chronic lymphocytic leukaemia in the context of the UK National Health Service. METHODS: Cost-effectiveness and cost-utility analyses (using life-years and quality-adjusted life-years) were undertaken from a National Health Service and societal perspective. Five strategies were evaluated across several age groups using Markov modelling: three strategies that reflected varying current genetic testing practice and two configurations of genomic testing (including ibrutinib treatment). RESULTS: Genomic testing strategies yielded the most life-years/quality-adjusted life-years per patient, but were not cost effective compared with a threshold of £30,000 per life-year/quality-adjusted life-year gained. Cost-effectiveness acceptability curves indicated that there was some uncertainty surrounding this result. A genomic testing strategy becomes the most cost-effective option if a higher end-of-life cost-effectiveness threshold of £50,000 is applied, if a societal costing perspective is considered in 25-year-old patients or if the cost of ibrutinib treatment falls. CONCLUSION: Stratifying patients with chronic lymphocytic leukaemia to targeted treatment using genomic testing improves health outcomes, but will likely only represent a cost-effective use of limited National Health Service resources if a higher cost-effectiveness threshold or societal costing perspective is applied, or if the price of ibrutinib treatment is reduced. This result may be broadly indicative of the likely cost effectiveness of other genomic tests that inform the stratification of patients to high cost-targeted therapies.
Genetic discovery from the multitude of phenotypes extractable from routine healthcare data can transform understanding of the human phenome and accelerate progress toward precision medicine. However, a critical question when analyzing high-dimensional and heterogeneous data is how best to interrogate increasingly specific subphenotypes while retaining statistical power to detect genetic associations. Here we develop and employ a new Bayesian analysis framework that exploits the hierarchical structure of diagnosis classifications to analyze genetic variants against UK Biobank disease phenotypes derived from self-reporting and hospital episode statistics. Our method displays a more than 20% increase in power to detect genetic effects over other approaches and identifies new associations between classical human leukocyte antigen (HLA) alleles and common immune-mediated diseases (IMDs). By applying the approach to genetic risk scores (GRSs), we show the extent of genetic sharing among IMDs and expose differences in disease perception or diagnosis with potential clinical implications.
Epicardium-derived cells (EPDCs) contribute cardiovascular cell types during development and in adulthood respond to Thymosin β4 (Tβ4) and myocardial infarction (MI) by reactivating a fetal gene programme to promote neovascularization and cardiomyogenesis. The mechanism for epicardial gene (re-)activation remains elusive. Here we reveal that BRG1, the essential ATPase subunit of the SWI/SNF chromatin-remodelling complex, is required for expression of Wilms' tumour 1 (Wt1), fetal EPDC activation and subsequent differentiation into coronary smooth muscle, and restores Wt1 activity upon MI. BRG1 physically interacts with Tβ4 and is recruited by CCAAT/enhancer-binding protein β (C/EBPβ) to discrete regulatory elements in the Wt1 locus. BRG1-Tβ4 co-operative binding promotes optimal transcription of Wt1 as the master regulator of embryonic EPDCs. Moreover, chromatin immunoprecipitation-sequencing reveals BRG1 binding at further key loci suggesting SWI/SNF activity across the fetal epicardial gene programme. These findings reveal essential functions for chromatin-remodelling in the activation of EPDCs during cardiovascular development and repair.
The genome is organized via CTCF-cohesin-binding sites, which partition chromosomes into 1-5 megabase (Mb) topologically associated domains (TADs), and further into smaller sub-domains (sub-TADs). Here we examined in vivo an ∼80 kb sub-TAD, containing the mouse α-globin gene cluster, lying within a ∼1 Mb TAD. We find that the sub-TAD is flanked by predominantly convergent CTCF-cohesin sites that are ubiquitously bound by CTCF but only interact during erythropoiesis, defining a self-interacting erythroid compartment. Whereas the α-globin regulatory elements normally act solely on promoters downstream of the enhancers, removal of a conserved upstream CTCF-cohesin boundary extends the sub-TAD to adjacent upstream CTCF-cohesin-binding sites. The α-globin enhancers now interact with the flanking chromatin, upregulating expression of genes within this extended sub-TAD. Rather than acting solely as a barrier to chromatin modification, CTCF-cohesin boundaries in this sub-TAD delimit the region of chromatin to which enhancers have access and within which they interact with receptive promoters.
Elevated blood pressure is a major risk factor for cardiovascular disease and has a substantial genetic contribution. Genetic variation influencing blood pressure has the potential to identify new pharmacological targets for the treatment of hypertension. To discover additional novel blood pressure loci, we used 1000 Genomes Project-based imputation in 150 134 European ancestry individuals and sought significant evidence for independent replication in a further 228 245 individuals. We report 6 new signals of association in or near HSPB7, TNXB, LRP12, LOC283335, SEPT9, and AKT2, and provide new replication evidence for a further 2 signals in EBF2 and NFKBIA Combining large whole-blood gene expression resources totaling 12 607 individuals, we investigated all novel and previously reported signals and identified 48 genes with evidence for involvement in blood pressure regulation that are significant in multiple resources. Three novel kidney-specific signals were also detected. These robustly implicated genes may provide new leads for therapeutic innovation.
© Institute of Mathematical Statistics, 2017. The combination of genetic information with electronic patient records promises to provide a powerful new resource for understanding human disease and its treatment. Here we develop and apply a novel stochastic compartmental model to a large dataset on Clostridium difficile infection (CDI) in three Oxfordshire hospitals over a 2.5 year period which combines genetic information on 858 confirmed cases of CDI with a database of 750,000 patient records. C. difficile is a major cause of healthcare-associated diarrhoea and is responsible for substantial mortality and morbidity, with relatively little known about its biology or its transmission epidemiology. Bayesian analysis of our model, via Markov chain Monte Carlo, provides new information about the biology of CDI, including genetic heterogeneity in infectiousness across different sequence types, and evidence for ward contamination as a significant mode of transmission, and allows inferences about the contribution of particular individuals, wards or hospitals to transmission of the bacterium, and assessment of changes in these over time following changes in hospital practice. Our work demonstrates the value of using statistical modelling and computational inference on large-scale hospital patient databases and genetic data.
Lean body mass, consisting mostly of skeletal muscle, is important for healthy aging. We performed a genome-wide association study for whole body (20 cohorts of European ancestry with n = 38,292) and appendicular (arms and legs) lean body mass (n = 28,330) measured using dual energy X-ray absorptiometry or bioelectrical impedance analysis, adjusted for sex, age, height, and fat mass. Twenty-one single-nucleotide polymorphisms were significantly associated with lean body mass either genome wide (p < 5 × 10(-8)) or suggestively genome wide (p < 2.3 × 10(-6)). Replication in 63,475 (47,227 of European ancestry) individuals from 33 cohorts for whole body lean body mass and in 45,090 (42,360 of European ancestry) subjects from 25 cohorts for appendicular lean body mass was successful for five single-nucleotide polymorphisms in/near HSD17B11, VCAN, ADAMTSL3, IRS1, and FTO for total lean body mass and for three single-nucleotide polymorphisms in/near VCAN, ADAMTSL3, and IRS1 for appendicular lean body mass. Our findings provide new insight into the genetics of lean body mass.Lean body mass is a highly heritable trait and is associated with various health conditions. Here, Kiel and colleagues perform a meta-analysis of genome-wide association studies for whole body lean body mass and find five novel genetic loci to be significantly associated.
Expression of HLA-C varies widely across individuals in an allele-specific manner. This variation in expression can influence efficacy of the immune response, as shown for infectious and autoimmune diseases. MicroRNA binding partially influences differential HLA-C expression, but the additional contributing factors have remained undetermined. Here we use functional and structural analyses to demonstrate that HLA-C expression is modulated not just at the RNA level, but also at the protein level. Specifically, we show that variation in exons 2 and 3, which encode the α1/α2 domains, drives differential expression of HLA-C allomorphs at the cell surface by influencing the structure of the peptide-binding cleft and the diversity of peptides bound by the HLA-C molecules. Together with a phylogenetic analysis, these results highlight the diversity and long-term balancing selection of regulatory factors that modulate HLA-C expression.
Osteosarcoma is a primary malignancy of bone that affects children and adults. Here, we present the largest sequencing study of osteosarcoma to date, comprising 112 childhood and adult tumours encompassing all major histological subtypes. A key finding of our study is the identification of mutations in insulin-like growth factor (IGF) signalling genes in 8/112 (7%) of cases. We validate this observation using fluorescence in situ hybridization (FISH) in an additional 87 osteosarcomas, with IGF1 receptor (IGF1R) amplification observed in 14% of tumours. These findings may inform patient selection in future trials of IGF1R inhibitors in osteosarcoma. Analysing patterns of mutation, we identify distinct rearrangement profiles including a process characterized by chromothripsis and amplification. This process operates recurrently at discrete genomic regions and generates driver mutations. It may represent an age-independent mutational mechanism that contributes to the development of osteosarcoma in children and adults alike.
In order to uncover miRNA changes in endometriosis pathogenesis, both endometriotic lesions and endometrial biopsies, as well as stromal and epithelial cells isolated from these tissues have been investigated and a large number of dysregulated miRNAs have been reported. However, the concordance between the result of different studies has remained small. One potential explanation for limited overlap between the proposed disease-related miRNAs could be the heterogeneity in tissue composition, as some studies have compared highly heterogeneous whole-lesion biopsies with endometrial tissue, some have compared the endometrium from patients and controls, and some have used pure cell fractions isolated from lesions and endometrium. This review focuses on the results of published miRNA studies in endometriosis to reveal the potential impact of tissue heterogeneity on the discovery of disease-specific miRNA alterations in endometriosis. Additionally, functional studies that explore the roles of endometriosis-involved miRNAs are discussed.
Robust and cost-effective genome editing in a diverse array of cells and model organisms is now possible thanks to the discovery of the RNA-guided endonucleases of the CRISPR-Cas system. The commonly used Cas9 of Streptococcus pyogenes shows high levels of activity but, depending on the application, has been associated with some shortcomings. Firstly, the enzyme has been shown to cause mutagenesis at genomic sequences resembling the target sequence. Secondly, the stringent requirement for a specific motif adjacent to the selected target site can limit the target range of this enzyme. Lastly, the physical size of Cas9 challenges the efficient delivery of genomic engineering tools based on this enzyme as viral particles for potential therapeutic applications. Related and parallel strategies have been employed to address these issues. Taking advantage of the wealth of structural information that is becoming available for CRISPR-Cas effector proteins, Cas9 has been redesigned by mutagenizing key residues contributing to activity and target recognition. The protein has also been shortened and redesigned into component subunits in an attempt to facilitate its efficient delivery. Furthermore, the CRISPR-Cas toolbox has been expanded by exploring the properties of Cas9 orthologues and other related effector proteins from diverse bacterial species, some of which exhibit different target site specificities and reduced molecular size. It is hoped that the improvements in accuracy, target range and efficiency of delivery will facilitate the therapeutic application of these site-specific nucleases.
The inner uterine lining (endometrium) is a unique tissue going through remarkable changes each menstrual cycle. Endometrium has its characteristic DNA methylation profile, although not much is known about the endometrial methylome changes throughout the menstrual cycle. The impact of methylome changes on gene expression and thereby on the function of the tissue, including establishing receptivity to implanting embryo, is also unclear. Therefore, this study used genome-wide technologies to characterize the methylome and the correlation between DNA methylation and gene expression in endometrial biopsies collected from 17 healthy fertile-aged women from pre-receptive and receptive phase within one menstrual cycle. Our study showed that the overall methylome remains relatively stable during this stage of the menstrual cycle, with small-scale changes affecting 5% of the studied CpG sites (22,272 out of studied 437,022 CpGs, FDR < 0.05). Of differentially methylated CpG sites with the largest absolute changes in methylation level, approximately 30% correlated with gene expression measured by RNA sequencing, with negative correlations being more common in 5' UTR and positive correlations in the gene 'Body' region. According to our results, extracellular matrix organization and immune response are the pathways most affected by methylation changes during the transition from pre-receptive to receptive phase.
SCIENCE, 356 (6343), | Citations: 1 (Web of Science Lite) | Read more2017. Resistance to malaria through structural variation of red blood cell invasion receptors
BACKGROUND: Plasmodium vivax is the most widespread human malaria geographically; however, no effective vaccine exists. Red blood cell invasion by the P. vivax merozoite depends on an interaction between the Duffy antigen receptor for chemokines (DARC) and region II of the parasite's Duffy-binding protein (PvDBP_RII). Naturally acquired binding-inhibitory antibodies against this interaction associate with clinical immunity, but it is unknown whether these responses can be induced by human vaccination. METHODS: Safety and immunogenicity of replication-deficient chimpanzee adenovirus serotype 63 (ChAd63) and modified vaccinia virus Ankara (MVA) viral vectored vaccines targeting PvDBP_RII (Salvador I strain) were assessed in an open-label dose-escalation phase Ia study in 24 healthy UK adults. Vaccines were delivered by the intramuscular route in a ChAd63-MVA heterologous prime-boost regimen using an 8-week interval. RESULTS: Both vaccines were well tolerated and demonstrated a favorable safety profile in malaria-naive adults. PvDBP_RII-specific ex-vivo IFN-γ T cell, antibody-secreting cell, memory B cell, and serum IgG responses were observed after the MVA boost immunization. Vaccine-induced antibodies inhibited the binding of vaccine homologous and heterologous variants of recombinant PvDBP_RII to the DARC receptor, with median 50% binding-inhibition titers greater than 1:100. CONCLUSION: We have demonstrated for the first time to our knowledge that strain-transcending antibodies can be induced against the PvDBP_RII antigen by vaccination in humans. These vaccine candidates warrant further clinical evaluation of efficacy against the blood-stage P. vivax parasite. TRIAL REGISTRATION: Clinicaltrials.gov NCT01816113. FUNDING: Support was provided by the UK Medical Research Council, UK National Institute of Health Research Oxford Biomedical Research Centre, and the Wellcome Trust.
Reduced cardiac vagal control reflected in low heart rate variability (HRV) is associated with greater risks for cardiac morbidity and mortality. In two-stage meta-analyses of genome-wide association studies for three HRV traits in up to 53,174 individuals of European ancestry, we detect 17 genome-wide significant SNPs in eight loci. HRV SNPs tag non-synonymous SNPs (in NDUFA11 and KIAA1755), expression quantitative trait loci (eQTLs) (influencing GNG11, RGS6 and NEO1), or are located in genes preferentially expressed in the sinoatrial node (GNG11, RGS6 and HCN4). Genetic risk scores account for 0.9 to 2.6% of the HRV variance. Significant genetic correlation is found for HRV with heart rate (-0.74<rg<-0.55) and blood pressure (-0.35<rg<-0.20). These findings provide clinically relevant biological insight into heritable variation in vagal heart rhythm regulation, with a key role for genetic variants (GNG11, RGS6) that influence G-protein heterotrimer action in GIRK-channel induced pacemaker membrane hyperpolarization.
Phenotypic variance heterogeneity across genotypes at a single nucleotide polymorphism (SNP) may reflect underlying gene-environment (G×E) or gene-gene interactions. We modeled variance heterogeneity for blood lipids and BMI in up to 44,211 participants and investigated relationships between variance effects (Pv), G×E interaction effects (with smoking and physical activity), and marginal genetic effects (Pm). Correlations between Pv and Pm were stronger for SNPs with established marginal effects (Spearman's ρ = 0.401 for triglycerides, and ρ = 0.236 for BMI) compared to all SNPs. When Pv and Pm were compared for all pruned SNPs, only BMI was statistically significant (Spearman's ρ = 0.010). Overall, SNPs with established marginal effects were overrepresented in the nominally significant part of the Pv distribution (Pbinomial <0.05). SNPs from the top 1% of the Pm distribution for BMI had more significant Pv values (PMann-Whitney = 1.46×10-5), and the odds ratio of SNPs with nominally significant (<0.05) Pm and Pv was 1.33 (95% CI: 1.12, 1.57) for BMI. Moreover, BMI SNPs with nominally significant G×E interaction P-values (Pint<0.05) were enriched with nominally significant Pv values (Pbinomial = 8.63×10-9 and 8.52×10-7 for SNP × smoking and SNP × physical activity, respectively). We conclude that some loci with strong marginal effects may be good candidates for G×E, and variance-based prioritization can be used to identify them.
BACKGROUND: Influenza challenge trials are important for vaccine efficacy testing. Currently, disease severity is determined by self-reported scores to a list of symptoms which can be highly subjective. A more objective measure would allow for improved data analysis. METHODS: Twenty-one volunteers participated in an influenza challenge trial. We calculated the daily sum of scores (DSS) for a list of 16 influenza symptoms. Whole blood collected at baseline and 24, 48, 72 and 96 h post challenge was profiled on Illumina HT12v4 microarrays. Changes in gene expression most strongly correlated with DSS were selected to train a Random Forest model and tested on two independent test sets consisting of 41 individuals profiled on a different microarray platform and 33 volunteers assayed by qRT-PCR. RESULTS: 1456 probes are significantly associated with DSS at 1% false discovery rate. We selected 19 genes with the largest fold change to train a random forest model. We observed good concordance between predicted and actual scores in the first test set (r = 0.57; RMSE = -16.1%) with the greatest agreement achieved on samples collected approximately 72 h post challenge. Therefore, we assayed samples collected at baseline and 72 h post challenge in the second test set by qRT-PCR and observed good concordance (r = 0.81; RMSE = -36.1%). CONCLUSIONS: We developed a 19-gene qRT-PCR panel to predict DSS, validated on two independent datasets. A transcriptomics based panel could provide a more objective measure of symptom scoring in future influenza challenge studies. Trial registration Samples were obtained from a clinical trial with the ClinicalTrials.gov Identifier: NCT02014870, first registered on December 5, 2013.
Melioidosis, a severe infection with the environmental bacterium Burkholderia pseudomallei, is being recognised increasingly frequently. What determines its uneven distribution within endemic areas is poorly understood. We cultured soil from a rice field in Laos for B. pseudomallei at different depths on 4 occasions over a 13-month period. We also measured physical and chemical parameters in order to identify associated characteristics. Overall, 195 of 653 samples (29.7%) yielded B. pseudomallei. A higher prevalence of B. pseudomallei was found at soil depths greater than the 30 cm currently recommended for B. pseudomallei environmental sampling. B. pseudomallei was associated with a high soil water content and low total nitrogen, carbon and organic matter content. Our results suggested that a sampling grid of 25 five metre square quadrats (i.e. 25 × 25 m) should be sufficient to detect B. pseudomallei at a given location if samples are taken at a soil depth of at least 60 cm. However, culture of B. pseudomallei in environmental samples is difficult and liable to variation. Future studies should both rely on molecular approaches and address the micro-heterogeneity of soil when investigating physico-chemical associations with the presence of B. pseudomallei.
Chronic lymphocytic leukaemia (CLL) consists of two biologically and clinically distinct subtypes defined by the abundance of somatic hypermutation (SHM) affecting the Ig variable heavy-chain locus (IgHV). The molecular mechanisms underlying these subtypes are incompletely understood. Here, we present a comprehensive whole-genome sequencing analysis of somatically acquired genetic events from 46 CLL patients, including a systematic comparison of coding and non-coding single-nucleotide variants, copy number variants and structural variants, regions of kataegis and mutation signatures between IgHV(mut) and IgHV(unmut) subtypes. We demonstrate that one-quarter of non-coding mutations in regions of kataegis outside the Ig loci are located in genes relevant to CLL. We show that non-coding mutations in ATM may negatively impact on ATM expression and find non-coding and regulatory region mutations in TCL1A, and in IgHV(unmut) CLL in IKZF3, SAMHD1,PAX5 and BIRC3. Finally, we show that IgHV(unmut) CLL is dominated by coding mutations in driver genes and an aging signature, whereas IgHV(mut) CLL has a high incidence of promoter and enhancer mutations caused by aberrant activation-induced cytidine deaminase activity. Taken together, our data support the hypothesis that differences in clinical outcome and biological characteristics between the two subgroups might reflect differences in mutation distribution, incidence and distinct underlying mutagenic mechanisms.Leukemia advance online publication, 27 June 2017; doi:10.1038/leu.2017.177.
© 2017 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland. The functional role of bone morphogenetic protein (BMP) signalling in colorectal cancer (CRC) is poorly defined, with contradictory results in cancer cell line models reflecting the inherent difficulties of assessing a signalling pathway that is context-dependent and subject to genetic constraints. By assessing the transcriptional response of a diploid human colonic epithelial cell line to BMP ligand stimulation, we generated a prognostic BMP signalling signature, which was applied to multiple CRC datasets to investigate BMP heterogeneity across CRC molecular subtypes. We linked BMP and Notch signalling pathway activity and function in human colonic epithelial cells, and normal and neoplastic tissue. BMP induced Notch through a γ-secretase-independent interaction, regulated by the SMAD proteins. In homeostasis, BMP/Notch co-localization was restricted to cells at the top of the intestinal crypt, with more widespread interaction in some human CRC samples. BMP signalling was downregulated in the majority of CRCs, but was conserved specifically in mesenchymal-subtype tumours, where it interacts with Notch to induce an epithelial–mesenchymal transition (EMT) phenotype. In intestinal homeostasis, BMP–Notch pathway crosstalk is restricted to differentiating cells through stringent pathway segregation. Conserved BMP activity and loss of signalling stringency in mesenchymal-subtype tumours promotes a synergistic BMP–Notch interaction, and this correlates with poor patient prognosis. BMP signalling heterogeneity across CRC subtypes and cell lines can account for previous experimental contradictions. Crosstalk between the BMP and Notch pathways will render mesenchymal-subtype CRC insensitive to γ-secretase inhibition unless BMP activation is concomitantly addressed. © 2017 The Authors. Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Several lines of evidence indicate that the most primitive staphylococcal species, those of the Staphylococcus sciuri group, were involved in the first stages of evolution of the staphylococcal cassette chromosome mec (SCCmec), the genetic element carrying the β-lactam resistance gene mecA However, many steps are still missing from this evolutionary history. In particular, it is not known how mecA was incorporated into the mobile element SCC prior to dissemination among Staphylococcus aureus and other pathogenic staphylococcal species. To gain insights into the possible contribution of several species of the Staphylococcus sciuri group to the assembly of SCCmec, we sequenced the genomes of 106 isolates, comprising S. sciuri (n = 76), Staphylococcus vitulinus (n = 18), and Staphylococcus fleurettii (n = 12) from animal and human sources, and characterized the native location of mecA and the SCC insertion site by using a variety of comparative genomic approaches. Moreover, we performed a single nucleotide polymorphism (SNP) analysis of the genomes in order to understand SCCmec evolution in relation to phylogeny. We found that each of three species of the S. sciuri group contributed to the evolution of SCCmec: S. vitulinus and S. fleurettii contributed to the assembly of the mec complex, and S. sciuri most likely provided the mobile element in which mecA was later incorporated. We hypothesize that an ancestral SCCmec III cassette (an element carried by one of the most epidemic methicillin-resistant S. aureus clones) originated in S. sciuri possibly by a recombination event in a human host or a human-created environment and later was transferred to S. aureus.
The malaria parasite Plasmodium falciparum invades human red blood cells by a series of interactions between host and parasite surface proteins. By analyzing genome sequence data from human populations, including 1269 individuals from sub-Saharan Africa, we identify a diverse array of large copy-number variants affecting the host invasion receptor genes GYPA and GYPB We find that a nearby association with severe malaria is explained by a complex structural rearrangement involving the loss of GYPB and gain of two GYPB-A hybrid genes, which encode a serologically distinct blood group antigen known as Dantu. This variant reduces the risk of severe malaria by 40% and has recently increased in frequency in parts of Kenya, yet it appears to be absent from west Africa. These findings link structural variation of red blood cell invasion receptors with natural resistance to severe malaria.
Atrial fibrillation affects more than 33 million people worldwide and increases the risk of stroke, heart failure, and death. Fourteen genetic loci have been associated with atrial fibrillation in European and Asian ancestry groups. To further define the genetic basis of atrial fibrillation, we performed large-scale, trans-ancestry meta-analyses of common and rare variant association studies. The genome-wide association studies (GWAS) included 17,931 individuals with atrial fibrillation and 115,142 referents; the exome-wide association studies (ExWAS) and rare variant association studies (RVAS) involved 22,346 cases and 132,086 referents. We identified 12 new genetic loci that exceeded genome-wide significance, implicating genes involved in cardiac electrical and structural remodeling. Our results nearly double the number of known genetic loci for atrial fibrillation, provide insights into the molecular basis of atrial fibrillation, and may facilitate the identification of new potential targets for drug discovery.
Type 2 diabetes is a global epidemic with major effects on healthcare expenditure and quality of life. Currently available treatments are inadequate for the prevention of comorbidities, yet progress towards new therapies remains slow. A major barrier is the insufficiency of traditional preclinical models for predicting drug efficacy and safety. Human genetics offers a complementary model to assess causal mechanisms for target validation. Genetic perturbations are 'experiments of nature' that provide a uniquely relevant window into the long-term effects of modulating specific targets. Here, we show that genetic discoveries over the past decades have accurately predicted (now known) therapeutic mechanisms for type 2 diabetes. These findings highlight the potential for use of human genetic variation for prospective target validation, and establish a framework for future applications. Studies into rare, monogenic forms of diabetes have also provided proof-of-principle for precision medicine, and the applicability of this paradigm to complex disease is discussed. Finally, we highlight some of the limitations that are relevant to the use of genome-wide association studies (GWAS) in the search for new therapies for diabetes. A key outstanding challenge is the translation of GWAS signals into disease biology and we outline possible solutions for tackling this experimental bottleneck.
The Middle East respiratory syndrome coronavirus (MERS-CoV) has infected more than 1900 humans, since 2012. The syndrome ranges from asymptomatic and mild cases to severe pneumonia and death. The virus is believed to be circulating in dromedary camels without notable symptoms since the 1980s. Therefore, dromedary camels are considered the only animal source of infection. Neither antiviral drugs nor vaccines are approved for veterinary or medical use despite active research on this area. Here, we developed four vaccine candidates against MERS-CoV based on ChAdOx1 and MVA viral vectors, two candidates per vector. All vaccines contained the full-length spike gene of MERS-CoV; ChAdOx1 MERS vaccines were produced with or without the leader sequence of the human tissue plasminogen activator gene (tPA) where MVA MERS vaccines were produced with tPA, but either the mH5 or F11 promoter driving expression of the spike gene. All vaccine candidates were evaluated in a mouse model in prime only or prime-boost regimens. ChAdOx1 MERS with tPA induced higher neutralising antibodies than ChAdOx1 MERS without tPA. A single dose of ChAdOx1 MERS with tPA elicited cellular immune responses as well as neutralising antibodies that were boosted to a significantly higher level by MVA MERS. The humoral immunogenicity of a single dose of ChAdOx1 MERS with tPA was equivalent to two doses of MVA MERS (also with tPA). MVA MERS with mH5 or F11 promoter induced similar antibody levels; however, F11 promoter enhanced the cellular immunogenicity of MVA MERS to significantly higher magnitudes. In conclusion, our study showed that MERS-CoV vaccine candidates could be optimized by utilising different viral vectors, various genetic designs of the vectors, or different regimens to increase immunogenicity. ChAdOx1 and MVA vectored vaccines have been safely evaluated in camels and humans and these MERS vaccine candidates should now be tested in camels and in clinical trials.
We provide in this paper a detailed characterization of the human peripheral CD4(+) CD127(low)CD25(+) regulatory T cell (Treg) compartment, with a particular emphasis in defining the population expressing higher levels of the IL-6 receptor (IL-6R). We provide a description of the phenotype of this population by assessing both the surface expression by flow cytometry as well as their transcriptional profile and functional features. In addition, we also present functional data describing the responsiveness of these subsets to IL-6 signalling in vitro and to IL-2 in vivo. The data presented in this paper support the research article "Human IL-6R(hi)TIGIT(-) CD4(+)CD127(low)CD25(+) T cells display potent in vitro suppressive capacity and a distinct Th17 profile" (Ferreira RC et al., 2017; doi: 10.1016/j.clim.2017.03.002) .
© 2017 Biometrika Trust. Bayesian robustness under model misspecification is a current area of active research. Among recent ideas is that of raising the likelihood function to a power. In this paper we discuss the choice of appropriate power and provide examples.
To characterize type 2 diabetes (T2D)-associated variation across the allele frequency spectrum, we conducted a meta-analysis of genome-wide association data from 26,676 T2D case and 132,532 control subjects of European ancestry after imputation using the 1000 Genomes multiethnic reference panel. Promising association signals were followed up in additional data sets (of 14,545 or 7,397 T2D case and 38,994 or 71,604 control subjects). We identified 13 novel T2D-associated loci (P < 5 × 10(-8)), including variants near the GLP2R, GIP, and HLA-DQA1 genes. Our analysis brought the total number of independent T2D associations to 128 distinct signals at 113 loci. Despite substantially increased sample size and more complete coverage of low-frequency variation, all novel associations were driven by common single nucleotide variants. Credible sets of potentially causal variants were generally larger than those based on imputation with earlier reference panels, consistent with resolution of causal signals to common risk haplotypes. Stratification of T2D-associated loci based on T2D-related quantitative trait associations revealed tissue-specific enrichment of regulatory annotations in pancreatic islet enhancers for loci influencing insulin secretion and in adipocytes, monocytes, and hepatocytes for insulin action-associated loci. These findings highlight the predominant role played by common variants of modest effect and the diversity of biological mechanisms influencing T2D pathophysiology.
Motivation: The identification of genetic variants influencing gene expression (known as expression quantitative trait loci or eQTLs) is important in unravelling the genetic basis of complex traits. Detecting multiple eQTLs simultaneously in a population based on paired DNA-seq and RNA-seq assays employs two competing types of models: models which rely on appropriate transformations of RNA-seq data (and are powered by a mature mathematical theory), or count-based models, which represent digital gene expression explicitly, thus rendering such transformations unnecessary. The latter constitutes an immensely popular methodology, which is however plagued by mathematical intractability. Results: We develop tractable count-based models, which are amenable to efficient estimation through the introduction of latent variables and the appropriate application of recent statistical theory in a sparse Bayesian modelling framework. Furthermore, we examine several transformation methods for RNA-seq read counts and we introduce arcsin, logit and Laplace smoothing as preprocessing steps for transformation-based models. Using natural and carefully simulated data from the 1000 Genomes and gEUVADIS projects, we benchmark both approaches under a variety of scenarios, including the presence of noise and violation of basic model assumptions. We demonstrate that an arcsin transformation of Laplace-smoothed data is at least as good as state-of-the-art models, particularly at small samples. Furthermore, we show that an over-dispersed Poisson model is comparable to the celebrated Negative Binomial, but much easier to estimate. These results provide strong support for transformation-based versus count-based (particularly Negative-Binomial-based) models for eQTL mapping. Availability and implementation: All methods are implemented in the free software eQTLseq: https://github.com/dvav/eQTLseq. Contact: email@example.com. Supplementary information: Supplementary data are available at Bioinformatics online.
Endometriosis is a heritable hormone-dependent gynecological disorder, associated with severe pelvic pain and reduced fertility; however, its molecular mechanisms remain largely unknown. Here we perform a meta-analysis of 11 genome-wide association case-control data sets, totalling 17,045 endometriosis cases and 191,596 controls. In addition to replicating previously reported loci, we identify five novel loci significantly associated with endometriosis risk (P<5 × 10(-8)), implicating genes involved in sex steroid hormone pathways (FN1, CCDC170, ESR1, SYNE1 and FSHB). Conditional analysis identified five secondary association signals, including two at the ESR1 locus, resulting in 19 independent single nucleotide polymorphisms (SNPs) robustly associated with endometriosis, which together explain up to 5.19% of variance in endometriosis. These results highlight novel variants in or near specific genes with important roles in sex steroid hormone signalling and function, and offer unique opportunities for more targeted functional research efforts.
BACKGROUND: Endometriosis is typically regarded as a premenopausal disease, resolving after natural or iatrogenic menopause due to declining oestrogen levels. Nonetheless, case reports over the years have highlighted the incidence of recurrent postmenopausal endometriosis. It is now clear that both recurrence and malignant transformation of endometriotic foci can occur in the postmenopausal period. Postmenopausal women are commonly treated with hormone replacement therapy (HRT) to treat climacteric symptoms and prevent bone loss; however, HRT may reactivate endometriosis and stimulate malignant transformation in women with a history of endometriosis. Given the uncertain risks of initiating HRT, it is difficult to determine the best menopausal management for this group of women. OBJECTIVE AND RATIONAL: The aim of this study was to systematically review the existing literature on management of menopausal symptoms in women with a history of endometriosis. We also aimed to evaluate the published literature on the risks associated with HRT in these women, and details regarding optimal formulations and timing (i.e. initiation and duration) of HRT. SEARCH METHODS: Four electronic databases (MEDLINE via OVID, Embase via OVID, PsycINFO via OVID and CINAHL via EbscoHost) were searched from database inception until June 2016, using a combination of relevant controlled vocabulary terms and free-text terms related to 'menopause' and 'endometriosis'. Inclusion criteria were: menopausal women with a history of endometriosis and menopausal treatment including HRT or other preparations. Case reports/series, observational studies and clinical trials were included. Narrative review articles, organizational guidelines and conference abstracts were excluded, as were studies that did not report on any form of menopausal management. Articles were assessed for risk of bias and quality using GRADE criteria. OUTCOMES: We present a synthesis of the existing case reports of endometriosis recurrence or malignant transformation in women undergoing treatment for menopausal symptoms. We highlight common presenting symptoms, potential risk factors and outcomes amongst the studies. Sparse high-quality evidence was identified, with few observational studies and only two randomized controlled trials. Given this paucity of data, no definitive conclusions can be drawn concerning risk. WIDER IMPLICATIONS: Due to the lack of high-quality studies, it remains unclear how to advise women with a history of endometriosis regarding the management of menopausal symptoms. The absolute risk of disease recurrence and malignant transformation cannot be quantified, and the impact of HRT use on these outcomes is not known. Multicentre randomized trials or large observational studies are urgently needed to inform clinicians and patients alike.
The indigenous populations of the South Pacific experience a high burden of rheumatic heart disease (RHD). Here we report a genome-wide association study (GWAS) of RHD susceptibility in 2,852 individuals recruited in eight Oceanian countries. Stratifying by ancestry, we analysed genotyped and imputed variants in Melanesians (607 cases and 1,229 controls) before follow-up of suggestive loci in three further ancestral groups: Polynesians, South Asians and Mixed or other populations (totalling 399 cases and 617 controls). We identify a novel susceptibility signal in the immunoglobulin heavy chain (IGH) locus centring on a haplotype of nonsynonymous variants in the IGHV4-61 gene segment corresponding to the IGHV4-61*02 allele. We show each copy of IGHV4-61*02 is associated with a 1.4-fold increase in the risk of RHD (odds ratio 1.43, 95% confidence intervals 1.27-1.61, P=4.1 × 10(-9)). These findings provide new insight into the role of germline variation in the IGH locus in disease susceptibility.
The tumour-associated antigen 5T4 is an attractive target for cancer immunotherapy. However to date, reported 5T4-specific cellular immune responses induced by various immunisation platforms have been largely weak or non-existent. In the present study, we have evaluated a heterologous prime boost regime based on the simian adenovirus ChAdOx1 and modified vaccinia virus Ankara (MVA) expressing 5T4 for immunogenicity and tumour protective efficacy in a mouse cancer model. Vaccination-induced immune responses were strong, durable and attributable primarily to CD8+ T cells. By comparison, homologous MVA vaccination regimen did not induce detectable 5T4-specific T cell responses. ChAdOx1-MVA vaccinated mice were completely protected against subsequent B16 melanoma challenge, but in therapeutic settings this regime was only modestly effective in delaying tumour outgrowth. Concomitant delivery of the vaccine with monoclonal antibodies (mAbs) targeting immune checkpoint regulators LAG-3, PD-1 or PD-L1 demonstrated that the combination of vaccine with anti PD-1 mAb could significantly delay tumour growth and increase overall survival of tumour-bearing mice. Our findings support a translation of the combinatorial approach based on the heterologous ChAdOx1-MVA vaccination platform with immune checkpoint blockade into the clinic for the treatment of 5T4-positive tumours such as prostate, renal, colorectal, gastric, ovarian, lung cancer and mesothelioma.
The Y chromosome is frequently lost in hematopoietic cells, which represents the most common somatic alteration in men. However, the mechanisms that regulate mosaic loss of chromosome Y (mLOY), and its clinical relevance, are unknown. We used genotype-array-intensity data and sequence reads from 85,542 men to identify 19 genomic regions (P < 5 × 10(-8)) that are associated with mLOY. Cumulatively, these loci also predicted X chromosome loss in women (n = 96,123; P = 4 × 10(-6)). Additional epigenome-wide methylation analyses using whole blood highlighted 36 differentially methylated sites associated with mLOY. The genes identified converge on aspects of cell proliferation and cell cycle regulation, including DNA synthesis (NPAT), DNA damage response (ATM), mitosis (PMF1, CENPN and MAD1L1) and apoptosis (TP53). We highlight the shared genetic architecture between mLOY and cancer susceptibility, in addition to inferring a causal effect of smoking on mLOY. Collectively, our results demonstrate that genotype-array-intensity data enables a measure of cell cycle efficiency at population scale and identifies genes implicated in aneuploidy, genome instability and cancer susceptibility.
Outcomes of hepatitis C virus (HCV) infection and treatment depend on viral and host genetic factors. Here we use human genome-wide genotyping arrays and new whole-genome HCV viral sequencing technologies to perform a systematic genome-to-genome study of 542 individuals who were chronically infected with HCV, predominantly genotype 3. We show that both alleles of genes encoding human leukocyte antigen molecules and genes encoding components of the interferon lambda innate immune system drive viral polymorphism. Additionally, we show that IFNL4 genotypes determine HCV viral load through a mechanism dependent on a specific amino acid residue in the HCV NS5A protein. These findings highlight the interplay between the innate immune system and the viral genome in HCV control.
Whole-genome sequencing (WGS) has transformed the understanding of the genetic drivers of cancer and is increasingly being used in cancer medicine to identify personalized therapies. Here we describe a case in which the application of WGS identified a tumoral BRCA2 deletion in a patient with aggressive dedifferentiated prostate cancer that was repeat-biopsied after disease progression. This would not have been detected by standard BRCA testing, and it led to additional treatment with a maintenance poly ADP ribose polymerase (PARP) inhibitor following platinum-based chemotherapy. This case demonstrates that repeat biopsy upon disease progression and application of WGS to tumor samples has meaningful clinical utility and the potential to transform outcomes in patients with cancer.
BACKGROUND: Obesity is genetically heterogeneous and highly heritable, although polymorphisms explain the phenotype in only a small proportion of obese children. We investigated the presence of copy number variations (CNVs) in "classical" genes known to be associated with (monogenic) early-onset obesity in children. METHODS: In 194 obese Caucasian children selected for early-onset and severe obesity from our obesity cohort we screened for deletions and/or duplications by multiplex ligation-dependent probe amplification reaction (MLPA). As we found one MLPA probe to interfere with a polymorphism in SIM1 we investigated its association with obesity and other phenotypic traits in our extended cohort of 2305 children. RESULTS: In the selected subset of most severely obese children, we did not find CNV with MLPA in POMC, LEP, LEPR, MC4R, MC3R or MC2R genes. However, one SIM1 probe located at exon 9 gave signals suggestive for SIM1 insufficiency in 52 patients. Polymerase chain reaction (PCR) analysis identified this as a false positive result due to interference with single nucleotide polymorphism (SNP) rs3734354/rs3734355. We, therefore, investigated for associations of this polymorphism with obesity and metabolic traits in our extended cohort. We found rs3734354/rs3734355 to be associated with body mass index-standard deviation score (BMI-SDS) (p = 0.003), but not with parameters of insulin metabolism, blood pressure or food intake. CONCLUSIONS: In our modest sample of severely obese children, we were unable to find CNVs in well-established monogenic obesity genes. Nevertheless, we found an association of rs3734354 in SIM1 with obesity of early-onset type in children, although not with obesity-related traits.
BACKGROUND: Understanding the genetic architecture of cardiac structure and function may help to prevent and treat heart disease. This investigation sought to identify common genetic variations associated with inter-individual variability in cardiac structure and function. METHODS: A GWAS meta-analysis of echocardiographic traits was performed, including 46,533 individuals from 30 studies (EchoGen consortium). The analysis included 16 traits of left ventricular (LV) structure, and systolic and diastolic function. RESULTS: The discovery analysis included 21 cohorts for structural and systolic function traits (n = 32,212) and 17 cohorts for diastolic function traits (n = 21,852). Replication was performed in 5 cohorts (n = 14,321) and 6 cohorts (n = 16,308), respectively. Besides 5 previously reported loci, the combined meta-analysis identified 10 additional genome-wide significant SNPs: rs12541595 near MTSS1 and rs10774625 in ATXN2 for LV end-diastolic internal dimension; rs806322 near KCNRG, rs4765663 in CACNA1C, rs6702619 near PALMD, rs7127129 in TMEM16A, rs11207426 near FGGY, rs17608766 in GOSR2, and rs17696696 in CFDP1 for aortic root diameter; and rs12440869 in IQCH for Doppler transmitral A-wave peak velocity. Findings were in part validated in other cohorts and in GWAS of related disease traits. The genetic loci showed associations with putative signaling pathways, and with gene expression in whole blood, monocytes, and myocardial tissue. CONCLUSION: The additional genetic loci identified in this large meta-analysis of cardiac structure and function provide insights into the underlying genetic architecture of cardiac structure and warrant follow-up in future functional studies. FUNDING: For detailed information per study, see Acknowledgments.
Few genome-wide association studies (GWAS) account for environmental exposures, like smoking, potentially impacting the overall trait variance when investigating the genetic contribution to obesity-related traits. Here, we use GWAS data from 51,080 current smokers and 190,178 nonsmokers (87% European descent) to identify loci influencing BMI and central adiposity, measured as waist circumference and waist-to-hip ratio both adjusted for BMI. We identify 23 novel genetic loci, and 9 loci with convincing evidence of gene-smoking interaction (GxSMK) on obesity-related traits. We show consistent direction of effect for all identified loci and significance for 18 novel and for 5 interaction loci in an independent study sample. These loci highlight novel biological functions, including response to oxidative stress, addictive behaviour, and regulatory functions emphasizing the importance of accounting for environment in genetic analyses. Our results suggest that tobacco smoking may alter the genetic susceptibility to overall adiposity and body fat distribution.
The leading malaria vaccine in development is the circumsporozoite protein (CSP)-based particle vaccine, RTS,S, which targets the pre-erythrocytic stage of Plasmodium falciparum infection. It induces modest levels of protective efficacy, thought to be mediated primarily by CSP-specific antibodies. We aimed to enhance vaccine efficacy by generating a more immunogenic CSP-based particle vaccine and therefore developed a next-generation RTS,S-like vaccine, called R21. The major improvement is that in contrast to RTS,S, R21 particles are formed from a single CSP-hepatitis B surface antigen (HBsAg) fusion protein, and this leads to a vaccine composed of a much higher proportion of CSP than in RTS,S. We demonstrate that in BALB/c mice R21 is immunogenic at very low doses and when administered with the adjuvants Abisco-100 and Matrix-M it elicits sterile protection against transgenic sporozoite challenge. Concurrent induction of potent cellular and humoral immune responses was also achieved by combining R21 with TRAP-based viral vectors and protective efficacy was significantly enhanced. In addition, in contrast to RTS,S, only a minimal antibody response to the HBsAg carrier was induced. These studies identify an anti-sporozoite vaccine component that may improve upon the current leading malaria vaccine RTS,S. R21 is now under evaluation in Phase 1/2a clinical trials.
Development of a protective and broadly-acting vaccine against the most widely distributed human malaria parasite, Plasmodium vivax, will be a major step towards malaria elimination. However, a P. vivax vaccine has remained elusive by the scarcity of pre-clinical models to test protective efficacy and support further clinical trials. In this study, we report the development of a highly protective CSP-based P. vivax vaccine, a virus-like particle (VLP) known as Rv21, able to provide 100% sterile protection against a stringent sporozoite challenge in rodent models to malaria, where IgG2a antibodies were associated with protection in absence of detectable PvCSP-specific T cell responses. Additionally, we generated two novel transgenic rodent P. berghei parasite lines, where the P. berghei csp gene coding sequence has been replaced with either full-length P. vivax VK210 or the allelic VK247 csp that additionally express GFP-Luciferase. Efficacy of Rv21 surpassed viral-vectored vaccination using ChAd63 and MVA. We show for the first time that a chimeric VK210/247 antigen can elicit high level cross-protection against parasites expressing either CSP allele, which provide accessible and affordable models suitable to support the development of P. vivax vaccines candidates. Rv21 is progressing to GMP production and has entered a path towards clinical evaluation.
Impacts of introgressive hybridisation may range from genomic erosion and species collapse to rapid adaptation and speciation but opportunities to study these dynamics are rare. We investigated the extent, causes and consequences of a hybrid zone between Anopheles coluzzii and Anopheles gambiae in Guinea-Bissau, where high hybridisation rates appear to be stable at least since the 1990s. Anopheles gambiae was genetically partitioned into inland and coastal subpopulations, separated by a central region dominated by A. coluzzii. Surprisingly, whole genome sequencing revealed that the coastal region harbours a hybrid form characterised by an A. gambiae-like sex chromosome and massive introgression of A. coluzzii autosomal alleles. Local selection on chromosomal inversions may play a role in this process, suggesting potential for spatiotemporal stability of the coastal hybrid form and providing resilience against introgression of medically-important loci and traits, found to be more prevalent in inland A. gambiae.
The IL23R region on chromosome 1 exhibits complex associations with ankylosing spondylitis (AS). We used publicly available epigenomic information and historical genetic association data to identify a putative regulatory element (PRE) in the intergenic region between IL23R and IL12RB2, which includes two single-nucleotide polymorphisms (SNPs) independently associated with AS-rs924080 (P=2 × 10(-3)) and rs11578380 (P=2 × 10(-4)). In luciferase reporter assays, this PRE showed silencer activity (P<0.001). Haplotype and conditional analysis of 4230 historical AS cases and 9700 controls revealed a possible AS-associated extended haplotype, including the PRE and risk variants at three SNPs (rs11209026, rs11209032 and rs924080), but excluding the rs11578380 risk variant. However, the rs924080 association was absent after conditioning on the primary association with rs11209032, which, in contrast, was robust to conditioning on all other AS-associated SNPs in this region (P<2 × 10(-8)). The role of this putative silencer on some IL23R extended haplotypes therefore remains unclear.
Ex vivo functional immunoassays such as ELISpot and intracellular cytokine staining (ICS) by flow cytometry are crucial tools in vaccine development both in the identification of novel immunogenic targets and in the immunological assessment of samples from clinical trials. Cryopreservation and subsequent thawing of PBMCs via validated processes has become a mainstay of clinical trials due to processing restrictions inherent in the disparate location and capacity of trial centres, and also in the need to standardize biological assays at central testing facilities. Logistical and financial requirement to batch process samples from multiple study timepoints are also key. We used ELISpot and ICS assays to assess antigen-specific immunogenicity in blood samples taken from subjects enrolled in a phase II malaria heterologous prime-boost vaccine trial and showed that the freeze thaw process can result in a 3-5-fold reduction of malaria antigen-specific IFNγ-producing CD3(+)CD4(+) effector populations from PBMC samples taken post vaccination. We have also demonstrated that peptide responsive CD8(+) T cells are relatively unaffected, as well as CD4(+) T cell populations that do not produce IFNγ. These findings contribute to a growing body of data that could be consolidated and synthesised as guidelines for clinical trials with the aim of increasing the efficiency of vaccine development pipelines.
Staphylococcus aureus are globally disseminated among farmed chickens causing skeletal muscle infections, dermatitis, and septicaemia. The emergence of poultry-associated lineages has involved zoonotic transmission from humans to chickens but questions remain about the specific adaptations that promote proliferation of chicken pathogens. We characterized genetic variation in a population of genome-sequenced S. aureus isolates of poultry and human origin. Genealogical analysis identified a dominant poultry-associated sequence cluster within the CC5 clonal complex. Poultry and human CC5 isolates were significantly distinct from each other and more recombination events were detected in the poultry isolates. We identified 44 recombination events in 33 genes along the branch extending to the poultry-specific CC5 cluster, and 47 genes were found more often in CC5 poultry isolates compared with those from humans. Many of these gene sequences were common in chicken isolates from other clonal complexes suggesting horizontal gene transfer among poultry associated lineages. Consistent with functional predictions for putative poultry-associated genes, poultry isolates showed enhanced growth at 42 °C and greater erythrocyte lysis on chicken blood agar in comparison with human isolates. By combining phenotype information with evolutionary analyses of staphylococcal genomes, we provide evidence of adaptation, following a human-to-poultry host transition. This has important implications for the emergence and dissemination of new pathogenic clones associated with modern agriculture.
Recent advances in highly multiplexed immunoassays have allowed systematic large-scale measurement of hundreds of plasma proteins in large cohort studies. In combination with genotyping, such studies offer the prospect to 1) identify mechanisms involved with regulation of protein expression in plasma, and 2) determine whether the plasma proteins are likely to be causally implicated in disease. We report here the results of genome-wide association (GWA) studies of 83 proteins considered relevant to cardiovascular disease (CVD), measured in 3,394 individuals with multiple CVD risk factors. We identified 79 genome-wide significant (p<5e-8) association signals, 55 of which replicated at P<0.0007 in separate validation studies (n = 2,639 individuals). Using automated text mining, manual curation, and network-based methods incorporating information on expression quantitative trait loci (eQTL), we propose plausible causal mechanisms for 25 trans-acting loci, including a potential post-translational regulation of stem cell factor by matrix metalloproteinase 9 and receptor-ligand pairs such as RANK-RANK ligand. Using public GWA study data, we further evaluate all 79 loci for their causal effect on coronary artery disease, and highlight several potentially causal associations. Overall, a majority of the plasma proteins studied showed evidence of regulation at the genetic level. Our results enable future studies of the causal architecture of human disease, which in turn should aid discovery of new drug targets.
The epidemiologically most important mechanism of antibiotic resistance in Staphylococcus aureus is associated with mecA-an acquired gene encoding an extra penicillin-binding protein (PBP2a) with low affinity to virtually all β-lactams. The introduction of mecA into the S. aureus chromosome has led to the emergence of methicillin-resistant S. aureus (MRSA) pandemics, responsible for high rates of mortality worldwide. Nonetheless, little is known regarding the origin and evolution of mecA. Different mecA homologues have been identified in species belonging to the Staphylococcus sciuri group representing the most primitive staphylococci. In this study we aimed to identify evolutionary steps linking these mecA precursors to the β-lactam resistance gene mecA and the resistance phenotype. We sequenced genomes of 106 S. sciuri, S. vitulinus and S. fleurettii strains and determined their oxacillin susceptibility profiles. Single-nucleotide polymorphism (SNP) analysis of the core genome was performed to assess the genetic relatedness of the isolates. Phylogenetic analysis of the mecA gene homologues and promoters was achieved through nucleotide/amino acid sequence alignments and mutation rates were estimated using a Bayesian analysis. Furthermore, the predicted structure of mecA homologue-encoded PBPs of oxacillin-susceptible and -resistant strains were compared. We showed for the first time that oxacillin resistance in the S. sciuri group has emerged multiple times and by a variety of different mechanisms. Development of resistance occurred through several steps including structural diversification of the non-binding domain of native PBPs; changes in the promoters of mecA homologues; acquisition of SCCmec and adaptation of the bacterial genetic background. Moreover, our results suggest that it was exposure to β-lactams in human-created environments that has driven evolution of native PBPs towards a resistance determinant. The evolution of β-lactam resistance in staphylococci highlights the numerous resources available to bacteria to adapt to the selective pressure of antibiotics.
Somatic cells acquire mutations throughout the course of an individual's life. Mutations occurring early in embryogenesis are often present in a substantial proportion of, but not all, cells in postnatal humans and thus have particular characteristics and effects. Depending on their location in the genome and the proportion of cells they are present in, these mosaic mutations can cause a wide range of genetic disease syndromes and predispose carriers to cancer. They have a high chance of being transmitted to offspring as de novo germline mutations and, in principle, can provide insights into early human embryonic cell lineages and their contributions to adult tissues. Although it is known that gross chromosomal abnormalities are remarkably common in early human embryos, our understanding of early embryonic somatic mutations is very limited. Here we use whole-genome sequences of normal blood from 241 adults to identify 163 early embryonic mutations. We estimate that approximately three base substitution mutations occur per cell per cell-doubling event in early human embryogenesis and these are mainly attributable to two known mutational signatures. We used the mutations to reconstruct developmental lineages of adult cells and demonstrate that the two daughter cells of many early embryonic cell-doubling events contribute asymmetrically to adult blood at an approximately 2:1 ratio. This study therefore provides insights into the mutation rates, mutational processes and developmental outcomes of cell dynamics that operate during early human embryogenesis.
Atrial fibrillation (AF) is a growing public health burden, and its treatment remains a challenge. AF leads to electrical remodeling of the atria, which in turn promotes AF maintenance and resistance to treatment. Although remodeling has long been a therapeutic target in AF, its causes remain poorly understood. We show that atrial-specific up-regulation of microRNA-31 (miR-31) in goat and human AF depletes neuronal nitric oxide synthase (nNOS) by accelerating mRNA decay and alters nNOS subcellular localization by repressing dystrophin translation. By shortening action potential duration and abolishing rate-dependent adaptation of the action potential duration, miR-31 overexpression and/or disruption of nNOS signaling recapitulates features of AF-induced remodeling and significantly increases AF inducibility in mice in vivo. By contrast, silencing miR-31 in atrial myocytes from patients with AF restores dystrophin and nNOS and normalizes action potential duration and its rate dependency. These findings identify atrial-specific up-regulation of miR-31 in human AF as a key mechanism causing atrial dystrophin and nNOS depletion, which in turn contributes to the atrial phenotype begetting this arrhythmia. miR-31 may therefore represent a potential therapeutic target in AF.
To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional 390,225 variants from exome sequence in up to 39,339 normoglycemic individuals from five ancestry groups. We identified a novel association between the coding variant (p.Pro50Thr) in AKT2 and fasting plasma insulin (FI), a gene in which rare fully penetrant mutations are causal for monogenic glycemic disorders. The low-frequency allele is associated with a 12% increase in FI levels. This variant is present at 1.1% frequency in Finns but virtually absent in individuals from other ancestries. Carriers of the FI-increasing allele had increased 2-h insulin values, decreased insulin sensitivity, and increased risk of type 2 diabetes (odds ratio 1.05). In cellular studies, the AKT2-Thr50 protein exhibited a partial loss of function. We extend the allelic spectrum for coding variants in AKT2 associated with disorders of glucose homeostasis and demonstrate bidirectional effects of variants within the pleckstrin homology domain of AKT2.
Over 150 different proteins attach to the plasma membrane using glycosylphosphatidylinositol (GPI) anchors. Mutations in 18 genes that encode components of GPI-anchor biogenesis result in a phenotypic spectrum that includes learning disability, epilepsy, microcephaly, congenital malformations and mild dysmorphic features. To determine the incidence of GPI-anchor defects, we analysed the exome data from 4293 parent-child trios recruited to the Deciphering Developmental Disorders (DDD) study. All probands recruited had a neurodevelopmental disorder. We searched for variants in 31 genes linked to GPI-anchor biogenesis and detected rare biallelic variants in PGAP3, PIGN, PIGT (n=2), PIGO and PIGL, providing a likely diagnosis for six families. In five families, the variants were in a compound heterozygous configuration while in a consanguineous Afghani kindred, a homozygous c.709G>C; p.(E237Q) variant in PIGT was identified within 10-12 Mb of autozygosity. Validation and segregation analysis was performed using Sanger sequencing. Across the six families, five siblings were available for testing and in all cases variants co-segregated consistent with them being causative. In four families, abnormal alkaline phosphatase results were observed in the direction expected. FACS analysis of knockout HEK293 cells that had been transfected with wild-type or mutant cDNA constructs demonstrated that the variants in PIGN, PIGT and PIGO all led to reduced activity. Splicing assays, performed using leucocyte RNA, showed that a c.336-2A>G variant in PIGL resulted in exon skipping and p.D113fs*2. Our results strengthen recently reported disease associations, suggest that defective GPI-anchor biogenesis may explain ~0.15% of individuals with developmental disorders and highlight the benefits of data sharing.
Whole-exome/whole-genome sequencing (WES/WGS) has the potential to enhance genetic diagnosis of rare disease, and is increasingly becoming part of routine clinical care in mainstream medicine. Effective translation will require ongoing efforts in a number of areas including: selection of appropriate patients, provision of effective consent, pre- and post-test genetic counselling, improving variant interpretation algorithms and practices, and management of secondary findings including those found incidentally and those actively sought. Allied to this is the need for an effective education programme for all members of clinical teams involved in care of patients with rare disease, as well as to maintain public confidence in the use of these technologies. We established a Genomic Medicine Multidisciplinary Team (GM-MDT) in 2014 to build on the experiences of earlier successful research-based WES/WGS studies, to address these needs and to review results including pertinent and secondary findings. Here we report on a qualitative study of decision-making in the GM-MDT combined with analysis of semi-structured interviews with GM-MDT members. Study findings show that members appreciate the clinical and scientific diversity of the GM-MDT and value it for education and oversight. To date, discussions have focussed on case selection including the extent and interpretation of clinical and family history information required to establish likely monogenic aetiology and inheritance model. Achieving a balance between effective use of WES/WGS - prioritising cases in a diverse and highly complex patient population where WES/WGS will be tractable - and meeting the recruitment targets of a large project is considered challenging.
BMJ, 356 pp. j1388. | Citations: 1 (Web of Science Lite) | Read more2017. Urgent improvements needed to diagnose and manage Lynch syndrome.
Phylogenetic methods have shown promise in understanding the development of broadly neutralizing antibody lineages (bNAbs). However, the mutational process that generates these lineages, somatic hypermutation, is biased by hotspot motifs which violates important assumptions in most phylogenetic substitution models. Here, we develop a modified GY94-type substitution model that partially accounts for this context dependency while preserving independence of sites during calculation. This model shows a substantially better fit to three well-characterized bNAb lineages than the standard GY94 model. We also demonstrate how our model can be used to test hypotheses concerning the roles of different hotspot and coldspot motifs in the evolution of B-cell lineages. Further, we explore the consequences of the idea that the number of hotspot motifs, and perhaps the mutation rate in general, is expected to decay over time in individual bNAb lineages.
The functional role of bone morphogenetic protein (BMP) signalling in colorectal cancer (CRC) is poorly defined, with contradictory results in cancer cell line models reflecting the inherent difficulties of assessing a signalling pathway that is context-dependent and subject to genetic constraints. By assessing the transcriptional response of a diploid human colonic epithelial cell line to BMP ligand stimulation, we generated a prognostic BMP signalling signature, which was applied to multiple CRC datasets to investigate BMP heterogeneity across CRC molecular subtypes. We linked BMP and Notch signalling pathway activity and function in human colonic epithelial cells, and normal and neoplastic tissue. BMP induced Notch through a γ-secretase-independent interaction, regulated by the SMAD proteins. In homeostasis, BMP/Notch co-localization was restricted to cells at the top of the intestinal crypt, with more widespread interaction in some human CRC samples. BMP signalling was downregulated in the majority of CRCs, but was conserved specifically in mesenchymal-subtype tumours, where it interacts with Notch to induce an epithelial-mesenchymal transition (EMT) phenotype. In intestinal homeostasis, BMP-Notch pathway crosstalk is restricted to differentiating cells through stringent pathway segregation. Conserved BMP activity and loss of signalling stringency in mesenchymal-subtype tumours promotes a synergistic BMP-Notch interaction, and this correlates with poor patient prognosis. BMP signalling heterogeneity across CRC subtypes and cell lines can account for previous experimental contradictions. Crosstalk between the BMP and Notch pathways will render mesenchymal-subtype CRC insensitive to γ-secretase inhibition unless BMP activation is concomitantly addressed. © 2017 The Authors. Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
MUFAs are unsaturated FAs with one double bond and are derived from endogenous synthesis and dietary intake. Accumulating evidence has suggested that plasma and erythrocyte MUFA levels are associated with cardiometabolic disorders, including CVD, T2D, and metabolic syndrome (MS). Previous genome-wide association studies (GWASs) have identified seven loci for plasma and erythrocyte palmitoleic and oleic acid levels in populations of European origin. To identify additional MUFA-associated loci and the potential functional variant at each locus, we performed ethnic-specific GWAS meta-analyses and trans-ethnic meta-analyses in more than 15,000 participants of Chinese and European ancestry. We identified novel genome-wide significant associations for vaccenic acid at FADS1/2 and PKD2L1 [log10(Bayes factor) ≥ 8.07] and for gondoic acid at FADS1/2 and GCKR [log10(Bayes factor) ≥ 6.22], and also observed improved fine-mapping resolutions at FADS1/2 and GCKR loci. The greatest improvement was observed at GCKR, where the number of variants in the 99% credible set was reduced from 16 (covering 94.8 kb) to 5 (covering 19.6 kb, including a missense variant rs1260326) after trans-ethnic meta-analysis. We also confirmed the previously reported associations of PKD2L1, FADS1/2, GCKR, and HIF1AN with palmitoleic acid and of FADS1/2 and LPCAT3 with oleic acid in the Chinese-specific GWAS and the trans-ethnic meta-analyses. Pathway-based analyses suggested that the identified loci were in unsaturated FA metabolism and signaling pathways. Our findings provide novel insight into the genetic basis relevant to MUFA metabolism and biology.
Modeling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data, but all rely on heuristic non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference based on a Bayesian hierarchical mixture of factor analyzers. Our model exhibits competitive performance on large datasets despite implementing full Markov-Chain Monte Carlo sampling, and its unique hierarchical prior structure enables automatic determination of genes driving the bifurcation process. We additionally propose an Empirical-Bayes like extension that deals with the high levels of zero-inflation in single-cell RNA-seq data and quantify when such models are useful. We apply or model to both real and simulated single-cell gene expression data and compare the results to existing pseudotime methods. Finally, we discuss both the merits and weaknesses of such a unified, probabilistic approach in the context practical bioinformatics analyses.
Breast cancer risks conferred by many germline missense variants in the BRCA1 and BRCA2 genes, often referred to as variants of uncertain significance (VUS), have not been established. In this study, associations between 19 BRCA1 and 33 BRCA2 missense substitution variants and breast cancer risk were investigated through a breast cancer case-control study using genotyping data from 38 studies of predominantly European ancestry (41,890 cases and 41,607 controls) and nine studies of Asian ancestry (6,269 cases and 6,624 controls). The BRCA2 c.9104A>C, p.Tyr3035Ser (OR = 2.52; P = 0.04), and BRCA1 c.5096G>A, p.Arg1699Gln (OR = 4.29; P = 0.009) variant were associated with moderately increased risks of breast cancer among Europeans, whereas BRCA2 c.7522G>A, p.Gly2508Ser (OR = 2.68; P = 0.004), and c.8187G>T, p.Lys2729Asn (OR = 1.4; P = 0.004) were associated with moderate and low risks of breast cancer among Asians. Functional characterization of the BRCA2 variants using four quantitative assays showed reduced BRCA2 activity for p.Tyr3035Ser compared with wild-type. Overall, our results show how BRCA2 missense variants that influence protein function can confer clinically relevant, moderately increased risks of breast cancer, with potential implications for risk management guidelines in women with these specific variants. Cancer Res; 77(11); 2789-99. ©2017 AACR.
To date many clinical studies aim to increase the number and/or fitness of CD4(+)CD127(low)CD25(+) regulatory T cells (Tregs) in vivo to harness their regulatory potential in the context of treating autoimmune disease. Here, we sought to define the phenotype and function of Tregs expressing the highest levels of IL-6 receptor (IL-6R). We have identified a population of CD4(+)CD127(low)CD25(+) TIGIT(-) T cells distinguished by their elevated IL-6R expression that lacked expression of HELIOS, showed higher CTLA-4 expression, and displayed increased suppressive capacity compared to IL-6R(hi)TIGIT(+) Tregs. IL-6R(hi)TIGIT(-) CD127(low)CD25(+) T cells contained a majority of cells demethylated at FOXP3 and displayed a Th17 transcriptional signature, including RORC (RORγt) and the capacity of producing both pro- and anti-inflammatory cytokines, such as IL-17, IL-22 and IL-10. We propose that in vivo, in the presence of IL-6-associated inflammation, the suppressive function of CD4(+)CD127(low)CD25(+) FOXP3(+)IL-6R(hi)TIGIT(-) T cells is temporarily disarmed allowing further activation of the effector functions and potential pathogenic tissue damage.
Routine full characterization of Mycobacterium tuberculosis is culture based, taking many weeks. Whole-genome sequencing (WGS) can generate antibiotic susceptibility profiles to inform treatment, augmented with strain information for global surveillance; such data could be transformative if provided at or near the point of care. We demonstrate a low-cost method of DNA extraction directly from patient samples for M. tuberculosis WGS. We initially evaluated the method by using the Illumina MiSeq sequencer (40 smear-positive respiratory samples obtained after routine clinical testing and 27 matched liquid cultures). M. tuberculosis was identified in all 39 samples from which DNA was successfully extracted. Sufficient data for antibiotic susceptibility prediction were obtained from 24 (62%) samples; all results were concordant with reference laboratory phenotypes. Phylogenetic placement was concordant between direct and cultured samples. With Illumina MiSeq/MiniSeq, the workflow from patient sample to results can be completed in 44/16 h at a reagent cost of £96/£198 per sample. We then employed a nonspecific PCR-based library preparation method for sequencing on an Oxford Nanopore Technologies MinION sequencer. We applied this to cultured Mycobacterium bovis strain BCG DNA and to combined culture-negative sputum DNA and BCG DNA. For flow cell version R9.4, the estimated turnaround time from patient to identification of BCG, detection of pyrazinamide resistance, and phylogenetic placement was 7.5 h, with full susceptibility results 5 h later. Antibiotic susceptibility predictions were fully concordant. A critical advantage of MinION is the ability to continue sequencing until sufficient coverage is obtained, providing a potential solution to the problem of variable amounts of M. tuberculosis DNA in direct samples.
Most cancers evolve from a single founder cell through a series of clonal expansions that are driven by somatic mutations. These clonal expansions can lead to several coexisting subclones sharing subsets of mutations. Analysis of massively parallel sequencing data can infer a tumor's subclonal composition through the identification of populations of cells with shared mutations. We describe the principles that underlie subclonal reconstruction through single nucleotide variants (SNVs) or copy number alterations (CNAs) from bulk or single-cell sequencing. These principles include estimating the fraction of tumor cells for SNVs and CNAs, performing clustering of SNVs from single- and multisample cases, and single-cell sequencing. The application of subclonal reconstruction methods is providing key insights into tumor evolution, identifying subclonal driver mutations, patterns of parallel evolution and differences in mutational signatures between cellular populations, and characterizing the mechanisms of therapy resistance, spread, and metastasis.
Chronic obstructive pulmonary disease (COPD) is characterized by reduced lung function and is the third leading cause of death globally. Through genome-wide association discovery in 48,943 individuals, selected from extremes of the lung function distribution in UK Biobank, and follow-up in 95,375 individuals, we increased the yield of independent signals for lung function from 54 to 97. A genetic risk score was associated with COPD susceptibility (odds ratio per 1 s.d. of the risk score (∼6 alleles) (95% confidence interval) = 1.24 (1.20-1.27), P = 5.05 × 10(-49)), and we observed a 3.7-fold difference in COPD risk between individuals in the highest and lowest genetic risk score deciles in UK Biobank. The 97 signals show enrichment in genes for development, elastic fibers and epigenetic regulation pathways. We highlight targets for drugs and compounds in development for COPD and asthma (genes in the inositol phosphate metabolism pathway and CHRM3) and describe targets for potential drug repositioning from other clinical indications.
Molecular Ecology, 26 (11), pp. 2880-2894. | Read more2017. Population genetic structure and adaptation of malaria parasites on the edge of endemic distribution
Inappropriate activation or inadequate regulation of CD4+ and CD8+ T cells may contribute to the initiation and progression of multiple autoimmune and inflammatory diseases. Studies on disease-associated genetic polymorphisms have highlighted the importance of biological context for many regulatory variants, which is particularly relevant in understanding the genetic regulation of the immune system and its cellular phenotypes. Here we show cell type-specific regulation of transcript levels of genes associated with several autoimmune diseases in CD4+ and CD8+ T cells including a trans-acting regulatory locus at chr12q13.2 containing the rs1131017 SNP in the RPS26 gene. Most remarkably, we identify a common missense variant in IL27, associated with type 1 diabetes that results in decreased functional activity of the protein and reduced expression levels of downstream IRF1 and STAT1 in CD4+ T cells only. Altogether, our results indicate that eQTL mapping in purified T cells provides novel functional insights into polymorphisms and pathways associated with autoimmune diseases.
Progress in sepsis research has been severely hampered by a heterogeneous disease phenotype, limiting the interpretation of clinical trials and the development of effective therapeutic interventions. Application of omics-based methodologies is advancing understanding of the dysregulated host immune response to infection in sepsis. However, the frequently elusive nature of the infecting organism in sepsis has limited efforts to understand the effect of disease heterogeneity involving the pathogen. Recent advances in nucleic acid sequencing-based pathogen analysis provide the opportunity for more accurate and comprehensive microbiological diagnosis. In this Review, we explore how better understanding of the host-pathogen interaction can substantially enhance, and in turn benefit from, current and future application of omics-based approaches to understand the host response in sepsis. We illustrate this using recent work accounting for heterogeneity involving the pathogen. We propose that there is a timely opportunity to further resolve sepsis heterogeneity by considering host-pathogen interactions, enabling progress towards a precision medicine approach.
Sepsis is a deleterious inflammatory response to infection with high mortality. Reliable sepsis biomarkers could improve diagnosis, prognosis, and treatment. Integration of human genetics, patient metabolite and cytokine measurements, and testing in a mouse model demonstrate that the methionine salvage pathway is a regulator of sepsis that can accurately predict prognosis in patients. Pathway-based genome-wide association analysis of nontyphoidal Salmonella bacteremia showed a strong enrichment for single-nucleotide polymorphisms near the components of the methionine salvage pathway. Measurement of the pathway's substrate, methylthioadenosine (MTA), in two cohorts of sepsis patients demonstrated increased plasma MTA in nonsurvivors. Plasma MTA was correlated with levels of inflammatory cytokines, indicating that elevated MTA marks a subset of patients with excessive inflammation. A machine-learning model combining MTA and other variables yielded approximately 80% accuracy (area under the curve) in predicting death. Furthermore, mice infected with Salmonella had prolonged survival when MTA was administered before infection, suggesting that manipulating MTA levels could regulate the severity of the inflammatory response. Our results demonstrate how combining genetic data, biomolecule measurements, and animal models can shape our understanding of disease and lead to new biomarkers for patient stratification and potential therapeutic targeting.
KIAA0319 is a transmembrane protein associated with dyslexia with a presumed role in neuronal migration. Here we show that KIAA0319 expression is not restricted to the brain but also occurs in sensory and spinal cord neurons, increasing from early postnatal stages to adulthood and being downregulated by injury. This suggested that KIAA0319 participates in functions unrelated to neuronal migration. Supporting this hypothesis, overexpression of KIAA0319 repressed axon growth in hippocampal and dorsal root ganglia neurons; the intracellular domain of KIAA0319 was sufficient to elicit this effect. A similar inhibitory effect was observed in vivo as axon regeneration was impaired after transduction of sensory neurons with KIAA0319. Conversely, the deletion of Kiaa0319 in neurons increased neurite outgrowth in vitro and improved axon regeneration in vivo. At the mechanistic level, KIAA0319 engaged the JAK2-SH2B1 pathway to activate Smad2, which played a central role in KIAA0319-mediated repression of axon growth. In summary, we establish KIAA0319 as a novel player in axon growth and regeneration with the ability to repress the intrinsic growth potential of axons. This study describes a novel regulatory mechanism operating during peripheral nervous system and central nervous system axon growth, and offers novel targets for the development of effective therapies to promote axon regeneration.
Hereditary mixed polyposis syndrome is a rare colon cancer predisposition syndrome caused by a duplication of a noncoding sequence near the gremlin 1, DAN family BMP antagonist gene (GREM1) originally described in Ashkenazi Jews. Few families with GREM1 duplications have been described, so there are many questions about detection and management. We report 4 extended families with the duplication near GREM1 previously found in Ashkenazi Jews; 3 families were identified at cancer genetic clinics in Israel and 1 family was identified in a cohort of patients with familial colorectal cancer. Their clinical features include extracolonic tumors, onset of polyps in adolescence, and rapid progression of some polyps to advanced adenomas. One family met diagnostic criteria for Lynch syndrome. Expansion of the hereditary mixed polyposis syndrome phenotype can inform surveillance strategies for carriers of GREM1 duplications.
Importance: The causal direction and magnitude of the association between telomere length and incidence of cancer and non-neoplastic diseases is uncertain owing to the susceptibility of observational studies to confounding and reverse causation. Objective: To conduct a Mendelian randomization study, using germline genetic variants as instrumental variables, to appraise the causal relevance of telomere length for risk of cancer and non-neoplastic diseases. Data Sources: Genomewide association studies (GWAS) published up to January 15, 2015. Study Selection: GWAS of noncommunicable diseases that assayed germline genetic variation and did not select cohort or control participants on the basis of preexisting diseases. Of 163 GWAS of noncommunicable diseases identified, summary data from 103 were available. Data Extraction and Synthesis: Summary association statistics for single nucleotide polymorphisms (SNPs) that are strongly associated with telomere length in the general population. Main Outcomes and Measures: Odds ratios (ORs) and 95% confidence intervals (CIs) for disease per standard deviation (SD) higher telomere length due to germline genetic variation. Results: Summary data were available for 35 cancers and 48 non-neoplastic diseases, corresponding to 420 081 cases (median cases, 2526 per disease) and 1 093 105 controls (median, 6789 per disease). Increased telomere length due to germline genetic variation was generally associated with increased risk for site-specific cancers. The strongest associations (ORs [95% CIs] per 1-SD change in genetically increased telomere length) were observed for glioma, 5.27 (3.15-8.81); serous low-malignant-potential ovarian cancer, 4.35 (2.39-7.94); lung adenocarcinoma, 3.19 (2.40-4.22); neuroblastoma, 2.98 (1.92-4.62); bladder cancer, 2.19 (1.32-3.66); melanoma, 1.87 (1.55-2.26); testicular cancer, 1.76 (1.02-3.04); kidney cancer, 1.55 (1.08-2.23); and endometrial cancer, 1.31 (1.07-1.61). Associations were stronger for rarer cancers and at tissue sites with lower rates of stem cell division. There was generally little evidence of association between genetically increased telomere length and risk of psychiatric, autoimmune, inflammatory, diabetic, and other non-neoplastic diseases, except for coronary heart disease (OR, 0.78 [95% CI, 0.67-0.90]), abdominal aortic aneurysm (OR, 0.63 [95% CI, 0.49-0.81]), celiac disease (OR, 0.42 [95% CI, 0.28-0.61]) and interstitial lung disease (OR, 0.09 [95% CI, 0.05-0.15]). Conclusions and Relevance: It is likely that longer telomeres increase risk for several cancers but reduce risk for some non-neoplastic diseases, including cardiovascular diseases.
Single particle cryoEM has emerged as a powerful method for structure determination of proteins and complexes, complementing X-ray crystallography and NMR spectroscopy. Yet, for many systems, the resolution of cryoEM density map has been limited to 4-6 Å, which only allows for resolving bulky amino acids side chains, thus hindering accurate model building from the density map. On the other hand, experimental chemical shifts (CS) from solution and solid state MAS NMR spectra provide atomic level data for each amino acid within a molecule or a complex; however, structure determination of large complexes and assemblies based on NMR data alone remains challenging. Here, we present a novel integrated strategy to combine the highly complementary experimental data from cryoEM and NMR computationally by molecular dynamics simulations to derive an atomistic model, which is not attainable by either approach alone. We use the HIV-1 capsid protein (CA) C-terminal domain as well as the large capsid assembly to demonstrate the feasibility of this approach, termed NMR CS-biased cryoEM structure refinement.
BACKGROUND: Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseases or traits. OBJECTIVES: This study sought to systematically test if genetic variants identified for non-CAD diseases/traits also associate with CAD and to undertake a comprehensive analysis of the extent of pleiotropy of all CAD loci. METHODS: In discovery analyses involving 42,335 CAD cases and 78,240 control subjects we tested the association of 29,383 common (minor allele frequency >5%) single nucleotide polymorphisms available on the exome array, which included a substantial proportion of known or suspected single nucleotide polymorphisms associated with common diseases or traits as of 2011. Suggestive association signals were replicated in an additional 30,533 cases and 42,530 control subjects. To evaluate pleiotropy, we tested CAD loci for association with cardiovascular risk factors (lipid traits, blood pressure phenotypes, body mass index, diabetes, and smoking behavior), as well as with other diseases/traits through interrogation of currently available genome-wide association study catalogs. RESULTS: We identified 6 new loci associated with CAD at genome-wide significance: on 2q37 (KCNJ13-GIGYF2), 6p21 (C2), 11p15 (MRVI1-CTR9), 12q13 (LRP1), 12q24 (SCARB1), and 16q13 (CETP). Risk allele frequencies ranged from 0.15 to 0.86, and odds ratio per copy of the risk allele ranged from 1.04 to 1.09. Of 62 new and known CAD loci, 24 (38.7%) showed statistical association with a traditional cardiovascular risk factor, with some showing multiple associations, and 29 (47%) showed associations at p < 1 × 10(-4) with a range of other diseases/traits. CONCLUSIONS: We identified 6 loci associated with CAD at genome-wide significance. Several CAD loci show substantial pleiotropy, which may help us understand the mechanisms by which these loci affect CAD risk.
BACKGROUND: Expression quantitative trait loci (eQTL) databases represent a valuable resource to link disease-associated SNPs to specific candidate genes whose gene expression is significantly modulated by the SNP under investigation. We previously identified signal inhibitory receptor on leukocytes-1 (SIRL-1) as a powerful regulator of human innate immune cell function. While it is constitutively high expressed on neutrophils, on monocytes the SIRL-1 surface expression varies strongly between individuals. The underlying mechanism of regulation, its genetic control as well as potential clinical implications had not been explored yet. METHODS: Whole blood eQTL data of a Chinese cohort was used to identify SNPs regulating the expression of VSTM1, the gene encoding SIRL-1. The genotype effect was validated by flow cytometry (cell surface expression), correlated with electrophoretic mobility shift assay (EMSA), chromatin immunoprecipitation (ChIP) and bisulfite sequencing (C-methylation) and its functional impact studied the inhibition of reactive oxygen species (ROS). RESULTS: We found a significant association of a single CpG-SNP, rs612529T/C, located in the promoter of VSTM1. Through flow cytometry analysis we confirmed that primarily in the monocytes the protein level of SIRL-1 is strongly associated with genotype of this SNP. In monocytes, the T allele of this SNP facilitates binding of the transcription factors YY1 and PU.1, of which the latter has been recently shown to act as docking site for modifiers of DNA methylation. In line with this notion rs612529T associates with a complete demethylation of the VSTM1 promoter correlating with the allele-specific upregulation of SIRL-1 expression. In monocytes, this upregulation strongly impacts the IgA-induced production of ROS by these cells. Through targeted association analysis we found a significant Meta P value of 1.14 × 10(-6) for rs612529 for association to atopic dermatitis (AD). CONCLUSION: Low expression of SIRL-1 on monocytes is associated with an increased risk for the manifestation of an inflammatory skin disease. It thus underlines the role of both the cell subset and this inhibitory immune receptor in maintaining immune homeostasis in the skin. Notably, the genetic regulation is achieved by a single CpG-SNP, which controls the overall methylation state of the promoter gene segment.
Total publications on this page: 100
Total citations for publications on this page: 120