BACKGROUND: More than 1100 mutations that cause hemophilia B (HB) have been identified. At the same time, specific F9 mutations are present at high frequencies in certain populations, which raises questions about the origin of HB mutations.
OBJECTIVES: To describe the mutation spectrum of all HB families in Sweden and investigate whether mutations appearing in several families are due to independent recurrent mutations (RMs) or to a common mutation event (i.e. are identical by descent (IBD)).
PATIENTS/METHODS: The registered Swedish HB population consists of patients from 86 families. Mutations were identified by resequencing and identical haplotypes were defined using 74 markers and a control population of 285 individuals. The ages of IBD mutations were estimated using ESTIAGE.
RESULTS: Out of 77 presumably unrelated patients with substitution mutations, 47 patients (61%) had mutations in common with other patients. Haplotyping of the 47 patients showed that 24 patients had IBD mutations (51%) with estimated ages of between two and 23 generations. A majority of these patients had mild disease. Eight of the 15 mutations observed in more than one family were C>T transitions in CpG sites and all eight were RMs.
CONCLUSIONS: The association of IBD mutations with a mild phenotype is similar to what has been previously observed in hemophilia A. Noteworthy features of the mutations that are common to more than one family are the equal proportions of patients with RM and IBD mutations and the correlation between the occurrence of RMs and C>T transitions at CpG sites.
Background: Hemophilia A (HA) has a high level of variation within the disease class, with more than 1000 mutations being listed in the HAMSTeRS database. At the same time a number of F8 mutations are present in specific populations at high frequencies. Objectives: The simultaneous presence of large numbers of rare mutations and a small number of high-frequency mutations raises questions about the origins of HA mutations. The present study was aimed at describing the origins of HA mutations in the complete Swedish population. The primary issue was to determine what proportion of identical mutations are identical by descent (IBD) and what proportion are attributable to recurrent mutation events. The age of IBD mutations was also determined. Patients/Methods: In Sweden, the care of HA is centralized, and the Swedish HA population consists of ~750 patients from >300 families (35% severe, 15% moderate, and 50% mild). Identical haplotypes were defined by single-nucleotide polymorphism and microsatellite haplotyping, and the ages of the mutations were estimated with ESTIAGE. Results: Among 212 presumably unrelated patients with substitution mutations, 97 (46%) had mutations in common with other patients. Haplotyping of the 97 patients showed that 47 had IBD mutations (22%) with estimated ages of between two and 35 generations. The frequency of mild disease increased with an increasing number of patients sharing the mutations. Conclusions: A majority of the IBD mutations are mild and have age estimates of a few hundred years, but some could date back to the Middle Ages.
BACKGROUND: Variation in the 10 toll-like receptor (TLR) genes has been significantly associated with allergic rhinitis (AR) in several candidate gene studies and three large genome-wide association studies. These have all investigated common variants, but no investigations for rare variants (MAF ≤ 1%) have been made in AR. The present study aims to describe the genetic variation of the promoter and coding sequences of the 10 TLR genes in 288 AR patients.
METHODS: Sanger sequencing and Ion Torrent next-generation sequencing were used to identify polymorphisms in a Swedish AR population and these were subsequently compared and evaluated using 1000Genomes and Exome Aggregation Consortium (ExAC) data.
RESULTS: The overall level of genetic variation was clearly different among the 10 TLR genes. The TLR10-TLR1-TLR6 locus was the most variable, while the TLR7-TLR8 locus consistently showed a much lower level of variation. The AR patients had a total of 37 promoter polymorphisms, of which 14 were rare (MAF ≤ 1%) and 14 were AR-specific. These numbers were highly similar when comparing the AR and the European part of the 1000Genomes populations, with the exception of TLR10, where a significant (P = 0.00009) accumulation of polymorphisms was identified. The coding sequences had a total of 119 polymorphisms, of which 68 were rare and 43 were not present in the European part of the 1000Genomes population. Comparing the numbers of rare and AR-specific SNPs in the patients with the European part of the 1000Genomes population showed that the numbers were quite similar both for individual genes and for the sum of all 10 genes. However, TLR1, TLR5, TLR7 and TLR9 showed a significant excess of rare variants in the AR population when compared to the non-Finnish European part of ExAC. In particular, the TLR1 S324* nonsense mutation was clearly overrepresented in the AR population.
CONCLUSIONS: Most TLR genes showed a similar level of variation between AR patients and public databases, but a significant excess of rare variants in AR patients was detected in TLR1, TLR5, TLR7, TLR9 and TLR10. This further emphasizes the frequently reproduced TLR10-TLR1-TLR6 locus as being involved in the pathogenesis of allergic rhinitis.
BACKGROUND: A previous investigation of all 10 TLR-genes for associations with allergic rhinitis (AR) detected a number of significant SNPs in the TLR8 locus. The associations indicated that an accumulation of rare variants could explain the signal. The present study therefore searches for rare variants in the TLR8 region and also investigates the reproducibility of previous SNP associations.
METHODS: The TLR8 gene was re-sequenced in 288 AR patients from Malmö and the data was compared with publicly available data. Seven previously AR-associated SNPs from TLR8 were analyzed for AR-associations in 422 AR patients and 859 controls from the BAMSE cohort. The associations detected in present and previous studies were compared.
RESULTS: Sequencing detected 13 polymorphisms (3 promoter, 10 coding) among 288 AR patients. Four of the coding polymorphisms were rare (MAF <1%) and three of those were novel. Two coding polymorphisms were benign missense mutations and the rest were synonymous. Comparison with 1000Genomes and Exome Aggregation Consortium data revealed no accumulation of rare variants in the AR cases. The AR-association tests made using the BAMSE cohort yielded 5 P-values < 0.05. Tests of IgE-levels yielded 4 significant SNP associations to birch pollen. Comparing results between different populations revealed opposing risk alleles, different gender effects and response to different allergens in the different populations.
CONCLUSIONS: Rare variants in TLR8 are not associated with AR. Comparison of present and previous association studies reveals contradictory results for common variants. Thus, no associations exist between genetic variation in TLR8 and AR.
Genetic studies of chronic rhinosinusitis (CRS) have identified a total of 53 CRS-associated SNPs that were subsequently evaluated for their reproducibility in a recent study. The rs2873551 SNP in linkage disequilibrium with PARS2 showed the strongest association signal. The present study aims to comprehensively screen for rare variants in PARS2 and evaluate for accumulation of such variants in CRS-patients. Sanger sequencing and long-range PCR were used to screen for rare variants in the putative promoter region and coding sequence of 310 CRS-patients and a total of 21 variants were detected. The mutation spectrum was then compared with data from European populations of the 1000Genomes project (EUR) and the Exome Aggregation Consortium (ExAC). The CRS population showed a significant surplus of low-frequency variants compared with ExAC data. Haplotype analysis of the region showed a significant excess of rare haplotypes in the CRS population compared to the EUR population. Two missense mutations were also genotyped in the 310 CRS patients and 372 CRS-negative controls, but no associations with the disease were found. This is the first re-sequencing study in CRS research and also the first study to show an association of rare variants with the disease.
The evolutionary history of the common chloroplast (cp) genome of the allotetraploid Arabidopsis suecica and its maternal parent A. thaliana was investigated by sequencing 50 fragments of cpDNA, resulting in 98 polymorphic sites. The variation in the A. suecica sample was small, in contrast to that of the A. thaliana sample. The time to the most recent common ancestor (T(MRCA)) of the A. suecica cp genome alone was estimated to be about one 37th of the T(MRCA) of both the A. thaliana and A. suecica cp genomes. This corresponds to A. suecica having a MRCA between 10 000 and 50 000 years ago, suggesting that the entire species originated during, or before, this period of time, although the estimates are sensitive to assumptions made about population size and mutation rate. The data was also consistent with the hypothesis of A. suecica being of single origin. Isolation-by-distance and population structure in A. thaliana depended upon the geographical scale analysed; isolation-by-distance was found to be weak on the global scale but locally pronounced. Within the genealogical cp tree of A. thaliana, there were indications that the root of the A. suecica species is located among accessions of A. thaliana that come primarily from central Europe. Selective neutrality of the cp genome could not be rejected, despite the fact that it contains several completely linked protein-coding genes.
A coalescent-based method was used to investigate the origins of the allotetraploid Arabidopsis suecica, using 52 nuclear microsatellite loci typed in eight individuals of A. suecica and 14 individuals of its maternal parent Arabidopsis thaliana, and four short fragments of genomic DNA sequenced in a sample of four individuals of A. suecica and in both its parental species A. thaliana and Arabidopsis arenosa. All loci were variable in A. thaliana but only 24 of the 52 microsatellite loci and none of the four sequence fragments were variable in A. suecica. We explore a number of possible evolutionary scenarios for A. suecica and conclude that it is likely that A. suecica has a recent, unique origin between 12,000 and 300,000 years ago. The time estimates depend strongly on what is assumed about population growth and rates of mutation. When combined with what is known about the history of glaciations, our results suggest that A. suecica originated south of its present distribution in Sweden and Finland and then migrated north, perhaps in the wake of the retreating ice.
The level of variation and the mutation rate were investigated in an empirical study of 244 chloroplast microsatellites in 15 accessions of Arabidopsis thaliana. In contrast to SNP variation, microsatellite variation in the chloroplast was found to be common, although less common than microsatellite variation in the nucleus. No microsatellite variation was found in coding regions of the chloroplast. To evaluate different models of microsatellite evolution as possible explanations for the observed pattern of variation, the length distribution of microsatellites in the published DNA sequence of the A. thaliana chloroplast was subsequently used. By combining information from these two analyses we found that the mode of evolution of the chloroplast mononucleotide microsatellites was best described by a linear relation between repeat length and mutation rate, when the repeat lengths exceeded about 7 bp. This model can readily predict the variation observed in non-coding chloroplast DNA. It was found that the number of uninterrupted repeat units had a large impact on the level of chloroplast microsatellite variation. No other factors investigated, such as the position of a locus within the chromosome or imperfect repeats, appeared to affect the variability of chloroplast microsatellites. By fitting the slippage models to the GenBank sequence of chromosome 1, we show that the difference between microsatellite variation in the nucleus and the chloroplast is largely due to differences in slippage rate.
Protein S deficiency is a dominantly inherited disorder that results from mutations in the PROS1 gene. Previous sequencing of the gene failed to detect mutations in eight out of 18 investigated Swedish families, whereas segregation analyses detected large deletions in three out of the eight families. The present study investigates more thoroughly for the presence of deletions but also for other types of rearrangements. FISH analysis confirmed the existence of the three previously identified large deletions, but failed to identify any other type of rearrangement among the eight analysed families. MLPA analysis of the PROS1 gene revealed two smaller deletions covering two and four exons, respectively. Thus, deletions could be found in five out of eight families where no point mutations could be found despite sequencing of the gene. Twelve additional, not previously analysed, families were subsequently analysed using MLPA. The analysis identified two smaller deletions (3 and 4 exons). Including all PS-deficient families, i.e. also the 10 families where sequencing found a causative point mutation, deletions were identified in seven out of 30 PS-deficient families. A strategy of sequencing followed by MLPA analysis in mutation-negative families identified the causative mutation in 15 out of 18 Swedish PS-deficient families. Most deletions were different as determined by their sizes, locations and flanking haplotypes. FISH (8 families) and MLPA analysis (20 families) failed to identify other types of rearrangements.
Random amplified polymorphic DNA (RAPD) markers were used to estimate the level of genetic variation in Swedish accessions of the allopolyploid Arabidopsis suecica and its parental species A. thaliana and A. arenosa. The results showed clear differences among the three species with respect to the level of variation. A. arenosa was highly variable, A. thaliana showed a moderate level of variation whereas A. suecica was much less variable than the two other species. An extended analysis covering 19 Swedish populations of A. suecica corroborated the low level of variation in this species, yet 16 unique phenotypes were observed. No isolation by distance was observed. When the genetic variation was partitioned among and within populations of A. suecica, the results showed that the majority of the variation (81%) occurred among populations. This result is interpreted as a strong indication that A. suecica is autogamous in nature.
von Willebrand factor (VWF) level and function are influenced by genetic variation in VWF and several other genes in von Willebrand disease type 1 (VWD1) patients. This study comprehensively screened for VWF variants and investigated the presence of ABO genotypes and common and rare VWF variants in Swedish VWD1 patients. The VWF gene was resequenced using Ion Torrent and Sanger sequencing in 126 index cases historically diagnosed with VWD. Exon 7 of the ABO gene was resequenced using Sanger sequencing. Multiplex ligation-dependent probe amplification analysis was used to investigate for copy number variants. Genotyping of 98 single nucleotide variants allowed allele frequency comparisons with public databases. Seven VWD2 mutations and 36 candidate VWD1 mutations (5 deletions, 4 nonsense, 21 missense, 1 splice, and 5 synonymous mutations) were identified. Nine mutations were found in more than one family and nine VWD1 index cases carried more than one candidate mutation. The T-allele of rs1063857 (c.2385T>C, p.Y795=) and blood group O were both frequent findings and contributed to disease in the Swedish VWD1 population. VWD2 mutations were found in 20 and candidate VWD1 mutations in 51 index cases out of 106 (48%). VWF mutations, a VWF haplotype, and blood group O all contributed to explain disease in Swedish VWD1 patients.
BACKGROUND: F8 int1h inversions (Inv1) are detected in 1-2% of severe hemophilia A (HA) patients. Long-range polymerase chain reaction (PCR) and inverse-shifting PCR have been used to diagnose these inversions.
OBJECTIVES: To design and validate a sensitive and robust assay for detection of F8 Inv1 inversions.
METHODS: Archival DNA samples were investigated using mile-post assays and droplet digital PCR.
RESULTS: Mile-post assays for Inv1 showing high specificities and sensitivities were designed and optimized. Analysis of four patients, two carrier mothers and 40 healthy controls showed concordance with known mutation status with one exception: one patient had a duplication involving exons 2-22 of the F8 gene instead of an Inv1 mutation. In DNA mixtures, the proportions of wild type and Inv1 DNA correlated well with the observed relative linkage for both the wild type and Inv1 assays, and the limit of detection of these assays was estimated at 2% of the rare chromosome.
CONCLUSIONS: The mile-post strategy has several inherent control systems. The absolute counting of target molecules by both assays enables determination of template quantity, detection of copy number variants and rare variants occurring in primer and probe annealing sites and estimation of DNA integrity through the observed linkage. The presented Inv1 mile-post analysis offers sensitive and robust detection and quantification of the F8 int1h inversions and other rearrangements involving intron 1 in patients and their mothers.
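The absolute counting mentioned above is usually obtained in droplet digital PCR by a Poisson correction of the fraction of positive droplets. The following is a minimal sketch of that general calculation, not the assay analysis used in the study; the function names and the droplet counts in the usage lines are hypothetical, and the linkage measurement of the mile-post assays is not modelled here.

```python
import math


def copies_per_droplet(positive: int, total: int) -> float:
    """Mean number of target copies per droplet, Poisson-corrected.

    If targets are distributed randomly over droplets, the fraction of
    negative droplets is exp(-lambda), so lambda = -ln(1 - positive/total).
    """
    if total <= 0 or not 0 <= positive < total:
        raise ValueError("need total > 0 and 0 <= positive < total")
    return -math.log(1.0 - positive / total)


def rare_fraction(rare_positive: int, common_positive: int, total: int) -> float:
    """Fraction of the rare target among all counted target copies."""
    rare = copies_per_droplet(rare_positive, total)
    common = copies_per_droplet(common_positive, total)
    if rare + common == 0:
        raise ValueError("no positive droplets for either target")
    return rare / (rare + common)


if __name__ == "__main__":
    # Hypothetical droplet counts, chosen only to illustrate the call.
    print(copies_per_droplet(positive=3000, total=15000))
    print(rare_fraction(rare_positive=60, common_positive=3000, total=15000))
```

The correction matters most at high template concentrations, where a single positive droplet is likely to contain more than one target molecule.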
Background: The occurrence of mosaicism in hemophilia A (HA) has been investigated in several studies using different detection methods. Objectives: To characterize and compare the ability of AmpliSeq/Ion Torrent sequencing and droplet digital polymerase chain reaction (ddPCR) for mosaic detection in HA. Methods: Ion Torrent sequencing and ddPCR were used to analyze 20 healthy males and 16 mothers of sporadic HA patients. Results: An error-rate map over all coding positions and all positions reported as mutated in the F8-specific mutation database was produced. The sequencing produced a mean read depth of >1500X where >97% of positions were covered by >100 reads. Higher error frequencies were observed in positions with A or T as reference allele and in positions surrounded on both sides with C or G. Seventeen of 9319 positions had a mean substitution error frequency >1%. The ability to identify low-level mosaicism was determined primarily by read depth and error rate of each specific position. Limit of detection (LOD) was <1% for 97% of positions with substitutions and 90% of indel positions. The positions with LOD >1% require repeated testing and mononucleotide repeats with more than four repeat units need an alternative analysis strategy. Mosaicism was detected in 1 of 16 mothers and confirmed using ddPCR. Conclusions: Deep sequencing using an AmpliSeq/Ion Torrent strategy allows for simultaneous identification of disease-causing mutations in patients and mosaicism in mothers. ddPCR has high sensitivity but is hampered by the need for mutation-specific design.
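To illustrate why read depth and position-specific error rate together set the detection limit, the sketch below uses a simple binomial threshold. This is an assumption made for illustration and not necessarily the procedure used in the study: sequencing errors at a position are modelled as Binomial(depth, error_rate), and the limit is the smallest variant read fraction that errors alone would reach with probability below a chosen `alpha`. The function names and the default `alpha` are hypothetical.

```python
import math


def binom_pmf(k: int, n: int, p: float) -> float:
    """Binomial probability mass, computed in log space to avoid overflow."""
    log_pmf = (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log1p(-p)
    )
    return math.exp(log_pmf)


def min_variant_reads(depth: int, error_rate: float, alpha: float = 1e-3) -> int:
    """Smallest k such that P(X >= k) < alpha for X ~ Binomial(depth, error_rate)."""
    if depth <= 0 or not 0.0 < error_rate < 1.0:
        raise ValueError("need depth > 0 and 0 < error_rate < 1")
    cdf = 0.0  # P(X <= k - 1) at the top of each iteration
    for k in range(depth + 1):
        if 1.0 - cdf < alpha:
            return k
        cdf += binom_pmf(k, depth, error_rate)
    return depth + 1  # no read count is distinguishable from error


def detection_limit(depth: int, error_rate: float, alpha: float = 1e-3) -> float:
    """Detection limit expressed as a fraction of reads at the position."""
    return min_variant_reads(depth, error_rate, alpha) / depth


if __name__ == "__main__":
    # Hypothetical depths and error rates, for illustration only.
    for depth, err in [(100, 0.001), (1500, 0.001), (1500, 0.01)]:
        print(depth, err, detection_limit(depth, err))
```

Under this model the limit falls as depth increases and rises with the error rate, which is consistent with the qualitative statement in the abstract.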
DNA sequencing was performed on up to 12 chloroplast DNA regions [giving a total of 4288 base pairs (bp) in length] from the allopolyploid Arabidopsis suecica (48 accessions) and its two parental species, A. thaliana (25 accessions) and A. arenosa (seven accessions). Arabidopsis suecica was identical to A. thaliana at all 93 sites where A. thaliana and A. arenosa differed, thus showing that A. thaliana is the maternal parent of A. suecica. Under the assumption that A. thaliana and A. arenosa separated 5 million years ago, we estimated a substitution rate of 2.9 × 10⁻⁹ per site per year in noncoding single copy sequence. Within A. thaliana we found 12 substitution (single bp) and eight insertion/deletion (indel) polymorphisms, separating the 25 accessions into 15 haplotypes. Eight of the A. thaliana accessions from central Sweden formed one cluster, which was separated from a cluster consisting of central European and extreme southern Swedish accessions. This latter cluster also included the A. suecica accessions, which were all identical except for one 5 bp indel. We interpret this low level of variation as a strong indication that A. suecica effectively has a single origin, which we dated at 20 000 years ago or more.
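A substitution rate of this kind is commonly derived by dividing the per-site divergence between two species by twice their separation time, because substitutions accumulate along both lineages. The sketch below shows that general calculation only; it assumes no correction for multiple hits, the abstract does not report the number of noncoding single copy sites or differences, and the inputs in the usage line are hypothetical placeholders that are not intended to reproduce the published estimate.

```python
def substitution_rate(differences: int, sites: int, divergence_years: float) -> float:
    """Substitutions per site per year from pairwise divergence.

    rate = (differences / sites) / (2 * T), where T is the time since the
    two lineages separated. No multiple-hit correction is applied.
    """
    if sites <= 0 or divergence_years <= 0:
        raise ValueError("sites and divergence_years must be positive")
    if not 0 <= differences <= sites:
        raise ValueError("differences must lie between 0 and sites")
    per_site_divergence = differences / sites
    return per_site_divergence / (2.0 * divergence_years)


if __name__ == "__main__":
    # Hypothetical counts; only the 5 million year separation comes from the text.
    print(substitution_rate(differences=50, sites=2000, divergence_years=5e6))
```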
RAPD (random amplified polymorphic DNA) is a multiplex marker system that conventionally uses single-primer PCR to amplify random DNA fragments. Because of its multiplex nature, it is frequently used in bulked segregant analysis (BSA). In view of the very large numbers of markers BSA often requires, we investigated the use of mixtures of primers as a method of increasing the number of markers available. Theoretically, if a single-primer reaction produces x bands on average, an unrestrained PCR process using a primers should produce xa² bands. Initially, we investigated mixtures containing from one to five primers. The average number of products increased slightly from the single-primer to the multiple-primer case, whereas it was rather constant for the different multi-primer combinations. This deviation from the theoretical expectations, which we attribute to the effects of competition, shows mixtures of more than two primers to be inefficient. The properties of two-primer mixtures in which the proportions of the two primers were varied were also investigated. The intensities of most of the products were influenced by the proportions of the primers used to create the mixture. A good fit was obtained to a model in which the average competitive ability of a band is directly proportional to the probability of randomly obtaining the band-producing primer combination from the pool of primers. Using a primers, a(a-1)/2 different two-primer mixtures can be produced. A comparison of different schemes for constructing the two-primer mixtures indicates that the degree of resampling is similar for all schemes. In conclusion, the use of two-primer mixtures is a simple but very powerful strategy in BSA as it can generate an extremely large number of markers.
The breeding system of Arabidopsis suecica was investigated through genetic analysis of microsatellite segregation patterns in five controlled crosses as well as in 16 single-mother families collected in the wild. Analysis of single and two-locus segregations in the F2 generation following a cross clearly shows that A. suecica reproduces sexually. The single-mother families show a high level of homozygosity corroborating earlier results indicating a high level of inbreeding. The high level of individual homozygosity is due both to a high level of selfing and to the underlying population structure.
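The two counts in the abstract above can be checked with a short calculation. The sketch below reads the theoretical expectation as x times a squared, on the assumption that a single-primer reaction corresponds to one ordered primer pair and a primers give a² ordered pairs, and it computes the number of two-primer mixtures as a(a-1)/2. The function names and the primer labels are hypothetical.

```python
import math
from itertools import combinations


def expected_bands_unrestrained(x: float, a: int) -> float:
    """Expected bands with `a` primers if every ordered primer pair amplified
    as freely as a single primer does (x bands per pair, a**2 pairs)."""
    if a < 1:
        raise ValueError("need at least one primer")
    return x * a ** 2


def two_primer_mixtures(a: int) -> int:
    """Number of distinct two-primer mixtures from a pool of `a` primers."""
    return math.comb(a, 2)  # equals a * (a - 1) / 2


if __name__ == "__main__":
    primers = ["P1", "P2", "P3", "P4", "P5"]  # hypothetical primer labels
    mixtures = list(combinations(primers, 2))
    print(len(mixtures), two_primer_mixtures(len(primers)))
    print(expected_bands_unrestrained(x=4, a=2))
```

The abstract reports that observed band numbers fall well short of the unrestrained expectation, so the first function gives an upper bound under the stated assumption rather than a prediction.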