Skip to main content

Noninvasive prenatal testing of α-thalassemia and β-thalassemia through population-based parental haplotyping



Noninvasive prenatal testing (NIPT) of recessive monogenic diseases depends heavily on knowing the correct parental haplotypes. However, the currently used family-based haplotyping method requires pedigrees, and molecular haplotyping is highly challenging due to its high cost, long turnaround time, and complexity. Here, we proposed a new two-step approach, population-based haplotyping-NIPT (PBH-NIPT), using α-thalassemia and β-thalassemia as prototypes.


First, we deduced parental haplotypes with Beagle 4.0 with training on a large retrospective carrier screening dataset (4356 thalassemia carrier screening-positive cases). Second, we inferred fetal haplotypes using a parental haplotype-assisted hidden Markov model (HMM) and the Viterbi algorithm.


With this approach, we enrolled 59 couples at risk of having a fetus with thalassemia and successfully inferred 94.1% (111/118) of fetal alleles. We confirmed these alleles by invasive prenatal diagnosis, with 99.1% (110/111) accuracy (95% CI, 95.1–100%).


These results demonstrate that PBH-NIPT is a sensitive, fast, and inexpensive strategy for NIPT of thalassemia.


The discovery of cell-free fetal DNA enables noninvasive prenatal testing (NIPT) for common aneuploidies [1, 2], microdeletion/microduplication syndromes [3,4,5], and monogenic disorders. Initially, NIPT of monogenic disorders focused on detecting de novo or paternally inherited variants responsible for dominant monogenic disorders [6, 7]. Reports indicate that the average genomic carrier burden for severe pediatric recessive variants is 2.8 per person [8] and that the cumulative prevalence among live births is approximately 0.8% [9]. NIPT for most recessive monogenic disorders involves several technical challenges and has only been made clinically available for a limited number of recessive conditions [10] despite the relatively high prevalence of such disorders, because analysis of maternally inherited fetal alleles has been hampered by the high background of maternal DNA in cell-free DNA (cfDNA) [11]. The current approaches for NIPT of recessive diseases are typically classified into two categories [12, 13]: relative mutation dosage (RMD) analysis [14] and relative haplotype dosage (RHDO) analysis [15]. The RMD approach focuses on quantitative comparisons between variant and wild-type alleles present in cfDNA and has relatively high sensitivity and specificity [14, 16]. This approach is powerful for detecting single nucleotide variants (SNVs) and small insertions/deletions (InDels) but usually cannot detect large InDels and copy number variants (CNVs) [17,18,19]. Its performance is also affected by sequencing errors and amplification bias of low-abundance fetal variants in cfDNA [20]. Unlike the RMD approach, the RHDO approach determines the relative proportions of variant and normal haplotypes in maternal plasma [21] and can theoretically detect most types of variants, including large InDels and CNVs, in one test [16, 22]. However, RHDO analysis requires parental haplotype information [23]. Although molecular phasing approaches to determine parental haplotypes, including linked-read sequencing [24, 25] and targeted locus amplification (TLA) [26], have not been widely used in clinical settings due to their high cost and complex procedures [27,28,29,30,31,32], population-based parental haplotyping provides an alternative approach due to its rapid turnaround and inexpensive and relatively simple procedures. However, the use of this method has been limited to a founder variant (GBA gene, c.1226A>G) [33].

Thalassemia causes hemoglobin deficiency and affects approximately 4.4 per 10,000 live births worldwide [34]. Its genetic complexity involves three types of variants: SNVs, InDels, and CNVs. In southern China, the most prevalent variants of α-thalassemia are -α3.7 deletion, --SEA deletion, -α4.2 deletion, HBA2 c.369C>G, and HBA2 c.427T>C, while those of β-thalassemia are HBB c.126_129delCTTT, HBB c.52A>T, HBB c.316-197C>T, HBB c.-78A>G, and HBB c.79G>A [35, 36]. Our population screening data for thalassemia [35] shows that these 10 variants account for 87.9% of β-thalassemia carriers and 96.5% of α-thalassemia carriers (Additional file 1: Fig. S1).

In the present study, we proposed a novel population-based haplotyping-NIPT method (PBH-NIPT) for α-thalassemia and β-thalassemia in which nonfounder variants were detected when the sample size of the reference panel (population data used to infer parental haplotypes) was sufficiently large for accurate deduction of parental haplotypes. The PBH-NIPT model was trained on a large retrospective carrier screening dataset, and its accuracy was verified via invasive prenatal diagnosis. In addition, we assessed the effect of the reference panel sample size on the outcomes of PBH-NIPT.


Patients and samples

The ethics committees of Guangzhou Women and Children’s Medical Center and BGI approved this study (approval numbers: 2017102408 and BGI-IRB 18043). Fifty-nine couples at risk of having a fetus with thalassemia provided written informed consent. The clinical features of the participants are provided in the supplement (Additional file 2: Table S1). We collected 5 ml of blood from each parent. We promptly isolated maternal plasma using a two-step centrifugation method [37]. We used 10 ml of amniotic fluid (AF) or 5 mg of chorionic villus sample (CVS) for invasive prenatal diagnosis.

Sequencing library preparation

We extracted cfDNA from maternal plasma using a QIAamp Circulating Nucleic Acid Kit (Qiagen, Dusseldorf, Germany) and extracted parental gDNA from peripheral blood and fetal DNA from CVS or AF using a QIAamp DNA Mini Kit (Qiagen).

We used gDNA (500 ng) for library construction and fragmented it ultrasonically with a Bioruptor Pico (Diagenode, Liege, Belgium), yielding 300–700-bp fragments. We then performed end repair, phosphorylation, and A-tailing reactions on the sheared DNA and ligated BGISEQ adaptors with specific barcodes to the A-tailed products. We performed 4–6 cycles of polymerase chain reaction (PCR) amplification to enrich the target regions and performed hybridization capture according to the NimbleGen protocols after pooling twenty barcoded gDNA libraries in equal amounts. Finally, we performed circularization of the post-capture library to generate circular single-stranded DNA (ssDNA). We prepared the maternal plasma DNA library using the same method except without fragmentation and pooled eight cfDNA libraries in equal amounts. After quantitation using Qubit 3.0 (Thermo Fisher, Waltham, USA), we used rolling circle replication to form DNA nanoballs (DNBs) from the ssDNA and loaded each DNB into 1 lane to be processed for 100-bp paired-end sequencing on the BGISEQ-500 and MGISEQ-2000 platforms (BGI, Shenzhen, China).

Reference panel construction

We generated the reference panel from 4356 thalassemia carrier screening-positive cases. Of the total 4356 cases, 3867 were obtained from our previously published paper [35], and 489 were obtained from unpublished in-house data.

We first used our previously published algorithm [35] to call SNPs from 4356 positive carriers and then filtered SNPs with a sequencing depth of less than 20-fold in more than 2% of the population or with an allele ratio between 5 and 40% in more than 70% of heterozygous individuals in the population. We used the publicly available software Beagle (version 4.0) to construct haplotypes for 4356 individuals and used these data as the reference panel for the next step. Since SNPs and InDels are the acceptable input for Beagle, we treated CNVs as SNPs in the phasing procedure. CNVs are represented as the VCF format of SNPs in the Beagle input file (VCF format), where the genomic position is the start position of the CNV, and the genotypes “0/1” and “1/1” represent heterozygous and homozygous CNVs, respectively.

Construction of parental haplotypes by PBH

We aligned the sequence reads from parental gDNA and maternal plasma DNA to the reference human genome (hg19) using BWA version 0.7.12. We marked duplicate reads with Picard version 1.87 and performed variant calling as previously described [35]. We also treated CNVs as SNPs in the phasing procedure. We used the haplotypes of the reference panel and the genotypes of the parents as inputs to deduce parental haplotypes with Beagle 4.0. Finally, we used only heterozygous SNPs to represent parental haplotypes.

NIPT of thalassemia

We calculated the fetal fraction (FF) as described in Additional file 3 and inferred fetal haplotypes inherited from the father and mother separately. First, we determined paternal inheritance using paternal informative SNPs, which were heterozygous in the father but homozygous in the mother. Second, we determined maternal inheritance using maternal informative SNPs, which included two types of SNPs: (1) SNPs heterozygous in the mother but homozygous in the father and (2) SNPs heterozygous in the parents in the blocks where the first step inferred the fetal inherited haplotype from the father. Because informative SNPs linked to the inherited haplotype are overrepresented in maternal plasma, we applied the hidden Markov model (HMM) and Viterbi algorithm [38] to determine the fetal genotypes of pathogenic sites (Additional file 3: Supplementary Methods). For samples with CNVs, all SNPs in the CNV region were not selected as informative SNPs to perform Viterbi decoding.

Invasive prenatal diagnosis of thalassemia

We performed invasive prenatal diagnosis via chorionic villus sampling or amniocentesis in accordance with standard protocols. We determined fetal genotypes through gap-PCR and reverse dot blot PCR (RDB-PCR).

The effect of the reference panel sample size on the outcomes of PBH-NIPT

To assess the effect of the reference panel sample size on the outcomes of PBH-NIPT, we randomly selected one-half, one-quarter, one-sixth, one-eighth, one-twelfth, and 50 of the samples from the total reference panel and performed three independent tests.


As shown in Fig. 1, the PBH-NIPT workflow involves the following steps. First, we generated the reference panel from 4356 thalassemia carrier screening-positive cases. Of the total 4356 cases, 3867 were obtained from our previously published paper [35], and 489 were obtained from unpublished in-house data. Second, we enrolled 59 couples in whom both partners carried at least one of the 10 aforementioned variants and were at risk of having a fetus with thalassemia major or intermedia [39] (Additional file 2: Table S1). The average gestational age at the time of collection was 12.6+3 weeks (range 10+1–22 weeks), and the average FF was 15.4% (range 6.0–26.1%) (Additional file 4: Table S2). We subjected genomic DNA (gDNA) of the couples and fetuses as well as maternal cfDNA to hybridization-based capture and sequencing using a strategy previously described for thalassemia carrier screening [35]. We obtained an average target region coverage of 177-fold (range 56–678) in maternal plasma and 203-fold (range 85–360) in parental gDNA. Third, we inferred parental haplotypes by PBH (see the “Methods” section and Fig. 1). To evaluate the reliability of PBH, we also constructed parental haplotypes by family-based haplotyping (FBH) and calculated the percentage of concordant single-nucleotide polymorphisms (SNPs) phased by these two methods (Additional file 3: Supplementary Methods). The average concordance rates of phased SNPs in the maternal and paternal haplotypes were 98.7% (range 87.5–100%) and 95.7% (range 59.2–100%), respectively (Additional file 5: Fig. S2; Additional file 6: Table S3).

Fig. 1

Population-based haplotyping-noninvasive prenatal testing (PBH-NIPT) workflow. The workflow involves 3 parts. We first constructed haplotypes of 59 couples using the haplotypes of 4356 samples as the reference panel. Then, we inferred fetal haplotypes using a hidden Markov model (HMM) and the Viterbi algorithm. Finally, we used invasive prenatal diagnosis results to validate PBH-NIPT results

To correctly infer fetal genotypes of pathogenic sites (rather than all SNPs), we developed a hidden Markov model (HMM) and used the Viterbi algorithm. We calculated a confidence score (CS), defined as the probability of obtaining the correct NIPT result, to evaluate the reliability of each prediction. A “no-call” condition was defined when (1) the CS was less than 0.99 or (2) the inferred haplotype contained two haplotype blocks (pathogenic and normal), and neither block spanned the target gene (HBB or HBA) (Additional file 3: Supplementary Methods). Accordingly, NIPT successfully inferred 111/118 (94.1%) alleles, and invasive prenatal diagnosis confirmed these alleles, with 99.1% (110/111 alleles) accuracy (95% CI, 95.1–100%) (Table 1, Fig. 2, and Additional file 7: Fig. S3). Among these 59 fetuses, 52 had both alleles detected; of these 52 fetuses, 15 were normal, 25 were carriers, and 12 were affected. Seven fetuses had only one allele successfully detected, and the other allele failed, with a CS of less than 0.99 (Table 1 and Fig. 2). Among the 7 fetuses with only one allele inferred by NIPT, 6 inherited the pathogenic allele. Obviously, invasive prenatal diagnosis was needed, which we used to clarify that 4 fetuses were affected and 2 were carriers.

Table 1 Noninvasive prenatal testing of thalassemia
Fig. 2

Outcomes of PBH-NIPT. We performed PBH-NIPT on 59 couples. Fifty-nine fetuses had 111 total alleles confirmed by invasive prenatal diagnosis, with 99.1% accuracy (95% CI, 95.1–100%). Fifty-two fetuses had both alleles detected for a total of 104 alleles; 7 fetuses had only one allele detected

To evaluate the relationship between the accuracy of NIPT and the reference panel sample size, we randomly selected one-half, one-quarter, one-sixth, one-eighth, one-twelfth, and 50 of the samples from the total reference panel and performed three independent tests. As expected, in the 52 fetuses in whom NIPT inferred both alleles, the NIPT outcome improved as the reference panel sample size increased (Fig. 3). Reduction of the sample size to one-half of the total reference panel yielded accuracies of NIPT of approximately 89.3% for β-thalassemia and 95.1% for α-thalassemia relative to the invasive prenatal diagnosis results.

Fig. 3

Effect of the reference panel sample size on the outcomes of PBH-NIPT. We randomly extracted one-half, one-quarter, one-sixth, one-eighth, one-twelfth, and 50 of the samples from the total reference panel and performed three independent tests for a β-thalassemia and b α-thalassemia. In the 52 fetuses in whom NIPT successfully inferred both alleles, the outcome of NIPT improved as the sample size of the reference panel increased


This study demonstrated the feasibility of PBH-NIPT for thalassemia. PBH-NIPT can be used after carrier screening for thalassemia. For high-risk couples reluctant to undergo an invasive procedure, PBH-NIPT is a more attractive option, requiring only a simple blood draw from the pregnant woman. For most conditions, including the deduction of carrier and normal individuals using NIPT, no further confirmation is needed. For conditions where NIPT detects an affected fetus (12 cases) or detects only one pathogenic allele (6 cases), invasive prenatal diagnosis is recommended. In our study, PBH-NIPT dramatically reduced the number of invasive prenatal diagnosis required by approximately 69.5% (from 59 to 18 fetuses).

Here, NIPT successfully inferred 94.1% of the fetal alleles (111/118) from the 59 fetuses. Focusing on the 7 no-call cases clearly shows that the number of informative SNPs in all 7 cases was fewer than 3. This problem can be resolved by increasing the number of informative SNPs flanking the target gene through expansion of the target region [33]. Moreover, this study demonstrated that the reference panel size could affect the performance of NIPT. However, the reference panel size is not a limiting factor since large-scale expanded carrier screening for recessive monogenic disorders is common in clinical practice [40].

This study aimed to evaluate and provide a simple, fast, and inexpensive NIPT method for thalassemia. Compared with the current linked-read sequencing-based NIPT method, which requires 15–20 days (10 days of wet lab work and 5–10 days of data analysis) [25], PBH-NIPT requires only 5–7 days (4–5 days of wet lab work and 1–2 days of data analysis). Training PBH on a large reference panel requires only a few minutes [41]. The PBH-NIPT method costs approximately 80 dollars, as estimated in the supplement (Additional file 8: Table S4). Considering the cost of invasive prenatal diagnosis (~ 1000 dollars/sample [42]) testing in 6 cases, the actual cost of PBH-NIPT per sample is 174 dollars, which is significantly less than those of molecular haplotyping (~ 1500 dollars/sample [25]) and invasive prenatal diagnosis (~ 1000 dollars/sample [42]).

This study has two limitations. First, since all 59 families and training reference data were from southern China, the test cannot detect individuals with ethnic backgrounds differing from those in the training population. Currently, we can only consult ethnic information based on self-reports before testing. A potentially good solution would be to add a quantifiable QC parameter to provide guidance for the reliability of the test. Therefore, we will consider including SNPs that are able to distinguish ethnic information when designing the next version of the probe [43]. Second, the population frequency of these 10 variants was 0.15~2.66% in our dataset [35], and more data are needed to validate whether PBH-NIPT is able to detect variants with lower frequencies.


In summary, we developed and verified PBH-NIPT, a novel method for prenatal testing of α-thalassemia and β-thalassemia. Compared with invasive prenatal diagnosis, this method achieved 99.1% accuracy (95% CI, 95.1–100%). Therefore, we propose that this strategy might be extended to detect variants in addition to single-haplotype founder variants in other recessive monogenic disorders. Additional studies with larger sample sizes are required to confirm the application and performance of PBH-NIPT for other populations and variants with lower frequencies.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its additional files. Nonidentifiable data of 59 families generated during this study are deposited in the CNGB Nucleotide Sequence Archive (CNSA: with accession number CNP0000644 [44]. The above data have also been deposited in the European Variation Archive (EVA: with accession number PRJEB42529 [45]. The raw data of 4356 samples, which was not open to the public in the original paper [35], cannot be submitted to public databases because the patients were not consented for sharing their raw data. The source data used to generate Fig. 3 and Fig. S1 are also available. The customized scripts for PBH and NIPT can be found at

The following open software were used:

Beagle 4.0 ( [46]

BWA 0.7.12 ( [47]

Picard 1.87 ( [48]



Noninvasive prenatal testing


Family-based haplotyping


Population-based haplotyping-NIPT


Hidden Markov model


Cell-free DNA


Relative mutation dosage


Relative haplotype dosage


Single nucleotide variants


Small insertions/deletions


Copy number variants


Targeted locus amplification


Fetal fraction


Genomic DNA


Single-nucleotide polymorphisms


Confidence score


Amniotic fluid


Chorionic villus sample


Polymerase chain reaction


Single-stranded DNA


DNA nanoballs


Reverse dot blot PCR


  1. 1.

    Zhang H, Gao Y, Jiang F, Fu M, Yuan Y, Guo Y, et al. Non-invasive prenatal testing for trisomies 21, 18 and 13: clinical experience from 146,958 pregnancies. Ultrasound Obstet Gynecol. 2015;45(5):530–8.

    CAS  Article  Google Scholar 

  2. 2.

    Wong AI, Lo YM. Noninvasive fetal genomic, methylomic, and transcriptomic analyses using maternal plasma and clinical implications. Trends Mol Med. 2015;21(2):98–108.

    CAS  Article  Google Scholar 

  3. 3.

    Liang D, Cram DS, Tan H, Linpeng S, Liu Y, Sun H, et al. Clinical utility of noninvasive prenatal screening for expanded chromosome disease syndromes. Genet Med. 2019;21(9):1998–2006.

    CAS  Article  Google Scholar 

  4. 4.

    Lo KK, Karampetsou E, Boustred C, McKay F, Mason S, Hill M, et al. Limited clinical utility of non-invasive prenatal testing for subchromosomal abnormalities. Am J Hum Genet. 2016;98(1):34–44.

    CAS  Article  Google Scholar 

  5. 5.

    Pertile MD, Halks-Miller M, Flowers N, Barbacioru C, Kinnings SL, Vavrek D et al. Rare autosomal trisomies, revealed by maternal plasma DNA sequencing, suggest increased risk of feto-placental disease. Sci Transl Med. 2017;9(405):eaan1240.

  6. 6.

    Chitty LS, Mason S, Barrett AN, McKay F, Lench N, Daley R, et al. Non-invasive prenatal diagnosis of achondroplasia and thanatophoric dysplasia: next-generation sequencing allows for a safer, more accurate, and comprehensive approach. Prenat Diagn. 2015;35(7):656–62.

    CAS  Article  Google Scholar 

  7. 7.

    Zhang J, Li J, Saucier JB, Feng Y, Jiang Y, Sinson J, et al. Non-invasive prenatal sequencing for multiple Mendelian monogenic disorders using circulating cell-free fetal DNA. Nat Med. 2019;25(3):439–47.

    CAS  Article  Google Scholar 

  8. 8.

    Bell CJ, Dinwiddie DL, Miller NA, Hateley SL, Ganusova EE, Mudge J, et al. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med. 2011;3(65):65ra4.

    CAS  Article  Google Scholar 

  9. 9.

    Christianson A, Howson CP, Modell B. March of dimes: global report on birth defects, the hidden toll of dying and disabled children; 2006.

    Google Scholar 

  10. 10.

    Tsao DS, Silas S, Landry BP, Itzep NP, Nguyen AB, Greenberg S, et al. A novel high-throughput molecular counting method with single base-pair resolution enables accurate single-gene NIPT. Sci Rep. 2019;9(1):14382.

    Article  Google Scholar 

  11. 11.

    Liao GJ, Gronowski AM, Zhao Z. Non-invasive prenatal testing using cell-free fetal DNA in maternal circulation. Clin Chim Acta. 2014;428:44–50.

    CAS  Article  Google Scholar 

  12. 12.

    Wong FC, Lo YM. Prenatal diagnosis innovation: genome sequencing of maternal plasma. Annu Rev Med. 2016;67:419–32.

    CAS  Article  Google Scholar 

  13. 13.

    Chiu EKL, Hui WWI, Chiu RWK. cfDNA screening and diagnosis of monogenic disorders - where are we heading? Prenat Diagn. 2018;38(1):52–8.

    CAS  Article  Google Scholar 

  14. 14.

    Lun FM, Tsui NB, Chan KC, Leung TY, Lau TK, Charoenkwan P, et al. Noninvasive prenatal diagnosis of monogenic diseases by digital size selection and relative mutation dosage on DNA in maternal plasma. Proc Natl Acad Sci U S A. 2008;105(50):19920–5.

    CAS  Article  Google Scholar 

  15. 15.

    Lam KW, Jiang P, Liao GJ, Chan KC, Leung TY, Chiu RW, et al. Noninvasive prenatal diagnosis of monogenic diseases by targeted massively parallel sequencing of maternal plasma: application to beta-thalassemia. Clin Chem. 2012;58(10):1467–75.

    CAS  Article  Google Scholar 

  16. 16.

    Shi J, Zhang R, Li J, Zhang R. Novel perspectives in fetal biomarker implementation for the noninvasive prenatal testing. Crit Rev Clin Lab Sci. 2019;56(6):374–92.

    CAS  Article  Google Scholar 

  17. 17.

    Yang X, Zhou Q, Zhou W, Zhong M, Guo X, Wang X, et al. A cell-free DNA barcode-enabled single-molecule test for noninvasive prenatal diagnosis of monogenic disorders: application to beta-thalassemia. Adv Sci (Weinh). 2019;6(11):1802332.

    Article  Google Scholar 

  18. 18.

    Phallen J, Sausen M, Adleff V, Leal A, Hruban C, White J et al. Direct detection of early-stage cancers using circulating tumor DNA. Sci Transl Med. 2017;9(403):eaan2415.

  19. 19.

    Duan H, Liu N, Zhao Z, Liu Y, Wang Y, Li Z, et al. Non-invasive prenatal testing of pregnancies at risk for phenylketonuria. Arch Dis Child Fetal Neonatal Ed. 2019;104(1):F24–F9.

    Article  Google Scholar 

  20. 20.

    Newman AM, Lovejoy AF, Klass DM, Kurtz DM, Chabon JJ, Scherer F, et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nat Biotechnol. 2016;34(5):547–55.

    CAS  Article  Google Scholar 

  21. 21.

    Scotchman E, Chandler NJ, Mellis R, Chitty LS. Noninvasive prenatal diagnosis of single-gene diseases: the next frontier. Clin Chem. 2020;66(1):53–60.

    Article  Google Scholar 

  22. 22.

    Hayward J, Chitty LS. Beyond screening for chromosomal abnormalities: advances in non-invasive diagnosis of single gene disorders and fetal exome sequencing. Semin Fetal Neonatal Med. 2018;23(2):94–101.

    Article  Google Scholar 

  23. 23.

    Snyder MW, Adey A, Kitzman JO, Shendure J. Haplotype-resolved genome sequencing: experimental methods and applications. Nat Rev Genet. 2015;16(6):344–58.

    CAS  Article  Google Scholar 

  24. 24.

    Hui WW, Jiang P, Tong YK, Lee WS, Cheng YK, New MI, et al. Universal haplotype-based noninvasive prenatal testing for single gene diseases. Clin Chem. 2017;63(2):513–24.

    CAS  Article  Google Scholar 

  25. 25.

    Jang SS, Lim BC, Yoo SK, Shin JY, Kim KJ, Seo JS, et al. Targeted linked-read sequencing for direct haplotype phasing of maternal DMD alleles: a practical and reliable method for noninvasive prenatal diagnosis. Sci Rep. 2018;8(1):8678.

    Article  Google Scholar 

  26. 26.

    Vermeulen C, Geeven G, de Wit E, Verstegen M, Jansen RPM, van Kranenburg M, et al. Sensitive monogenic noninvasive prenatal diagnosis by targeted haplotyping. Am J Hum Genet. 2017;101(3):326–39.

    CAS  Article  Google Scholar 

  27. 27.

    Burgtorf C, Kepper P, Hoehe M, Schmitt C, Reinhardt R, Lehrach H, et al. Clone-based systematic haplotyping (CSH): a procedure for physical haplotyping of whole genomes. Genome Res. 2003;13(12):2717–24.

    CAS  Article  Google Scholar 

  28. 28.

    Kitzman JO, Mackenzie AP, Adey A, Hiatt JB, Patwardhan RP, Sudmant PH, et al. Haplotype-resolved genome sequencing of a Gujarati Indian individual. Nat Biotechnol. 2011;29(1):59–63.

    CAS  Article  Google Scholar 

  29. 29.

    Peters BA, Kermani BG, Sparks AB, Alferov O, Hong P, Alexeev A, et al. Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells. Nature. 2012;487(7406):190–5.

    CAS  Article  Google Scholar 

  30. 30.

    Selvaraj S, J RD, Bansal V, Ren B. Whole-genome haplotype reconstruction using proximity-ligation and shotgun sequencing. Nat Biotechnol 2013;31(12):1111–1118.

  31. 31.

    Amini S, Pushkarev D, Christiansen L, Kostem E, Royce T, Turk C, et al. Haplotype-resolved whole-genome sequencing by contiguity-preserving transposition and combinatorial indexing. Nat Genet. 2014;46(12):1343–9.

    CAS  Article  Google Scholar 

  32. 32.

    Zhang K, Zhu J, Shendure J, Porreca GJ, Aach JD, Mitra RD, et al. Long-range polony haplotyping of individual human chromosome molecules. Nat Genet. 2006;38(3):382–7.

    CAS  Article  Google Scholar 

  33. 33.

    Zeevi DA, Altarescu G, Weinberg-Shukron A, Zahdeh F, Dinur T, Chicco G, et al. Proof-of-principle rapid noninvasive prenatal diagnosis of autosomal recessive founder mutations. J Clin Invest. 2015;125(10):3757–65.

    Article  Google Scholar 

  34. 34.

    Rund D, Rachmilewitz E. Beta-thalassemia. N Engl J Med. 2005;353(11):1135–46.

    CAS  Article  Google Scholar 

  35. 35.

    Shang X, Peng Z, Ye Y, Asan, Zhang X, Chen Y et al. Rapid targeted next-generation sequencing platform for molecular screening and clinical genotyping in subjects with hemoglobinopathies. EBioMedicine. 2017;23:150–159.

  36. 36.

    Yin A, Li B, Luo M, Xu L, Wu L, Zhang L, et al. The prevalence and molecular spectrum of alpha- and beta-globin gene mutations in 14,332 families of Guangdong Province, China. PLoS One. 2014;9(2):e89855.

    Article  Google Scholar 

  37. 37.

    Liao C, Yin AH, Peng CF, Fu F, Yang JX, Li R, et al. Noninvasive prenatal diagnosis of common aneuploidies by semiconductor sequencing. Proc Natl Acad Sci U S A. 2014;111(20):7415–20.

    CAS  Article  Google Scholar 

  38. 38.

    Ye J, Chen C, Yuan Y, Han L, Wang Y, Qiu W, et al. Haplotype-based noninvasive prenatal diagnosis of hyperphenylalaninemia through targeted sequencing of maternal plasma. Sci Rep. 2018;8(1):161.

    Article  Google Scholar 

  39. 39.

    Piel FB, Weatherall DJ. The alpha-thalassemias. N Engl J Med. 2014;371(20):1908–16.

    Article  Google Scholar 

  40. 40.

    Haque IS, Lazarin GA, Kang HP, Evans EA, Goldberg JD, Wapner RJ. Modeled fetal risk of genetic diseases identified by expanded carrier screening. JAMA. 2016;316(7):734–42.

    Article  Google Scholar 

  41. 41.

    Miar Y, Sargolzaei M, Schenkel FS. A comparison of different algorithms for phasing haplotypes using Holstein cattle genotypes and pedigree data. J Dairy Sci. 2017;100(4):2837–49.

    CAS  Article  Google Scholar 

  42. 42.

    Harris RA, Washington AE, Nease RF Jr, Kuppermann M. Cost utility of prenatal diagnosis and the risk-based threshold. Lancet. 2004;363(9405):276–82.

    Article  Google Scholar 

  43. 43.

    Huang T, Shu Y, Cai YD. Genetic differences among ethnic groups. BMC Genomics. 2015;16:1093.

    Article  Google Scholar 

  44. 44.

    Chen C, Li R, Sun J, Zhu Y, Jiang L, Li J, et al. Noninvasive prenatal testing of α-thalassemia and β-thalassemia through population-based parental haplotyping. Datasets. China National Genebank (CNGB) Nucleotide Sequence Archive (CNSA). 2020.

  45. 45.

    Chen C, Li R, Sun J, Zhu Y, Jiang L, Li J, et al. Noninvasive prenatal testing of α-thalassemia and β-thalassemia through population-based parental haplotyping. European Variation Archive (EVA): Datasets; 2021.

    Google Scholar 

  46. 46.

    Beagle 4.0. Accessed 11 Dec 2018.

  47. 47.

    BWA 0.7.12. Accessed 11 Dec 2018.

  48. 48.

    Picard 1.87. Accessed 11 Dec 2018.

Download references


The authors thank the affected families participating in the study. We also thank the medical staff of Guangzhou Women and Children’s Medical Center for help with acquiring the blood samples, and we thank the sequencing and bioinformatic center of BGI-Tianjin for data procurement and transmission.


The work was supported by the Major Technical Innovation Project of Hubei Province (No. 2017ACA097), the Special Foundation for High-level Talents of Guangdong (No. 2016TX03R171), and the Shenzhen Municipal Government of China (No. JCYJ20170412152854656). The funding body played no role in the design of this study and collection, analysis, and interpretation of data and in writing of the manuscript.

Author information




CC, ZP, and CL designed the study. LJ, XA, FZ, and LF conducted the experiments. JS, YZ, FG, YW, and PN conducted the statistical analyses. RL, JL, FF, and JW validated the NIPT results. CC and JS drafted and revised the paper. ZP, CL, BX, YY, JS, YD, YS, XG, SZ, and WW supervised the project. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Zhiyu Peng or Can Liao.

Ethics declarations

Ethics approval and consent to participate

The ethics committee of Guangzhou Women and Children’s Medical Center and BGI approved this study (2017102408 and BGI-IRB 18043). Written informed consent was obtained from all participants before sample collection. This study was performed in accordance with the principles of the Helsinki Declaration.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Fig. S1.

Variant spectrum of thalassemia in our published data [35].

Additional file 2: Table S1.

Clinical information of the participating families.

Additional file 3: Supplementary Methods.

Additional file 4: Table S2.

Characteristics of the 59 families.

Additional file 5: Fig. S2.

Concordance of parental haplotypes deduced by PBH and FBH.

Additional file 6: Table S3.

Concordance of parental haplotypes deduced by PBH and FBH.

Additional file 7: Fig. S3.

NIPT results for thalassemia.

Additional file 8: Table S4.

Cost and turnaround time of PBH-NIPT.

Additional file 9: Source data for Fig. 3.

Additional file 10: Source data for Fig. S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Chen, C., Li, R., Sun, J. et al. Noninvasive prenatal testing of α-thalassemia and β-thalassemia through population-based parental haplotyping. Genome Med 13, 18 (2021).

Download citation


  • NIPT
  • Recessive monogenic diseases
  • Haplotypes
  • Population-based haplotyping
  • α-Thalassemia
  • β-Thalassemia