Skip to main content

Advertisement

Table 3 Error rates for classifiers trained on one data set and tested on other public data sets

From: Transferring genomics to the clinic: distinguishing Burkitt and diffuse large B cell lymphomas

  BL error ratea DLBCL error ratea
Normalization Z-score Rank XPN DWD Z-score Rank XPN DWD
Train GSE4732_p1: test on other data sets below
GSE4475 (strict)b 0.09 0.09 0.09 0.09 0.017 0.017 0.006 0
GSE4732_p2 0.182 0.212 0.152 0.152 0 0 0 0
GSE10172 (strict)b 0.231 0.308 0.385 0.308 0 0 0 0
GSE26673 eBL 0.615 0.692 0.846 0.384     
GSE26673 and GSE17189 HIV-related 0.833 1 1 0.667 0 0 0 0
Train GSE4475 strict BL definition: test on other data sets below
GSE4732_p1 0.04 0.04 0.04 0.04 0.012 0.008 0.012 0.012
GSE4732_p2 0.303 0.333 0.273 0.273 0 0 0 0
GSE10172 (strict) 0.154 0.154 0.308 0.154 0 0 0 0
GSE26673 eBL 0.615 0.538 0.769 0.538     
GSE26673 and GSE17189 HIV-related 0.833 0.833 1 0.833 0 0 0 0
Train GSE4475 wide BL definition: test on other data sets below
GSE4732_p1 0.02 0.02 0.02 0.02 0.04 0.05 0.06 0.07
GSE4732_p2 0.06 0.03 0.03 0.03 0.015 0.015 0.015 0.015
GSE10172 (strict) 0.078 0.078 0 0.078 0.043 0.043 0 0.043
GSE26673 eBL 0.154 0.154 0.308 0.154     
GSE26673 and GSE17189 HIV-related 0.5 0.333 0.833 0.5 0 0 0 0
  1. aError rate is (1 − Recall) value for the indicated class [Recall = True positives/(True positives + False negatives)]
  2. bThe sample in this data set is assigned to mBL, intermediate, non-mBL categories; here we set the strict BL definition as the standard which put intermediate and non-mBL together as the DLBCL class. eBL endemic BL, mBL molecular BL