Transferring genomics to the clinic: distinguishing Burkitt and diffuse large B cell lymphomas

Table 3 Error rates for classifiers trained on one data set and tested on other public data sets

	BL error rate^a				DLBCL error rate^a
Normalization	Z-score	Rank	XPN	DWD	Z-score	Rank	XPN	DWD
Train GSE4732_p1: test on other data sets below
GSE4475 (strict)^b	0.09	0.09	0.09	0.09	0.017	0.017	0.006	0
GSE4732_p2	0.182	0.212	0.152	0.152	0	0	0	0
GSE10172 (strict)^b	0.231	0.308	0.385	0.308	0	0	0	0
GSE26673 eBL	0.615	0.692	0.846	0.384
GSE26673 and GSE17189 HIV-related	0.833	1	1	0.667	0	0	0	0
Train GSE4475 strict BL definition: test on other data sets below
GSE4732_p1	0.04	0.04	0.04	0.04	0.012	0.008	0.012	0.012
GSE4732_p2	0.303	0.333	0.273	0.273	0	0	0	0
GSE10172 (strict)	0.154	0.154	0.308	0.154	0	0	0	0
GSE26673 eBL	0.615	0.538	0.769	0.538
GSE26673 and GSE17189 HIV-related	0.833	0.833	1	0.833	0	0	0	0
Train GSE4475 wide BL definition: test on other data sets below
GSE4732_p1	0.02	0.02	0.02	0.02	0.04	0.05	0.06	0.07
GSE4732_p2	0.06	0.03	0.03	0.03	0.015	0.015	0.015	0.015
GSE10172 (strict)	0.078	0.078	0	0.078	0.043	0.043	0	0.043
GSE26673 eBL	0.154	0.154	0.308	0.154
GSE26673 and GSE17189 HIV-related	0.5	0.333	0.833	0.5	0	0	0	0

^aError rate is (1 − Recall) value for the indicated class [Recall = True positives/(True positives + False negatives)]
^bThe sample in this data set is assigned to mBL, intermediate, non-mBL categories; here we set the strict BL definition as the standard which put intermediate and non-mBL together as the DLBCL class. eBL endemic BL, mBL molecular BL

ISSN: 1756-994X