Dataset*

GEO access number or web location

Number of patients

% DFS data available

% OS data available

Reference

1

GSE1456

159

100

100

[22]

2

GSE2603

99

82.8

0

[23]

3

GSE6532

327

96.0

0

[24]

4

GSE3494

108&

100

100

[25]

5

GSE7390

198

100

100

[26]

6

GSE11121

197

100

0

[27]

7

GSE12093

136

100

0

[28]

8

GSE2034

286

100

0

[29]

9

ArrayExpress:E-TABM-158

130

99.2

99.2

[30]

10

NKI (http://microarray-pubs.stanford.edu/wound_NKI/)

295

100

100

[31]

11

GSE17080

185

0

0

[32]

12

GSE3744

47

0

0

[33]

*Dataset 1 to 9, 11 and 12 are profiled using Affymetrix microarray, Dataset 11 using Agilent Oligo microarray. Dataset 1-10 are used to evaluate prognostic impact of FBXW7 expression. Dataset 1-10 was used for analysis of disease-free survival in Figure 2, 3, 5 and 6. Dataset 1, 4, 5, 9 and 10 was used for analysis of overall survival in Figure 2, 3, 5 and 6. Dataset 11 and 12 contains both normal and breast cancer samples.
&There are total 251 patents in GSE3494, 143 of them overlapped with GSE6532 were removed.
Table 1: Information of gene expression datasets used in this study.