Skip to main content

Table 4 Data distribution of the training sets and the independent test sets

From: iPseU-NCP: Identifying RNA pseudouridine sites using random forest and NCP-encoded features

Dataset

Number of samples

Species

Group

 

Possitive

Negative

Total

  

S_628

314

314

628

S. cerevisiae

Training (Development)

M_944

472

472

944

M. musculus

 

H_990

495

495

990

H. sapiens

 

S_200

100

100

200

S. cerevisiae

Independent Test

H_200

100

100

200

H. sapiens