iPseU-NCP: Identifying RNA pseudouridine sites using random forest and NCP-encoded features

BMC Genomics

Table 4 Data distribution of the training sets and the independent test sets

Dataset	Positive samples	Negative samples	Total samples	Species	Group
S_628	314	314	628	S. cerevisiae	Training (Development)
M_944	472	472	944	M. musculus	Training (Development)
H_990	495	495	990	H. sapiens	Training (Development)
S_200	100	100	200	S. cerevisiae	Independent Test
H_200	100	100	200	H. sapiens	Independent Test
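
The counts in Table 4 can be checked mechanically. The following is a minimal sketch, not part of the original work: it transcribes the rows above and verifies that each dataset is balanced, that positives plus negatives equal the stated total, and that the numeric suffix of the dataset name matches that total. The names `TABLE_4`, `inconsistent_rows`, and `totals_by_group` are hypothetical, and the group labels for M_944, H_990, and H_200 assume that the Group cell spans the rows beneath it.

```python
from collections import defaultdict

# Rows transcribed from Table 4:
# (dataset, positive, negative, total, species, group)
# Assumption: the Group column is a merged cell covering the rows below it.
TABLE_4 = [
    ("S_628", 314, 314, 628, "S. cerevisiae", "Training (Development)"),
    ("M_944", 472, 472, 944, "M. musculus", "Training (Development)"),
    ("H_990", 495, 495, 990, "H. sapiens", "Training (Development)"),
    ("S_200", 100, 100, 200, "S. cerevisiae", "Independent Test"),
    ("H_200", 100, 100, 200, "H. sapiens", "Independent Test"),
]


def inconsistent_rows(rows):
    """Return the names of datasets whose counts do not add up."""
    bad = []
    for name, pos, neg, total, _species, _group in rows:
        balanced = pos == neg                      # equal class sizes
        sums_ok = pos + neg == total               # total matches the parts
        suffix_ok = name.split("_")[1] == str(total)  # e.g. S_628 -> 628
        if not (balanced and sums_ok and suffix_ok):
            bad.append(name)
    return bad


def totals_by_group(rows):
    """Sum the total sample counts per group label."""
    totals = defaultdict(int)
    for _name, _pos, _neg, total, _species, group in rows:
        totals[group] += total
    return dict(totals)
```

Calling `inconsistent_rows(TABLE_4)` returns an empty list when every row is internally consistent, and `totals_by_group(TABLE_4)` gives the combined sample count for the training and independent test groups.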

ISSN: 1471-2164