Schema for mining the homologues sequences. The schema represents the approach followed for mining PC homologues. The seed sequence, PC of Drosophila was searched in the NCBI protein sequence database and the genome sequences of the model organisms. The TBLASTN hits of genome sequences were extended 5 kb upstream and 5 kb downstream and merged. The extended hits were subjected to gene prediction and the sequences showing >98% similarity were considered as redundant records and the non redundant records were considered as putative homologues and subjected to motif predictions. Sequences were grouped based on the motif conservation.