Of the 1097 sequences 663 are alternative splice variants. In addition 111 of the sequences have the same length and are 100% identical to another sequence.
The division of the sequences shows that 661 come from the 10 regions that were selected manually and just 436 come from the 34 regions that were selected at random. Although there are less manually selected
regions, the total selected regions is teh same for the manual and "randomly" selected regions.
BLAST finds human homologues for all but 22 of the CDS sequences. 1003 sequences that have GO terms easily associated to them.
13 sequences have their whole sequence covered by PDB structure and 587 sequences find at least one template structure with BLAST.
Of the 994 sequences with PFAM domains, 42.5% (423 sequences) have at least one PFAM domain that is broken in two, either by insertions or deletions.