This data set contains non-redundant protein sequences of viruses and viroids from publicly available resources. Peptide sequences shorter than 10 amino acids are excluded. Environmental samples are also excluded unless they come from the NCBI RefSeq database. A shorter sequence that aligns with 100% identity to a longer sequence with the same taxon ID is treated as redundant and removed from the data set.
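The filtering rules above can be illustrated with a minimal sketch. It is not the pipeline used to build this data set. It assumes that "aligned with 100% identity" can be approximated by exact substring containment, and that records arrive as hypothetical (seq_id, taxon_id, sequence) tuples. The names filter_nonredundant and MIN_LENGTH are illustrative only.

```python
from collections import defaultdict

MIN_LENGTH = 10  # peptides shorter than 10 amino acids are excluded


def filter_nonredundant(records):
    """Return the records that survive the length and redundancy filters.

    records: iterable of (seq_id, taxon_id, sequence) tuples (assumed layout).
    Assumption: a sequence is redundant if it occurs as an exact substring
    of a retained sequence with the same taxon ID. Exact duplicates are
    therefore collapsed to a single copy as well.
    """
    by_taxon = defaultdict(list)
    for seq_id, taxon_id, seq in records:
        if len(seq) >= MIN_LENGTH:
            by_taxon[taxon_id].append((seq_id, seq))

    kept = []
    for taxon_id, group in by_taxon.items():
        # Longest first, so any containing sequence is seen before its fragments.
        group.sort(key=lambda item: len(item[1]), reverse=True)
        retained = []
        for seq_id, seq in group:
            if any(seq in longer for _, longer in retained):
                continue  # fully contained in a retained sequence of this taxon
            retained.append((seq_id, seq))
        kept.extend((seq_id, taxon_id, seq) for seq_id, seq in retained)
    return kept


if __name__ == "__main__":
    example = [
        ("a", 1, "MKTAYIAKQRQISFVKSHFSRQ"),
        ("b", 1, "AYIAKQRQISFV"),  # fragment of "a", same taxon: dropped
        ("c", 2, "AYIAKQRQISFV"),  # same fragment, different taxon: kept
        ("d", 1, "MKTAY"),         # shorter than 10 residues: dropped
    ]
    for record in filter_nonredundant(example):
        print(record)
```

The pairwise containment check is quadratic within each taxon, which is fine for illustration. A real build over RefSeq and GenBank would more likely rely on a dedicated clustering or alignment tool.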
Database updated on 2013-05-17 from NCBI RefSeq release 59 and GenBank release 195.