Seven files with indexes should be generated, together with the log file formatdb.log.
You will retrieve the peptide sequence identified as 13507741 in the database NC_000912.faa.
This operation should generate a list of peptide clusters for the Mycoplasma pneumoniae proteome. The parameters -S 30 -L 0.0 -b T specify that for a pair of sequences to be considered as neighbors, at least 30% of the aligned residues should be identical, and the overlap between the aligned sequences should be at least 0% (the obvious minimum). The parameter -b T (both true) specifies that the condition of minimum overlap must be fulfiled when calculated as relative to both sequences involved in the alignment.
The file that SHOULD BE used as input
for blastclust is the file NC_000912.faa that was located
in your home directory, and not the one that exists in the directory BLASTDATA.
This is because
blastclust generates a temporary database that is
removed automatically later. If you used the NC_000912.faa file
that is contained in BLASTDATA, blastclust would destroy the indexes
created by formatdb.
blast2adjLists_ed1.pl
blast2adjLists-ed1.pl blast_table e-value
The output of blast2adjLists-ed1.pl is an adjacency list like the one in the example previously given: it contain a list of nodes, and for each of them, a list of the connected neighbor nodes. It is easy to modify the script to generate a different output, for example, an adjacency matrix.
adja2cluster_v2.pl
adja2cluster-v2.pl adjacency_list