FIGURE 2. Network representation for 142 complete protein
sequences similar to PURases linked by 6419 edges. The protein sequences
depicted here were selected by clustering at a threshold of 90%
sequence identity. Edges (links) were selected at a threshold of 60%
global sequence similarity, without defining a core domain region. Nodes
are coloured according to their annotated source organisms, with
Proteobacteria in blue and unknown bacteria in white. The network on the
left represents sequences with an N-terminal lid and a C-terminal
β-sandwich domain and contains 127 nodes connected by 6314 edges.
Diamonds represent sequences originating from the genusPseudomonas (from the class Gammaproteobacteria). The network on
the right represents sequences similar to carboxylesterases and contains
15 nodes connected by 105 edges. Squares represent sequences originating
from the class of Betaproteobacteria. See Methods section for more
details on the network layout.