Synteny Block

Definition 🔗

A synteny block is a region of closely related genes. They are helpful in genome rearrangement, to determine genomic similarities between two genomes from different species.

Construction 🔗

Two compare the genomes of two organisms, one can first:

  1. Construct shared-kmer dot-plots between the two. For genomes AA and BB, SharedKmeri,jk=1(Kmer(A,i,k)=Kmer(B,j,k))SharedKmer^k_{i, j} = 1(Kmer(A, i, k) = Kmer(B, j, k)), taking reverse-complement pairings into account.
  2. Find diagonals of SharedKmerkSharedKmer^k greater than a certain length (to remove noise). The sign of the synteny block is based on the strandedness of the match: positive means a 535’ \to 3’ and negative means 353’\to 5’ (reverse complement). To find the diagnonals, one can use simple techniques like connected components of a graph consisting of shared-kmers within a certain max-distance of each other.