Help:Sequencing Tool:Automatic Assessment Algorithm

The Registry provides a tool to automatically align and evaluate sequences against the sequence of a specified part. This is how that algorithm works as of 18 March 2008.

Each sequence is displayed as a green bar in the information box for that sequence and in the Automatic Alignment box if the sequence aligns with the part. The bar will be green where the quality is "good" and gray where the quality is not "good". If a BioBrick™ prefix or suffix is found, it will be marked as a red brick box under the sequence bar.

Find Best Match

The sequencing algorithms use a utility function to compare two sequences and report the alignment which matches best.

Find_Best_Match(A, B) shifts sequence A relative to sequence B one base at a time. At each position, it compares the overlapping sequence to see if the bases match. If the two bases match the score for that position is increased by one point if the bases do not match, then the score id decreased by two points. Notes: This does not work well if there is a deleted or inserted base in either sequence.

Automatic Alignment

The Automatic Alignment section compares each sequence read to the target part's sequence.

Each base in the part is compared to the corresponding base in every aligned sequence read. The program scans through all of the aligned sequences and selects the sequence with the highest quality value. If that base is an N, then the part is marked as 'N" at that base. If they agree, then that base is "good'. If they disagree, that base of the part is marked as 'Bad". If the sequence read does not have quality information, then all of the aligned sequences are examined. N's are ingnored. If ANY of the sequence reads agree with the part, then the base is marked as "Good". Otherwise, the base is marked as "Bad".

Status for the reading is reported as Inconsistent if any of the bases of the part were marked as "Bad". If all of the bases are marked "Good" the par is marked as Confirmed. If any base is marked as neither Good nor Bad, the result is marked as "Not enough information".

Help:Sequencing Tool:Automatic Assessment Algorithm

Contents

Definitions

New Sequences

Sequence Graphic

Find Best Match

Automatic Alignment