AmbiScript: A functional notation to support the manipulation and analysis of genetic data
AmbiScript is a functional nucleic acid notation, which uses character symmetries to facilitate the complementation of deoxyribonucleic acids (DNA) and identification of palindromic regions1. The symbols have been designed to underscore key biochemical properties of the nucleotides, trivialize the derivation and complementation of ambiguity characters, and support the visualization of sequence polymorphisms. This web site highlights five functional characteristics of the notation.
- Mnemonics. AmbiScript symbols use prominent typographic features to highlight key biochemical aspects of the nucleic acids they represent.
- Complementation. Complementary nucleotides are encoded by the rotationally related ambigraphic character, making it is possible to complement genetic sequences by simply rotating the text 180°.
- Palindromes. Character symmetries also make it easy to locate palindromic regions in AmbiScript encoded genetic sequences.
- Legibility. Distinctive ascenders and descenders facilitate the identification of nucleotide polymorphisms in multiple sequence alignments. An ultra-bold version of the AmbiScript notation further enhances sequence legibility.
- Ambiguity. Ambiguity characters are easily constructed and complemented by overlaying the symbols for the constituent bases.
- Rozak DA, Rozak AJ. 2008. Simplicity, function, and legibility in an enhanced ambirgaphic nucleic acid notation. BioTechniques 44(6):811-813. (PMID: 18476835)