Basic gene finding


All CLC bio's workbenches include the option of finding genes by determining open reading frames. This is a fairly simple approach, but may prove valuable in a number of cases - e.g. as a first step in annotating sequences such as cloning vectors or bacterial genomes.

The following parameter options are available:

  • Choice between finding reading frames on one strand or on both strands
  • Setting of minimum length of the reading frame (defined by the user)
  • Start codon, choice between
    • AUG
    • Any codon
    • All start codons in genetic code
    • User defined
  • Stop codon included in annotation? (yes/no)
  • Open ended sequence? (yes/no)
    This option allows the open reading frame to start outside the sequence
  • Choice of genetic codes (all genetic code translation tables are included)
  • Minium length of the Open Reading Frame

Read more