The Notch sequence from Drosophila is used as the sample data. Notch is thought to control cell fate decisions in development It encodes a large, transmembrane protein which may function through cell adhesion, and it was cloned and sequenced in 1985(WhartonetaI., 1985a&b; Kidd et al., 1986). Notch is an ideal sequence to analyze because it contains many features that computers are good at finding. Figure 1 shows a schematic of the Notch protein and its major features. The Notch sequence is available in the Genbank and EMBL sequence database under accession numbers M16153, M16149, M16150, M16151 and M16152 (see Appendix VIS). The most successful way to approach this chapter is to reproduce the analyses. This will familiarize one with a specific software package, and offers a more accurate picture of the volume of output data produced by many programs than could be allowed in the figures in this chapter. Programs for most of the analyses used in this chapter are widely available on IBM PCs, Macintoshes and mainframe computers. The examples have intentionally been kept generic, but programs from the following sources were used: Genetics Computer Group Sequence Analysis Software (Devereux et al., 1984; Genetics Computer Group Inc., Madison, WI),Genbank Online Services (Benton,1990),NationalLibrary of Medicine Services (Benson et al., 1990), and PC Gene (A. Bairoch, University of Geneva; ™ Intelligenetics Inc., Mountain View, CA and Genofit SA). Unless a researcher is studying nontranslatable segments of DNA, the immediate goal upon the isolation of a new gene is usually to deduce the amino acid sequence of its product. The laboratory approach might go from isolating a cDNA clone, determining its nucleotide sequence, locating alarge open reading frame, and translating the sequence into a putative protein. In this case priority is usually given to analyzing the putative protein, with promoter regions introns being sequenced later to elucidate gene regulation. The organization of the following example analysis of Notch reflects these laboratory priorities by beginning with cDNA analysis, moving to protein analysis, and then returning to DNA analysis for the genomic sequence.