Isolating genes

 

We talked about getting the gene for the insulin hormone, putting it into a plasmid which we put into bacteria, and getting the bacteria to make our insulin.  How do we get the insulin gene in the first place?

 

Libraries

-          somehow cut up genome from which you are trying to isolate a gene (in this case the human genome) and put the pieces into a vector

o       makes a storable collection of clones that contains copies of every sequence in the whole genome

-          how to cut up?

o       Can use restriction enzymes

o       Can just shear DNA to make random pieces

-          vectors (Table 9.2)

o       plasmids

§         small inserts (say 5kb or less)

§         can have about 200 bacterial colonies on a plate that are well separated and don’t grow into each other

o       phage lambda (l)

§         phage is a virus that infect bacteria

§         dsDNA genome

§         can insert up to 25kb of dsDNA

§         Can easily have 1000 separate lambda/plate

o       Cosmid

§         Derived from phage lambda

§         Is maintained like a plasmid in a bacterial cell

§         Can insert up to 45kb

o       Phage P1

§         Derived from a phage

§         Can insert up to 100kb

o       BAC

§         Bacterial artificial chromosome

§         A human-made chromosome, circular, contains chromosomal origin or replication

§         Can insert up to 150kb

o       YAC

§         Yeast artificial chromosome

§         A human-made chromosome, linear

§         Can insert up to 2 megabases, but often several hundred kb

Why is size of insert such an important consideration in vectors?

-          if making human genomic library, human genome is 3X109 bp

o       Plasmid say 5kb/plasmid insert

§         3X109 (size of genome)/5X103 (av. size of insert) = 6X105 clones

§         = 3000 plates if 200 colonies/plate

o       Phage say 20kb/phage insert

§         3X109/2X104 = 1.5X105 clones

§         = 150 plates if 1000 phage/plate

o       BAC av size of 100kb insert

§         3X109/1X105 = 3000 clones

§         = 15 plates is 200 colonies/plate

 

What about genome equivalents?

-          we are assuming in the example above that the genome was cut up into completely nonoverlapping fragments and that each vector will take one fragment (one genome equivalent)

-          in reality you are cutting up many genomes and vectors are getting the same or overlapping fragments

o       so if you only had 3000 clones in the BAC example above, it is very likely that some sequences would be present multiple times, and some not at all

o       so need to make a bigger library, like to have four or five genome equivalents

§         on average each gene present 4 or 5 times in library

§         95% chance that there is at least one of every gene

 

 

cDNA libraries

-          when insert is not genomic DNA but rather cDNA

-          cDNA is a dsDNA copy of a mRNA

o       construction (Fig. 9.10)

§         isolate mRNA from a tissue

§         use enzyme reverse transcriptase and TTTTTTT primer to make a DNA copy of the mRNA

§         destroy RNA with RNAse enzyme

§         take advantage of hairpins at 3’ end to prime complementary strand synthesis

§         end you get a double stranded DNA copy of an mRNA

-          eliminates problem of introns in genomic DNA sequence

-          also allows you to make library of only genes expressed by a particular cell type or under a particular condition

o       this can be an advantage or a disadvantage (see Fig. 9.11)