Lecture 8, 9 & 10 (Updated 9/11, 9:15 am)

Transposable elements or Controlling elements.
 

    We normally (and correctly) think of a gene as a very stable unit. Mutation rates are low (10-6 to 10-7 ) and the position of the gene on the chromosome is constant. That this is not always the case comes from studies of  mobile DNA elements  termed transposable elements (TE), or controlling elements or transposons, that have the ability to move throughout the genome. These elements have been found in virtually all organisms.

     Transposable elements were first found in maize by Barbara McClintock who was later awarded the Nobel Prize for this work.  She identified these elements due to a property that some - but not all - TEs have, namely causing mutability of normally very stable genes.  McClintock defined TEs as follows:  Transposable elements, of apparently sporadic occurrence, which make themselves visible through their effects on gene activities.  The element may move to a new site in which case gene action at the former position maybe  restored.

    Studies of genes and genomes continue to identify the significant impact of transposable elements on  living organisms.  Four fundamental properties of  our genetic material have been impacted by TEs.
 

Secondly, TEs can be used as a tool for gene isolation.  This is detailed below.
 

        First, some general comments about transposable elements:
 

  1) Some elements are associated with genetic instability while others are not.

   2) Some move through DNA while others move through RNA     3) Elements usually (but not always have repeats on their ends. 4.)  Elements usually make a duplication of host sequences upon insertion. 5.  Excision events are imprecise. 6. A protein encoded by the transposable element is needed for transposition.   7.  Mutability allows one to follow cell fates through development. 8. Non autonomous elements are derived from autonomous elements by mutation  and can become quite divergent. 9.) Origin of Transposable Elements.
  Some classic plant examples:
   Transposable elements as tools for gene discovery:
 "The ever-present scientific triangle :  Gene, Clone & Phenotype."

    Rationale: In  modern studies of biological sciences, the relationships between a gene and  its biochemical function and the phenotype following loss of its function are becoming more and more common.   Historically, people have entered the "triangle" from basically two positions.  Some start with a mutant and wish to decipher the underlying biochemical lesion (forward genetics) while others start with a cloned gene and wish to know the phenotype of a knock-out mutant (reverse genetics).  Transposable elements have been used in both approaches.  In fact, because of the wealth of cloned genes now in databases and more refined experimental procedures,  it is now possible to start with a biochemical pathway and isolate genes and mutants in one step. Procedures for each of the three goals is detailed below.

  (a) Have mutant, want gene (forward genetics).
 

Step 1.   Cross known mutant with stock containing the wildtype form of the gene to be tagged and the transposable element system. Rationale: With those organisms in which self-pollination is possible, it is best to use the TE stock as female and the mutant as male.  This alleviates problems of accidental self-pollinations giving false positives.

                               Cross 1:   TE stock (M1/M1)   X  mutant  (m1-R/m1-R)

Step 2.  Select resulting progeny exhibiting mutant phenotype. Rationale: What one is looking for is an insertion into the gene of interest  abolishing gene function.  Such insertions are seen in the cross above as mutant progeny.  They are rare events. 

Step 3.   Cross selected mutant progeny with a wildtype stock lacking the TE.  Rationale:  From this point on, the goal is to associate, via co-segregation,  a TE-hybridizing DNA fragment with the newly-created mutant.  This step is necessary since there are multiple copies of the element in the genome.  Since the newly-created mutant and the starting mutant likely have the same phenotype, it is best to separate the two as quickly as possible.  A cross to wild type does this.  Each offspring from this cross will be heterozygous with a functional or wild type allele; however, it is impossible at this point to identify which mutant allele is contained in each heterozygote.   There are a number of ways to address this.  For example, if the starting mutant allele were closely linked to another recessive, morphologically-detectable mutant, selfing of the heterozygous seed from cross 2 would identify those families derived from the starting allele.  Closely-linked molecular markers (RFLPs, RAPDs, etc) are now available in many organisms and could be used for this purpose.  Below we will proceed with the worst case scenario, namely that we have no outside markers to aid in allele identification.

                            Cross 2:   selected mutant  X  wildtype lacking TE
                               (m1-R/m1-TE)     X   M1/M1     => M1/m1-R     &    M1/m1-TE

Step 4.  Cross each of five heterozygotes  from Cross 2 with  the starting mutant.

  a)   M1/?  X  m1-R/ m1-R   => M1/m1-R (wildtype phenotype)   &  ?/ m1-R (mutant phenotype)
  b)   M1/?  X  m1-R/ m1-R   => M1/m1-R (wildtype phenotype)   &  ?/ m1-R (mutant phenotype)
  c )   M1/?  X  m1-R/ m1-R   => M1/m1-R (wildtype phenotype)   &  ?/ m1-R (mutant phenotype)
  d )   M1/?  X  m1-R/ m1-R   => M1/m1-R (wildtype phenotype)   &  ?/ m1-R (mutant phenotype)
  e )   M1/?  X  m1-R/ m1-R   => M1/m1-R (wildtype phenotype)   &  ?/ m1-R (mutant phenotype)

    Rationale: As noted above, the heterozygote used in this cross will contain either the starting allele,  m1-R or the allele of interest, m1-TE.   The probability that not one of the  five heterozygotes contains m1-TE  is   (.5) 5 or 3.13%.  In other words, we have 97% chance that at least one of the resulting progenies in 4a through 4e contains m1-TE.

Step 5.  Grow representative individuals of each of the ten groups above, and extract DNA from pools of each class.  Rationale: Note that there are ten groups above.  In those progenies segregating for m1-TE, all mutant progeny will contain TE in a new position  (assuming the new mutant is due to insertion of the TE) while none of the wildtype progeny from the same family will contain the new band.  Families containing the m1-R allele will show no association of a new TE insertion with the mutant allele.

 DNA from several (the more, the better) individuals per class is used  (pooled DNA) to cancel out random associations of bands with the new mutant.  For example, if we randomly chose one wildtype progeny and one mutant progeny from a particular family, the chance of finding a genetically unlinked  DNA band in the mutant DNA prep but not in the wildtype prep is 1 in 4.  Pooling cancels out these associations.  Note that one can identify TE-hybridizing DNA fragments  linked but separate from the new mutant.  These fragments would be separable by recombination from the new mutant.  These, then, would be seen as bands of higher intensity in the mutants but present (and of lower intensity) in the wildtype DNA.

Step 6.  The 10 DNA preparations are digested with a restriction enzyme, electrophoresed, blotted and probed with the TE probe.  Band present in the mutant track but not present in the wildtype track from the same family is chosen for cloning.

Step 7.  Clone band.   Several approaches can be used here.  The simplest is to isolate DNA in the size range of the hybridizing fragment, synthesize a genomic library in a conventional lambda vector, probe the library with the TE and purify the hybridizing band.

 Step 8.    Isolate fragment(s)  abutting the TE element.   One should be able to isolate two junction fragments of the TE with the gene; however one will suffice.  Standard molecular techniques are used here.  (If you are not familiar with them, I would be happy to discuss these with you. LCH)

Step 9  Abutting fragments isolated in (8) are used to isolate wildtype sequences.

Step 10.  Prove cloned fragment is really the gene.    Several methods can be used. The best and most direct is to transform the original mutant with the cloned wildtype gene and obtain complementation.  Other approaches include:  probing  other, independently-derived mutants from step 1 and showing that the same fragments are altered or probing heritable revertants of the original mutant and showing that the wildtype pattern seen on Southern blots returns.  (This last approach is problematic with Robertson's Mu since its reversion rate is quite low. )
 

(b) Have gene, want mutant (reverse genetics).

    In lots of situations, an investigator has cloned a gene and wishes to know what happens when its function is lost.  This is sometimes called "reverse genetics".
Fortunately, the procedure here is much less involved than that discussed above.  It can be as simple as a few phone calls. Unfortunately some of these calls are with the university attorneys.

Rationale:  Given an actively mobile  TE element and given a population of sufficient size, every gene should contain at one insertion in the entire population. Such populations are now available in several organisms.  In the case of maize and Robertson's Mu,  investigators first at Pioneer Hi-Bred, later at Norvartis  and now at the University of Florida have generated populations of sufficient size to tag every gene.  Pioneer synthesized 50,000 maize families from various divergent sources but which all contained Mu.   The population has been screened for insertions into about 100 genes and the success rate has been found to be quite high.

    (Note that other forms of insertion can also be used as along as the sequence of the insertion is known.  A very large population of Arabidopsis plants containing T DNA insertions has been generated at several locations.  Several UF laboratories have taken advantage of the Salk collection.)

    Detection of a Mu insertion into a cloned gene turns out to be rather simple. Recall that Mu, like many TEs,  contains repeated sequences on its termini.  Here they are inverted and ~ 200 base pairs in length.  Because they are inverted, one PCR primer complementary to the terminus can synthesize DNA in both directions.

      Assume that you have DNA from a line with Mu inserted into your gene and another line with no such insertion.  If you added to this DNA a primer complementary to your gene and another primer complementary to a Mu end and perform a PCR reaction, a DNA product should be produced from the first line but not from the second line.   This is the basis to the approach.

   Fortunately 50,000 PCR reactions need not be run.  Empirically, it has been found that one can mix 300 samples and detect a single insertion in a single genome.  Hence, instead of running 50,000 samples, one need only perform 50,000/300 or 167 initial reactions.  Given identification of a particular pool containing the insertion, another trick is played to cut the work even further.    Note that  289 samples can be arranged in a grid 17 samples long and 17 samples wide.  If we arrange the samples like this and then make mixtures along both axes, any one sample is represented in two pools.  There are  17 + 17 or 34 pools and hence only 34 PCR reactions are needed to probe 289 samples.   Hence, one can screen a population of 50,000 families and identify the one family with the insertion in your gene of interest in only (167 + 34) or  201 PCR reactions.  This is basically a day or two job.

      There is another advantage to this.  Note that the PCR screen can detect the insertion in your gene of interest when the gene is heterozygous with another allele.  Hence mutants that are lethal in homozygous condition can be detected in the screen.

 The procedure here is simple.  One sends to Pioneer or Norvatis a primer for one's gene.  (Actually, Pioneer likes to do this with two primers, one from each end of the gene.)  The complicated part of this is getting an agreement between the seed company and your institution.  The goal of seed companies is to make money and they wish to tie up patenting/licensing aspects of any technology resulting from your subsequent work with the gene.

 Given agreement, seeds of the appropriate stock(s) are sent to you.

 Interestingly, less than 10% of the insertions into cloned genes so far isolated by Pioneer have clear, discernible phenotypes.  There are two classical explanations for this (1) the gene is so important that homozygotes die at a very early stage in development, or (2) there exists gene or functional redundancy such that removal of one copy is compensated for by other members of the family or by a redundant pathway.  One could also argue that the gene is simply uninportant.  Also quite interesting is the fact that the frequency of  gene insertions with obvious  mutant phenotypes is also low (<10%) in yeast.

(c) Have nothing, want it all, a.k.a. Don McCarty.

       Recent developments in PCR technology, and the availability of micro-arrays of cDNA clones and sequences of known and unknown genes in databases have opened up a new approach to studying function, genes and mutants.  Dr. Don McCarty, who will present the material in this course in developmental genetics is a leader in this area. 

    In this approach,  an inbred with an active transposable element is simply self pollinated and one looks for new mutants.  Mutants can be of any type.  This works well for people interested in a particular phenotype but not necessarily tied to a particular gene.  There was a project here where seed mutants were collected -- any mutant that altered the seed in any way.  In this experiment, approximately 7% of the ears segregated for a seed mutant.  Hence, there are lots of target genes; estimated to be around 300. 

    Each of these mutants then is a candidate for gene isolation, as described below. 

 The first development is termed TAIL-PCR.  Specific to tagging and more specifically to Mu-tagging, PCR reactions are run with a PCR primer complementary to the Mu terminus (as discussed above) and also with what is termed a promiscuous primer.  This primer has two important properties: (1) It is degenerate.  In other words, instead of being one sequence, it is a mixture of several but a finite number sequences.  (2) It has an AT content higher than the Mu primer.

 The PCR machine is programmed such that the first few rounds of amplification are done at higher temperatures of hybridization.  In doing so, only the Mu primer hybridizes and amplifies.  There is what is called linear amplification of Mu sequences.  This increases the number of templates for the second round of PCR.

 The second round of PCR is done at lower temperatures.  This allows the promiscuous primer to hybridize and one get exponential amplification.   The third step of PCR involves replacement of the first Mu primer with a second Mu primer internal or nested to the first one.  Again a round of high-temperature PCR followed by a second round of low-temperature PCR is performed.

    Another approach is to do first strand synthesis using the Mu primer, then place a run of dA's on the 3' end of the first strand.  Then oligo- dT is added to prime off the run of A's. 

 The two approaches each result in several to many amplified sequences termed amplicons.   There are several things one can do:
 

<>TILLING is another form of "reverse" genetics that has recently been developed. 

    This technology termed TILLING for (Targeting induced local lesions in genomes)  is described in a number of publications.  Till, B.J., Reynolds, S.H., Greene, E.A. et al. (2003b) Large-scale discovery of induced point mutations with high-throughput TILLING. Genome Res. 13, 524–530 is a particularly good one.  Paste the following into your browser and you can connect to it.  (http://www.genome.org). 

    Basically this is a five step process:

(1)  Heavily mutagenize the organism with a mutagen that makes single base substitutions.  Ethyl methane sulfonate (EMS) makes singe base transitions (G/C to A/T) and is usually used for this purpose.  

(2)  Make pools of individuals from eight separate mutagenized organisms.
 
(3)  PCR a region of the gene for which mutations are needed.  One PCR primer is labeled and the fragment from wildtype should be about 1000 bp in length. 

(4)  Denature and renature the PCR product, cut with the nuclease CEL.1 and separate the fragments on a denaturing polyacrylamide gels.   

            Note that if a mutant in this 1000 bp region is present in the pool, denaturing and renaturing the DNA will give rise to a DNA duplex with a mismatched base pair.  The CEL 1 nuclease recognizes mismatches and cleaves the DNA.  This will give rise to small fragments instead of the one large, wildtype fragment on the denaturing polyacrylamide gel. Also note that since one of the two DNA primers is labeled, one can get an estimate of the size of the labeled fragment and hence the relative position of the mutant within the 1000 bp (accurate to 15 bp).  <>

(5)  Repeat the process above with each of the individuals (mixed with some wildtype DNA) to identify the mutant.  Sequence the fragment from the mutant to identify location.

    What mutants do we look for?   Note that non-sense mutations early in the gene and splice site mutations are good candidates for generating null knock-out mutations.  Likewise, missense mutations in a highly conserved region of the gene are likely loss-of-function or reduced function mutations. 

    By collecting a series of mutations, one may have a whole range of  changes, ranging from complete loss of gene  function to impaired function.  The latter  are useful with dealing with a gene  essential for   completion of a life cycle.
 

For your reading pleasure:
 

McClintock, B. (1984) Science 226, 792-801.  (McClintock's Nobel Prize acceptance presentation.  She emphasizes genomic shock as activating the elements in this article.)
 

Walbot, 1992  Annual Review of Plant Physiol & Plant Molecular Biology 43: 49-82.
   Transposons as tags.   ( A good discussion of how-to-clone with the elements.)

 Grant et al 1993. Molecular and General Genetics 241: 153 and references cited therein
     Molecular biology of Spm/En suppression (This and the article below discuss some of the complexities of the Spm/En system we didn't have time to cover.)

Menseen et al  1990.  EMBO Journal 9:  3051
     Interesting Spm/En responsive alleles.

Giroux et al  1994 Proc. Natl Acad Sci., 91:12150-154
       The element Ds acts as a perfect intron.   (A particularly clear example ( to quote W. Gilbert) of the creation of an intron. This is one of my favorites for all kinds of reasons!!)

Fedoroff.  1989.  Cell 56: 181-191 (A good review for the time.).
Gierl and Saedler.  1989. Annual Review of Genetics 23: 71-85. (A good review for the time.).

 McCarty, D. R., Settles, A. M., Suzuki, M., Tan, B. C., Latshaw, S., Porch, T., Robin, K., Baier, J., Avigne, W., Lai, J., Messing, J., Koch, K., and L. Curtis Hannah 2005   Steady-state transposon mutagenesis in inbred maize.  Plant J. 44: 52-61.  ( Gene isolation with maize Mu)

 Till, B.J., Reynolds, S.H., Greene, E.A. et al. (2003b) Large-scale discovery of induced point mutations with high-throughput TILLING. Genome Res. 13, 524–530  (TILLING)