{"id":70,"date":"2020-07-31T20:21:55","date_gmt":"2020-08-01T00:21:55","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/kathleef\/?post_type=chapter&#038;p=70"},"modified":"2025-06-16T16:16:35","modified_gmt":"2025-06-16T20:16:35","slug":"chapter-9-investigating-gene-function-2-knock-out-and-knock-down","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-9-investigating-gene-function-2-knock-out-and-knock-down\/","title":{"raw":"Investigating Gene Function Part 3 - Knock-down and Knock-out","rendered":"Investigating Gene Function Part 3 &#8211; Knock-down and Knock-out"},"content":{"raw":"<h1 style=\"text-align: center\">Introduction<\/h1>\r\nIn this section we will look at ways to go beyond detecting when and where a gene is expressed; we will explore ways of reducing or eliminating the gene's expression, and using the resulting phenotype to deduce the likely role of the gene product. Understanding when and where the gene is expressed (discussed in <a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7a-analyzing-gene-function1-gene-expression\/#B_When_and_where\">Chapter 7<\/a> and <a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7b-investigating-gene-function-3-expression-constructs\/#A1_what\">Chapter 8<\/a>) will help us know where we should expect to see the phenotype resulting from reduced or no expression of the gene. It is an important first step in elucidating gene function.\u00a0 Note that we could be surprised, however. Sometimes despite the knowledge we have accumulated about a gene, we find it works in an unexpected way - and produces a phenotype we would not have predicted. This is what is exciting about science- we don't really know the answer until the experiments are done.\r\n\r\n&nbsp;\r\n\r\nWe will cover four means of eliminating or reducing gene expression here:\u00a0 insertional mutagenesis, homologous recombination, RNAi, and CRISPR-Cas9. There will be the strongest focus on the last two of these, both of which are borrowed from bacteria. We will consider how they work in nature, protecting cells from infection and then how they are used in analysis of gene function.\u00a0 The approaches used to make the constructs for these applications will also be described.\r\n\r\n&nbsp;\r\n<div class=\"textbox shaded\">\r\n<h2 style=\"margin-top: 0em\">Contents<\/h2>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#Learning_Outcomes\">Learning Outcomes<\/a>\r\n<a href=\"#terminology\">Terminology before we begin<\/a>\r\n<a href=\"#A_insertional\">A. Insertional Inactivation<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#A1_Ti\">A-1. Ti Plasmids in Plants<\/a>\r\n<a href=\"#A2_p_element\">A-2. P elements in fruit flies<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#B_homologous\">B. Homologous Recombination<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#C_RNAi\">C. RNAi<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#C1_history\">C-1. A little history<\/a>\r\n<a href=\"#C2_how\">C-2. How it works<\/a>\r\n<a href=\"#C3_Making_constructs_for_RNAi\">C-3. Making constructs for RNAi<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#D_CRISPR\">D. CRISPR<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#D1_preamble\">D-1. Preamble<\/a>\r\n<a href=\"#D2_bacteria\">D-2. How bacteria do it<\/a>\r\n<a href=\"#D3_how\">D-3. How we do it: making CRISPR constructs<\/a><\/p>\r\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 4em\"><a href=\"#D3i_NHEJ\">D3-i. Knock out - NHEJ repair<\/a>\r\n<a href=\"#D3ii_HDR\">D3-ii. Directed modification - HDR<\/a>\r\n<a href=\"#D3iii_Designing_oligonucleotides\">D3-iii. Designing oligonucleotides that will generate an overhang for cloning<\/a><\/p>\r\n&nbsp;\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/back-matter\/recorded-lecture-videos\/#8\">List of lecture videos (excluding supplemental videos)<\/a>\r\n\r\n<\/div>\r\n\r\n<hr style=\"height: 5px;border-top: solid black\" \/>\r\n\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<h2 class=\"textbox__title\" style=\"margin-top: 0em;margin-bottom: 0em\"><a id=\"Learning_Outcomes\"><\/a>Learning Outcomes<\/h2>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li>\r\n<div><span style=\"text-decoration: underline\">Describe<\/span> the methods used to knock out or knock down gene function<\/div><\/li>\r\n \t<li><span style=\"text-decoration: underline\">Distinguish<\/span> between forward and reverse genetic approaches<\/li>\r\n \t<li>\r\n<div><span style=\"text-decoration: underline\">Distinguish<\/span> between the techniques \u2013 used in different organisms, different approaches etc.<\/div><\/li>\r\n \t<li>\r\n<div><span style=\"text-decoration: underline\">Describe<\/span> how we make constructs for RNAi and how RNAi works<\/div><\/li>\r\n \t<li><span style=\"text-decoration: underline\">Describe<\/span> how to make a CRISPR construct through the method we will use in Bioinformatics Assignment#2<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n\r\n<hr style=\"height: 5px;border-top: solid black\" \/>\r\n\r\n<h2><a id=\"terminology\"><\/a>Terminology before we begin:<\/h2>\r\n<span style=\"color: #0000ff\"><span style=\"color: #000000\">It is important to learn a bit of terminology before we begin.\u00a0 <span style=\"color: #0000ff\"><strong>Forward genetics<\/strong><\/span> is a process by which we induce many mutations at random and then after we've done all the work of isolating the mutations, we have to still figure out which mutations we want to study, and then do a lot of analysis to find out which gene is mutated<\/span><strong>.\u00a0 Reverse genetics <\/strong><span style=\"color: #000000\">i<\/span><span style=\"color: #000000\">s an approach by which we identify what gene we want to study and target it specifically in some way. We then see what the effect is of altering the expression of the gene in order to determine what its function is.\u00a0 This has only been possible since the sequencing of the genomes <\/span><span style=\"color: #000000\">of many model organisms.\u00a0 Both approaches have value but reverse genetics approaches are more targeted so we know exactly what changes we are making in the gene which helps us interpret the results. Forward genetics can produce mutations that you would never have designed on purpose and that produces an effect that is unexpected or unpredictable. But it is much more work than reverse genetics<\/span><strong>. <\/strong><span style=\"color: #000000\">We will be looking at one forward genetics approach and multiple reverse genetics approaches in this chapter.<\/span><\/span>\r\n\r\n<span style=\"color: #0000ff\"><strong>Loss of function<\/strong><\/span> mutations are mutations that reduce or eliminate a gene's function. In this case we may be eliminating the gene itself or preventing it from producing any of its protein product. These are examples of <strong><span style=\"color: #0000ff\">amorphic<\/span><\/strong> mutations; ones that are not producing any of the protein product they normally make.\u00a0 In other cases we might reduce the transcription of the gene but not completely eliminate it. Or a protein could be produced that is missing some amino acids or has some incorrect amino acids and though it is not very active, it is able to perform its function a little bit. These are examples of <strong><span style=\"color: #0000ff\">hypomorphic<\/span><\/strong> mutations, in these mutations the function of the gene is reduced but there is still some protein that somewhat does the job. To really understand the function of a gene we want to see what the phenotype of the amorphic mutation is but we like to use hypomorphic mutations for other purposes- both have their uses in genetics.\r\n\r\n&nbsp;\r\n\r\nThere are also <strong><span style=\"color: #0000ff\">gain of function<\/span><\/strong> mutations in which - for example - a gene is more active than usual either because it is being transcribed at a higher rate than usual, or because the protein can't be degraded or down-regulated when it is supposed to stop performing its function. There may not be a phenotype associated with this type of <strong><span style=\"color: #0000ff\">hypermorphic<\/span><\/strong> mutation, but there could be a very strong and informative effect. It is very dependent on what the gene's product does. <span style=\"color: #0000ff\"><strong>Neomorphic<\/strong> <\/span>mutations involve the mis-regulation of the gene or a change in the protein that causes it to do something different from the wild type situation. In these situations a gene might be expressed in a stage or type of tissue where it is not normally expressed. Or, the protein might be modified somehow so that it interacts with other proteins it would not normally interact with in a cell. Or it might localize to a place where it is not supposed to be. Sometimes these changes are very impactful. A signalling molecule that is active in the wrong tissue can promote cell division when it is not appropriate for the cells to be dividing- this could lead to tumour formation.\r\n\r\n&nbsp;\r\n\r\n[h5p id=\"31\"]\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-1.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=kxfzpby29x8&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n<hr \/>\r\n\r\n<h2><a id=\"A_insertional\"><\/a>A. Insertional Mutagenesis:<\/h2>\r\nOne way to knock out a gene's function is to make a mutation in it. There are various ways to cause changes in DNA sequence of genes. You can treat the organisms with chemicals or radiation and then look for the phenotypes that result. Then you must do a lot of work to figure out what gene has been mutated to cause the phenotype you see.\u00a0 This is an example of <span style=\"color: #000000\">forward genetics<\/span><strong><span style=\"color: #0000ff\">.<\/span><\/strong> In this approach, mutations are induced (caused) at random and then you have to figure out what gene has been altered. A different way to make a mutation is to insert a segment of DNA into the gene. It is quite unlikely that adding a huge DNA sequence into a gene would leave the gene still functioning perfectly. This is because the gene will be frame-shifted. The gene can still be transcribed, but when the RNA is translated, at the point where the inserted DNA was transcribed, the amino acid sequence of the protein will be incorrect. And most likely there will be a stop codon in the sequence, leading to a truncated (shortened) protein that contains the wrong amino acids.\u00a0 \u00a0This is called <span style=\"color: #0000ff\"><strong>insertional inactivation<\/strong><\/span> and we have systems for using this approach in some of our commonly used model organisms. The insertion of the DNA into the genes to cause mutations is still random, but we can use our knowledge of the DNA we have introduced and the power of PCR to more easily determine which gene has been altered. We will talk about 2 types of insertional inactivation. The principles of these methods apply to other methods in other organisms you might learn about in other courses. (<span style=\"color: #993366\"><strong>Note: For this semester, I am going to leave out the second part of this section, part A-2 about fruit flies<\/strong>)<\/span>\r\n\r\n[h5p id=\"34\"]\r\n\r\n&nbsp;\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"A1_Ti\"><\/a>A-1. Ti Plasmids in Plants<\/h3>\r\nThere is a type of soil dwelling bacterium that can infect plants and cause large \"galls\" which are a type of benign tumour. The tissues in these \"galls\" are reprogrammed by the bacterium to produce a protected place for the bacteria to live, and to also produce certain amino acids and other nutrients that the bacteria consume. The bacterium, <em>Agrobacterium tumefaciens,\u00a0<\/em> contains a very large circular plasmid, 140 kb to 235 kb in size, called the Ti plasmid. The plasmid has features you would expect of any plasmid, such as an origin of replication. It also has a region called T-DNA which actually causes the formation of the tumour.\u00a0 During infection of the plant tissue, a copy of this T-DNA is inserted into the genome of the plant (the nuclear genome). In nature, this T-DNA has genes that encode enzymes that produce hormones in the host plant. It is this manipulation of the plant's own hormones that leads to the formation of the tumour, called a crown gall.\r\n\r\nThere is also a region on the T-DNA responsible for synthesis of opines. These are unusual derivatives of sugars or amino acids and they provide nutrients to the bacteria living in the tumour tissue. The Ti plasmid also has genes on it that are needed for using the opines as a source of nutrition. These genes are not transferred to the plant. They are needed only by the bacteria.\r\n\r\nThe T-DNA has a short sequence (about 25 bp) to either side of it; these are called LB for left border and RB for right border.\u00a0 These sequences are necessary for the transfer of a copy of the T-DNA to the host plant's genomic DNA. The Ti plasmid (but not the T-DNA) also has a<span style=\"text-align: initial;font-size: 1em\"> virulence region; this is where the genes necessary to allow the bacterium to infect the host cell are found. There are some other regions on these huge plasmids that are not relevant for this topic. I am adding a simple diagram of the Ti plasmid from wikipedia. Pay attention to the LB and RB sequences, what is in the T-DNA part of the plasmid and what is not. The genes in the T-DNA segment are the ones that are copied and moved into the plant's genome while the ones that are not in this region are functional in the bacterium only.<\/span>\r\n\r\n<img class=\"\" src=\"https:\/\/upload.wikimedia.org\/wikipedia\/commons\/d\/d1\/Ti_plasmid.svg\" alt=\"Ti plasmid - Wikipedia\" width=\"403\" height=\"305\" \/>\r\n\r\nAs with so many of our genetic engineering techniques, researchers have found something operating in nature and have figured out how to use it in research.\u00a0 The Ti plasmid has been modified for use in the genetic engineering of plants.\u00a0 We'll talk later in the semester about some details of how the plasmid is used but for now the main thing to know is that one of the changes made is to remove the genes relating to plant hormone regulation and opine metabolism and to add a gene for antibiotic resistance. The plasmid can be introduced into plants using the antibiotic resistance as a selection mechanism.\u00a0 It is more complex than it sounds, and again, we will talk about the details of how the DNA gets into the plants and the process of selection etc. later in the semester.\u00a0 The result of transforming plant cells on a large scale is that many tens of thousands of lines of plants (perhaps by now, hundreds of thousands!) have been generated, each with a unique mutation in it, caused by an insertion of a large segment of T-DNA.\r\n\r\nSo far this is a lot like other forward genetics techniques but the work needed to try to figure out which gene has been altered by the mutation is much less than when genes are mutated by chemicals or radiation. This is because we know the exact DNA sequence of the T-DNA. And we can use this knowledge to design a primer to sequence the DNA of plants with a T-DNA insert into a gene. The primer recognizes the T-DNA sequence and is directed outward towards the gene sequence. This is a way of using the sequence we know - the T-DNA - to identify the sequence we don't know - the plant gene.\r\n\r\n[h5p id=\"35\"]\r\n\r\nBelow is the recorded lecture for this topic (<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-2.pptx\">click here for the powerpoint slides<\/a>):\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=DVbLeKi4Xyk&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"A2_p_element\"><\/a>A-2. P-elements in fruit flies<\/h3>\r\n<span style=\"color: #993366\"><strong>NOTE: for now, we will leave out this section. In many ways it is very similar to the plant example already explained. I've written it and it will stay in because it might be used in a later offering of the course. But you can ignore section A-2 this time.<\/strong><\/span>\r\n\r\nIn fruit flies, the transposable element called the P-element is used to create mutations and to make transgenic flies for other reasons.\u00a0 The P-element is remarkable: it is a gene which has the ability to move from place to place in the genome. The single gene on a P-element encodes transposase which is the name of the enzyme that cuts the DNA to either side of the element and inserts it into a new place in the genome.\r\n\r\nP-elements were introduced into <em>Drosophila melanogaster<\/em> (the fruit fly most used in genetics research) probably no more than about 200 years ago. In the early days of fly research, flies isolated from the wild lacked P-elements and by the 1970 virtually every strain established from a wild population contained plenty of these elements. This is a tremendously rapid evolutionary change from the element being extremely rare in the 1920s to being practically ubiquitous by the 1970s.\r\n\r\nIt perhaps won't surprise you to learn that P-elements have been modified from their wild form for use in fly research.\u00a0 For instance, the P-element has been transformed into a reporter construct, by putting a GFP gene on it, with the appropriate regulatory sequences and some of you who have taken BISC 302W already know quite a bit about all the interesting research being one with P-elements. Here we will just briefly outline their use in forward genetics approaches.\r\n\r\nLike the Ti plasmid in plants, the P-element can be introduced into flies and allowed to insert into genes causing mutant phenotypes. In the fly community a concerted effort went into doing genetic screening on a large scale to try to produce a mutation in every single gene of the fly. Each mutation was of course kept as a separate line of flies that all have the same P-element induced mutation.\r\n\r\nFor genetic screening, the P-elements have had their transposase gene removed and replaced with the <em>white<sup>+<\/sup><\/em> gene. The + means that it is the wild type version of the gene. Fly genes are named backwards so the <em>white<sup>+<\/sup><\/em> gene is required to make the wild type red eye of the fly.\r\n\r\nThe P-element is introduced into flies by injecting a plasmid that contains the P-element (with its <em>white+<\/em> reporter gene) into the posterior region of a very early embryo. At this stage the embryo is one big cell with many nuclei- the nuclei divide but the cell doesn't.\u00a0 The posterior region is where we introduce the DNA because we are hoping it will be incorporated into a nucleus that goes on to form a germ cell. This will then give rise to gametes which may have the P-element somewhere in the genome. The injection is quite finicky- a very thin and very sharp needle is used to make the tiniest hole possible in the embryo. You don't want to damage it. We inject the plasmid that contains the P-element and another plasmid that has the transposase gene on it. The transposase enzyme will cut the P-element out of the plasmid and insert it somewhere in the genome of some of the nuclei in the embryo.\u00a0 The flies we use for producing the embryos for injection are all mutant for the <em>white<\/em> gene. They have white eyes.\r\n\r\nAfter the injection process we allow the embryos to recover and develop into adults. All the adults that develop have white eyes. That is because most won't have a P-element in any of their cells. And those that do have the P-element incorporated into some of their cells won't have it in the eyes, which develop from anterior cells in the embryo.\u00a0 The flies though are mated to either females or males that also have white eyes. Most of the progeny of these crosses have white eyes but some have red or orange or pinkish eyes. These flies developed from an gamete that had the P-element incorporated into the DNA of a germ cell.\u00a0 All of their cells now have the P-element in that position on that chromosome. And the <em>white<sup>+<\/sup><\/em> gene is being transcribed and translated from the P-element. If it is expressed a lot the eye is nearly wild type but if it is expressed just a little the eye might be just pinkish, very pale.\r\n\r\nOnce we have a P-element containing strain, we can do large scale genetic screens where we cause the element to mobilize randomly around the genome and we then collect a large number of individuals that have P element induced mutations. Each individual is used to make a strain of flies that all have the same mutation as the original fly. A consortium of researchers initiated a project to make a P-element mutation in every single gene of the fly as a resource for the community.\u00a0 There are thousands of strains available to researchers, each of which has a mutation in a single gene. And there are many P-elements, which have different components, for different purposes. For example, some have the <em>lacZ<\/em> gene as a reporter, while others have <em>GFP<\/em>; this is in addition to <em>white<sup>+<\/sup><\/em>.\r\n\r\nThe value of having a P element induced mutation rather than chemical or radiation induced, is that it is fairly simple to figure out what gene the P-element is inserted into. There are two methods we might use, depending on which P-element is inserted into the gene.\r\n\r\nThe first method is called <span style=\"color: #0000ff\"><strong>inverse PCR.<\/strong><\/span> It involves isolating the genomic DNA of individuals with the P-element mutation and cutting the DNA with an enzyme that cuts in a known location inside the P-element.\u00a0 The DNA is then ligated in a very large volume with a fairly small amount of DNA. We want the ligation mixture to be very dilute so that when the ligase glues two ends together they are most likely going to be the two ends of the same molecule. This generates a whole bunch of circles of DNA. A very few of these contain part of the P-element and a little bit of the sequence that was right beside the element.\u00a0 You set up a PCR reaction using a bit of the ligation mix and you use two primers that recognize the P-element and are directed outward toward the flanking DNA. A band is produced that can then be cloned into a plasmid, which is then used to transform cells. Colonies are grown overnight and plasmid minipreps are made by alkaline lysis. The DNA is then sent for sequencing. Even if you only have a small amount of DNA sequence it is usually enough to determine exactly where the element is located in the DNA of the fly.\r\n\r\n&nbsp;\r\n\r\n<span style=\"color: #0000ff\"><strong>Plasmid rescue<\/strong><\/span> is a quicker technique but is only possible if your mutation was caused by the right kind of P-element. Some have been designed that have a bacterial origin of replication and an ampicillin resistance gene on them.\u00a0 In this case you again isolate the genomic DNA of flies with the mutation, and you do a restriction digest of the DNA with a restriction enzyme that cuts in a particular location in the P-element. You also do the very dilute ligation. The usefulness of this type of P-element is that some of the circles that are made in this ligation will have some of the P-element DNA, some of the flanking fruit fly DNA, an amp resistance gene and an origin of replication. In essence you have made a little plasmid containing the DNA you want to sequence (the flanking DNA). So the PCR, cloning, etc. steps can be skipped. Instead you take your ligation mix and transform a small amount into some <em>E. coli<\/em> cells. You plate the transformed cells on amp plates and only those that contain the plasmid will survive. Then you can select colonies and grow some up and make plasmid preps from them, just as above.\r\n\r\n&nbsp;\r\n\r\n<hr style=\"height: 5px;border-top: solid black\" \/>\r\n\r\n<h2><a id=\"B_homologous\"><\/a>B. Homologous Recombination:<\/h2>\r\nIn some organisms there is no convenient system for insertional mutagenesis. But we can use homologous recombination between the gene on the chromosome of a cell and an introduced construct to swap out the functional gene and replace it with a non-functional copy. In this case we know the exact sequence change that has been made to the gene and if we have removed the entire coding sequence we can be sure we have eliminated the gene function. The term <strong>recombination<\/strong> can refer both to crossing over and independent assortment. In this case we are referring specifically to crossover.\r\n\r\nThis requires a good understanding of the gene you are working with and knowledge of the sequence. The construct you design will contain segments of the DNA sequence to either side of the gene you want to knock out. But instead of the gene, you have an antibiotic resistance gene between the flanking sequences. Or in you might have a marker gene that produces a visible phenotype in individuals that have had their gene replaced by the marker. The image below will help explain the process. In this method we are taking advantage of the fact that crossing over can occur between two segments of DNA with homologous sequences. When two crossovers occur, the DNA between the two crossovers is \"swapped\".\u00a0 In this way we can replace the actual gene we are studying, with the sequence we have placed on the construct.\u00a0 The vector, with the wild type copy of the gene \"swapped onto\" it, does not persist in the cells.\r\n\r\n<img class=\"alignnone wp-image-650\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-755x1024.jpg\" alt=\"\" width=\"412\" height=\"559\" \/>\r\n\r\nIn the image, the vector is shown, with the antibiotic resistance gene (in purple) flanked by sequences to either side of the gene of interest. At least 2000 base pairs of flanking sequence is needed for this technique to work and many constructs that have been successfully used have quite a bit more- 6000 to 14000 nucleotides.\u00a0 Having a large amount of flanking sequence is needed to increase the chances of crossovers occurring. You need two crossovers to occur quite close to each other and this is a rare event. The process works well in many prokaryotes, in yeast, in mice and flies. But it does not work at all in plants or in human cells, which is unfortunate because if you think about it, this is a technique that could be used not only to replace a wild type (normal) copy of a gene with a different sequence in order to knock out the gene. It could do the reverse too - replace a mutated copy of a gene that is causing an inherited disease, with the wild type version - a form of gene therapy.\r\n\r\n&nbsp;\r\n\r\nHowever, this is less disappointing than it used to be because we now have the CRISPR system of gene editing, which shows great potential for gene therapy and appears to work well in all organisms studied.\u00a0 The use of CRISPR for gene knockout will be described below and we may touch on it again towards the end of the semester when we talk about the use of genetic engineering in human medicine.\u00a0 The first self test question below is about this figure:\r\n\r\n<img class=\"alignnone size-full wp-image-1074\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice.png\" alt=\"\" width=\"312\" height=\"89\" \/>\r\n\r\n[h5p id=\"32\"]\r\n\r\n[h5p id=\"33\"]\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-3.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=3fl3n8c-Ryw&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n<hr style=\"height: 5px;border-top: solid black\" \/>\r\n\r\n<h2><a id=\"C_RNAi\"><\/a>C. RNAi:<\/h2>\r\nRNAi\u00a0 (RNA interference) is yet another process we use in genetic engineering that is borrowed from nature. It is a natural process among many eukaryotes. Once we understood how it works to reduce expression of a gene, we learned how to use it to perform targeted reduction of our genes of interest. And when I say \"we\" I specifically mean other people who figured this out. I am not one of those people.\r\n<h3><a id=\"C1_history\"><\/a>C-1. A little history<\/h3>\r\nI know this history very informally from the perspective of the fly research community.\u00a0 In the 1990s people were trying to knock out gene function by injecting anti-sense RNA into for example fly embryos, and then looking for phenotypes. The theory was that the antisense RNA would bind to the sense RNA in the cells and would prevent its translation. Thus we would get no functional protein and we could see the results of directly targeting a particular gene for inactivation.\u00a0 It was variably effective; sometimes it seemed to work well and sometimes less well.\u00a0 Then I was told at a meeting that someone figured out that the effect was stronger when both sense and antisense RNA was injected into the embryos. They had made a mistake by introducing both (by not linearizing the plasmid before transcribing the gene; see <span style=\"text-align: initial;font-size: 1em\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7a-analyzing-gene-function1-gene-expression\/#B1i_making\">Chapter 7<\/a>) and thought the experiment would not work at all. But it did and the effect was considerably stronger than just injecting the antisense RNA alone. This did not at first make much sense when we were talking about it then, but a few years later, the work of Fire and Mello, nematode researchers, showed that introducing double stranded RNA initiated a process that culminated in destruction of the sense RNA in a cell, via a process they called RNA interference. They published their work on this in 1998 and got the Nobel Prize for it in 2006. Of course, it turns out that plant researchers had also been circling around the same topic and had called it post-transcriptional gene silencing (PTGS). The first report of it in plants was published in 1990.\u00a0 So multiple research groups were figuring out the same process at around the same time.<\/span>\r\n\r\nThe point about this method is that first off, somehow the introduction or production of double stranded RNA that corresponded to a particular gene led to the destruction of the normal mRNA from that gene and thus the gene's function- the protein product - was eliminated or greatly reduced in amount. Second, the phenotype that results may tell you something about what the gene is needed for. This is an example of a\u00a0 loss of function mutation. Imagine there is a factory that builds wheelbarrows. There are 12 people that go into the factory every day and each does something to build the wheelbarrow. Everyone does their job. Nobody covers for anyone else. Now imagine that one of the people gets \"inactivated\" somehow- let's imagine it is an impromptu vacation rather than something more sinister- and you are watching the wheelbarrows come out of the factory and they are missing the grips on the handles. What do you conclude? You can tell what the job of the absent worker was; their job was to install the part that is missing. This is very simplistic but gives an idea of how a loss of function mutation might suggest the function of a gene.\r\n\r\nRNAi allows you to decide which gene to knock out, rather than trying to make random mutations and then find one in the gene you are studying. If you have done RNA <em>in situ<\/em> hybridization and\/or made a reporter construct for your gene you will know where the gene is expressed and so you will know where to look for the phenotypes: the physical or physiological or health related etc. effects of reducing or eliminating the gene's function.\r\n\r\n[h5p id=\"36\"]\r\n\r\n&nbsp;\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"C2_how\"><\/a>C-2. How it works<\/h3>\r\nRNAi is a process that works in most eukaryotes. It seems to be a defence mechanism against RNA viruses. Double stranded RNA is not normal in a cell so when it is detected, it is a sign that a retrovirus may have infected the cell. A protein called dicer, which acts as a <strong><span style=\"color: #0000ff\">homodimer<\/span><\/strong> (two identical subunits combine to perform the function) binds to double-stranded RNA in a non-sequence-specific way. That means it doesn't recognize any particular sequence but just binds to any ds-RNA. Dicer is a type III RNA endonuclease. The size of the dimer dictates where the enzyme cuts the RNA. It cuts to either side of the complex leading to ~22 bp pieces of ds-RNA. The sizes vary slightly among organisms.\r\n\r\nThen a complex called RISC (<strong>R<\/strong>NA <strong>I<\/strong>nduced <strong>S<\/strong>ilencing <strong>C<\/strong>omplex) unwinds the ds-RNA pieces and hangs on to the antisense strand of the RNA. It uses this strand to target additional complementary RNA. When it finds a sequence that matches - the mRNA of the targeted gene- it cuts that RNA as well. The RNA endonuclease that does the cutting of the mRNA is called argonaut, but it is nicknamed \"slicer\".\r\n\r\n<em>I am not clear on how the RISC complex hangs on to one strand of the RNA, the guide strand (which is the antisense strand) and lets go of the other strand, the passenger strand (the sense strand).\u00a0 But in any case, it does retain the antisense strand and this is how it is able to target the mRNA sequence of the gene.<\/em>\r\n\r\nThere is a lot more to find out about the mechanism of RNAi if you are interested. It is also a mechanism that many organisms use to regulate their own genes. So some genes are transcribed but then at certain times in development, short anti-sense RNAs called microRNAs (miRNAs) bind to the mRNA and prevent its translation. They also target destruction of the message through dicer and the RISC complex because when they bind to the mRNA it creates ds-RNA, which triggers the RNAi process. It probably seems inefficient but it works.\u00a0 You will (or may have already) learn about it in Bisc 333.\r\n\r\nThere are human health connections as well. A form of macular degeneration is associated with an age-related down regulation of a form of dicer in the eye. There are sequences called Alu sequences in the human genome, and these are repetitive remnants of a retroviral infection in a human ancestor long ago. These are still expressed sometimes and are kept in check by miRNAs. When dicer is less expressed as the patient gets older, the Alu sequences begin to accumulate because there is a less efficient miRNA process to shut down their expression, and this accumulation leads to degeneration of the retina. I don't know exactly how the accumulation of these sequences brings about the retinal degeneration, but it is interesting. Perhaps reintroducing active dicer through gene therapy could help treat this form of blindness.\r\n\r\n&nbsp;\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"C3_Making_constructs_for_RNAi\"><\/a>C-3. Making constructs for RNAi<\/h3>\r\nConstructs that will be used for RNAi don't need to contain the entire sequence of the gene we are targeting, in fact it seems to work better if only part of the gene is used. A few hundred base pairs of sequence is usually enough.\u00a0 A standard length is 300 to 800 base pairs but for a gene I studied years ago, we used only about 80 base pairs because this was the largest section of the gene we could find that was unique, but I've since learned that many people have successfully used smaller sections of the gene they are targeting- even as small as 40 bp. It is probably somewhat related to the gene you are targeting. But whatever the size of your insert, the sequence you use must be unique to the gene you are studying.\u00a0 In the case of the gene I was interested in, there were two genes with very very similar sequence that performed slightly different functions in the flies during development.\u00a0 You might also be studying one member of a gene family and in this case many family members would have quite similar sequences. Sometimes the sequence chosen just has short stretches of sequence that are not unique to the gene we are targeting. You only need 22 nucleotides of such sequence to generate \"off-target\" effects. This means that the RNAi process will not only inactivate mRNA from the intended gene but also for another gene or genes which are not the intended target. You don't want the phenotype you see to result from knocking out multiple genes because you will not know this and will assume that the cool phenotype you observe is the result of knocking down or knocking out your intended gene. You can search the sequence you plan to use in your construct for short regions that match different genes than the intended one. If you find such matches, a different section of the gene must be used.\r\n\r\nThe construct you make must produce double stranded RNA. So the construct needs a promoter to transcribe the DNA that you insert into the vector. You can insert the same DNA sequence twice into the same vector, but in opposite orientation. The RNA polymerase will transcribe the first copy and will continue through the second copy- because this copy is in the opposite orientation, the RNA of the second half of the transcript will be the reverse complement of the first half. Therefore it can fold back on itself to product ds-RNA. We often include a short spacer between the two inserts to make the folding of the RNA a bit easier.\u00a0 Vectors have been designed with two cloning sites that can allow you to automatically put your inserts into the vector in the correct orientation.\u00a0 We will talk about this \"recombinase\" cloning method later in in the semester. We can use restriction enzyme cloning too, but it is a lot of work- we have to clone one insert into the vector at a time and we have to use different restriction enzymes for the two inserts so that we are only affecting one insert at a time. And! We always have to be really careful in planning our cloning procedure to ensure we have the inserts in opposite orientation. Below is an image of just part of such a construct, showing the two inserts which are the exact same sequence but in opposite orientation, and the promoter to one side and terminator to the other. The spacer between is to help the formation of a hairpin- double stranded RNA with a loop at one end. The loop may be degraded after the RNA folds into the hairpin structure.\r\n\r\n<img class=\"alignnone size-large wp-image-655\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-1024x175.jpg\" alt=\"\" width=\"1024\" height=\"175\" \/>\r\n\r\n<span style=\"text-align: initial;font-size: 1em\">The second approach is much easier. In this case your insert is cloned into the polylinker and there are two copies of the same promoter- one to either side of your insert.\u00a0 Once you've made the transgenic organism, the insert can be transcribed in both directions at the same time, making two complementary RNAs that will bind each other to become ds-RNA. The image below shows the two promoters to either side of the insert which can be in any orientation.\u00a0<\/span>\r\n\r\n&nbsp;\r\n\r\n<img class=\"alignnone size-full wp-image-659\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert.jpg\" alt=\"\" width=\"990\" height=\"222\" \/>\r\n\r\nRNAi can be done in a transient way as is often done using cultured cells. The double stranded RNA is made <em>in vitro<\/em> and then isolated and injected into the cells or into embryos (this is done in fly embryos sometimes).\u00a0 Or a plasmid is introduced that will transcribe the ds-RNA but the plasmid is not stable in the cells, and so will be degraded in a relatively short time. These approaches lead to only a transient assay because you introduce the RNA, and you then have a few hours to see the effect. It could be a change in cell behaviour or in gene expression or some other immediate result of the RNAi.\u00a0 After a few hours the effect diminishes as all the ds-RNA has been cut up and used in the response and\/or the plasmid has been degraded by the cell.\r\n\r\nTo do a longer term experiment the construct you've made to produce the ds-RNA is introduced into an organism and part of the plasmid is integrated into the genome of the organism. Then a strain or line of that organism that contains the construct is kept.\u00a0 We generally like to use inducible promoters in that case, so that we can turn on the production of ds-RNA at a defined time or place and observe the effects. In some organisms, we can use a promoter called the heat-shock promoter. Heat shock proteins are chaperones that help keep proteins from unfolding during high temperature or other types of stress in a cell. The promoters of these proteins are very strong and they induce a large amount of expression. The RNA polymerases and some other regulatory proteins are sitting on these promoters all the time, ready to start transcription the moment these proteins are needed- which by definition is going to be some type of emergency situation for the cell. So when there is a sudden temperature increase the heat shock genes are transcribed vigorously to produce lots of chaperone proteins and protect the cell from the damaging effects of the heat treatment.\u00a0 When we attach these promoters to other genes they will be highly transcribed under our control- when we heat shock the cells or organisms that carry the construct. In different systems, different inducible promoters may be used, but the point is the same- we can turn on the RNAi at specific times to see the effect of knocking out a gene at a particular time.\r\n\r\nSometimes the RNAi effect is not very strong. In that case you can sometimes introduce two different RNAi constructs into the organism, each targeting a different part of the gene. That can increase the effect. Also, you can introduce a construct that has extra copies of the dicer gene on it. Extra copies of the dicer gene mean more of the dicer protein and the construct should have dicer under the control of the same inducible promoter as your RNAi construct. So, when you induce the ds-RNAi for your gene of interest, at the same time you make a lot of extra dicer protein and this greatly enhances the RNAi response.\r\n\r\n[h5p id=\"37\"]\r\n\r\n[h5p id=\"38\"]\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-4.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=csPye3tPTW8&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n<hr style=\"height: 5px;border-top: solid black\" \/>\r\n\r\n<h2><a id=\"D_CRISPR\"><\/a>D. CRISPR:<\/h2>\r\n<h3><a id=\"D1_preamble\"><\/a>D-1. Preamble<\/h3>\r\nA few days ago the 2020 Nobel Prize for Chemistry was awarded to Jennifer Doudna and Emmanuelle Charpentier for their work on the CRISPR-Cas 9 system.\u00a0 They were studying how bacteria can protect themselves from a second viral infection after surviving the first infection and this curiosity-driven research led to the elucidation of a mechanism by which we could edit genes using the same system. Please read the following short article that describes a bit about the research and the researchers.\u00a0 It also contains a short scrollable description of how the whole process works that is easy to understand and nicely illustrated.\u00a0 Your next bioinformatics project is the production of oligonucleotides to make a CRISPR construct for a gene in the foxtail millet, <em>Setaria viridis<\/em>. <span style=\"background-color: #ffff00\">This will contribute toward a <span style=\"text-decoration: underline\">global research project<\/span> that I will tell you about next week.<\/span>\r\n\r\n<a href=\"https:\/\/theconversation.com\/what-is-crispr-the-gene-editing-technology-that-won-the-chemistry-nobel-prize-147695\" target=\"_blank\" rel=\"noopener noreferrer\">The Conversation: What is CRISPR, the gene editing technology that won the Chemistry Nobel prize?<\/a>\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"D2_bacteria\"><\/a>D-2. How bacteria do it<\/h3>\r\nBacteria that survive an infection by a bacteriophage (virus that infects bacteria) cut out and keep a small segment of the phage DNA and store it in a region of the bacterial chromosome that contains a set of short repeats. Between the repeats are the DNA records of previous viral infections- segments of viral DNA. The repeats are called <strong>C<\/strong>lustered <strong>R<\/strong>egularly <strong>I<\/strong>nterspaced <strong>S<\/strong>hort <strong>P<\/strong>alindromic <strong>R<\/strong>epeats. This is where the term CRISPR comes from.\u00a0 The DNA from the phage that is stored in the CRISPR sites is not just random sequence; it is very precise: it is 20 nucleotides of sequence that are found immediately upstream of a sequence: NGG.\u00a0 This is called a <strong><span style=\"color: #0000ff\">PAM<\/span><\/strong> sequence, which means <strong>p<\/strong>rotospacer <strong>a<\/strong>djacent\u00a0 <strong>m<\/strong>otif. The NGG sequence is not inserted in the CRISPR loci but it is in the corresponding phage DNA and is important for the bacterial response to infection later.\r\n\r\nWhen bacteria are infected again by the same type of phage - not the same bacterium but the descendants of the cell that survived the infection - the CRISPR loci are transcribed to make RNA. The RNA that comes from the previous phage DNA is the <strong><span style=\"color: #0000ff\">cr<\/span><span style=\"color: #0000ff\">RNA<\/span><\/strong><span style=\"color: #0000ff\">.<\/span> Another RNA is also transcribed, which is called the <span style=\"color: #0000ff\"><strong>tra<\/strong><strong>crRNA<\/strong>.<\/span>\u00a0 These are functional RNAs; they are not translated into protein but perform their functions as RNAs and they have regions where the two RNAs are complementary to each other and so can base pair with each other. A third gene is also transcribed and translated to produce the <span style=\"color: #0000ff\"><strong>Cas9<\/strong><\/span> protein. Cas means <strong>C<\/strong>RISPR <strong>As<\/strong>sociated protein 9.\u00a0 There are many of these in different systems but we'll stick to Cas9 for BISC 357.\u00a0 (It has had other names in the past: Cas5, Csn1, or Csx12 so if you come across these in your studies or in other courses, you will know that these also refer to the Cas9 protein)\r\n\r\nSo the bacteria are undergoing an infection and there are two RNAs and one protein produced in response. The tracrRNA has a region that binds to the Cas9 protein and the region of complementarity to the crRNA. It acts as a linker between the two.\u00a0 The crRNA is complementary to one strand of the phage DNA. So it binds to the phage DNA and the tracrRNA forms the link connecting the Cas9 protein to the crRNA and when Cas9 is in position on the phage DNA it cuts the phage DNA in a very precise location- 3 nucleotides upstream of the PAM sequence. It cuts both strands of the DNA to make a \"break\".\u00a0 The phage gene cannot be expressed and furthermore, the DNA is degraded by the bacterial exonucleases. This is where the PAM sequence is essential. The PAM sequence is unique to each type of Cas protein and is absolutely required for the protein to be able to cut the DNA. So even if the crRNA recognizes the 20 bases of target sequence in the phage genome and the tracrRNA connects the Cas9 protein to the crRNA, Cas9 must bind to the PAM sequence or it will be unable to undergo the conformational shift that activates the endonuclease function.\r\n\r\nBacteria are really quite remarkable.\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-5.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=LM0HaUEpT6o&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n<hr \/>\r\n\r\n<h3><a id=\"D3_how\"><\/a>D-3. How we use it: the principle of gene editing<\/h3>\r\nWhen we make constructs for CRISPR we use vectors that have everything we need except for the unique 20 bases of sequence that will target the gene we want to knock out.\u00a0 The gene for Cas9 is on the vector because the organisms we are studying don't have this protein. The gene for Cas9 has been modified to contain one or more <strong><span style=\"color: #0000ff\">nuclear localization signals<\/span> <\/strong>(NLSs). The protein must enter the nucleus to cut the target DNA so the NLS is required to direct the protein to the correct location to do its job. Bacteria have no nuclear envelope and so do not require NLSs on their Cas9 protein. The two genes for tracr RNA and crRNA have been fused. The \"overlap\" region is removed and the two parts of the RNA that are needed to direct Cas9 to the target DNA are transcribed as a <strong><span style=\"color: #0000ff\">single guide RNA<\/span><\/strong> (sgRNA). At one end is the 20 nucleotide sequence that is designed by the researcher and is specific for the gene of interest. The part that folds up and binds with Cas9 is the same for every gene, so this is an unvarying part of the construct. Both the Cas9 gene and the sgRNA gene are under the control of strong promoters that work in eukaryotes.\r\n\r\n&nbsp;\r\n\r\nThe constructs are introduced into organisms in a variety of ways, depending on the organism.\u00a0 Once inside the single guide RNA is expressed as is the Cas9 protein.\u00a0 They enter the nucleus and the sgRNA binds the target sequence on the gene of interest. The Cas9 protein interacts with the PAM sequence and changes conformation in order to\u00a0 cut the DNA 3 bases upstream of the NGG sequence.\r\n\r\nOnce a cut is made in the DNA sequence, several things can happen. An accurate repair system might repair the DNA. If this happens it might be cut again. This may happen repeatedly. But there are repair systems that are less accurate that can \"repair\" the DNA but that will introduce mistakes in the sequence in the process. These are the ones that will inactivate the gene.\r\n\r\n[h5p id=\"39\"]\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-6.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=JXF2HKOkYUw&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n<h4><a id=\"D3i_NHEJ\"><\/a>D3-i. Knock out - NHEJ repair<\/h4>\r\nWhen the <strong>n<\/strong>on-<strong>h<\/strong>omologous <strong>e<\/strong>nd <strong>j<\/strong>oining (<span style=\"color: #0000ff\"><strong>NHEJ<\/strong><\/span>) repair system acts, it reconnects broken DNA but not in an \"accurate\" way. This type of repair can cause insertion or deletions into the DNA at the site of the break and if multiple breaks occur along a chromosome, the \"wrong\" ends might be connected, to generate inversions or translocations. I always think of this repair system as the \"Emergency! Just glue the pieces back together, don't worry about getting it perfect!\" type of system.\u00a0 So in the case that we introduce our construct that targets our gene, in many cells we expect that the NHEJ repair system may repair the breaks that are induced in the gene. Consider that if even one nucleotide is added at the break, the entire gene is frame-shifted. Thus the correct protein won't be produced. Or if nucleotides are deleted and they are not in multiples of three we will also get a frameshift mutation. The mutations are called \"indel\" because either insertion or deletion of one or two nucleotides may occur and in both situations, a frameshift results. Frameshifts usually result in truncated proteins- you may recall from your gene annotation assignments, that all reading frames except the correct one contain lots of stop codons.\r\n\r\nThen we can look at the phenotype of the gene knockout to see the consequences of inactivating the gene.\u00a0 We will probably also collect DNA from the organism and sequence the gene to determine what the exact change in the sequence was. We may obtain several individuals with mutant phenotypes; they may not have exactly the same genetic change.\r\n<h4><a id=\"D3ii_HDR\"><\/a>D3-ii. Directed modification - HDR<\/h4>\r\nHomology dependent repair is accurate and sequence-based. If a break occurs in the DNA, proteins bind it, and generate single stranded sections on the ends of the break that then do a \"homology search\". The strands generally find the homologue (we are talking about diploid organisms here) and use that homologue to copy the correct sequence into the broken region.\r\n\r\nIn general this repair system would not result in a mutation of the gene we are targeting. However, if we provide some \"donor\" DNA along with the construct that we introduce into the research organism, we are essentially providing the homology dependent repair systems a piece of DNA to copy into the broken region. In this case we could introduce a reporter gene, or a piece of DNA with a particular type of mutation that we want to study. Suppose we thought a certain set of a few amino acids were the critical ones for a protein's function. We could provide a donor DNA sequence that lacks\u00a0 the codons for those amino acids, and the repair system could incorporate that into the gene. We could then check to see whether we were correct about the protein's function.\r\n\r\n<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-7.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.\r\n\r\n[embed]https:\/\/www.youtube.com\/watch?v=jHSe94TFZyY&amp;feature=youtu.be&amp;hd=1[\/embed]\r\n\r\n&nbsp;\r\n\r\nThere is a former student from 357 who also TAd the course a couple of years ago who is currently using this very approach to study heart disease in a zebra fish model for his PhD research.\r\n\r\nPerhaps you've already thought of the most interesting application of this approach: the possibility of cutting a target DNA sequence that is actually causing a disease or condition in a person, and replacing that mutated sequence with the wild type form, in order to treat the condition. This has already been tried with sickle cell anemia, and within the last few years, there were reports of a person who had been treated for sickle cell anemia using CRISPR. In sickle cell anemia the hemoglobin gene has a mutation in it that causes the hemoglobin molecules to bind incorrectly to each other. They don't carry oxygen as effectively as regular hemoglobin and they cause a deformation of the red bloods cells that causes the cells to get stuck in small capillaries sometimes. This can cause severe pain and because the oxygen carrying capacity is low, people with this condition have low energy. To treat this condition, stem cells were collected from the woman with the condition. The CRISPR treatment was performed on the cells to inactivate a gene called <em>BCL11A. <\/em>This gene shuts off production of fetal hemoglobin a few months after a baby is born. Inactivating this gene in the adult woman's stem cells allowed the fetal hemoglobin\u00a0 gene to turn back on in those cells. The woman underwent a round of chemotherapy, presumably to kill the stem cells in her bone marrow and then the CRISPR-treated stem cells were reintroduced into her body. Producing large amounts of the fetal hemoglobin molecule (which is wild type) prevents the sickling of the blood cells. As a bonus, the fetal form of hemoglobin has higher oxygen carrying capacity than the adult form. The person who was treated seems to be in good health, a couple of years after the treatment.\u00a0 There is a short article about this, from July of 2022 linked below:\r\n\r\n&nbsp;\r\n\r\n<a href=\"https:\/\/www.healthline.com\/health-news\/first-person-treated-for-sickle-cell-disease-with-crispr-is-doing-well\" target=\"_blank\" rel=\"noopener noreferrer\">healthline: First Person Treated for Sickle Cell Disease with CRISPR Is Doing Well<\/a>\r\n<h4><em><a id=\"D3iii_Designing_oligonucleotides\"><\/a>D3-iii. Designing oligonucleotides that will generate an overhang for cloning<\/em><\/h4>\r\nThis is not specific to CRISPR, but is an approach we are going to use in our CRISPR bioinformatics exercise, so it is a good place to introduce it.\u00a0 We design oligonucleotides that are complementary to each other, EXCEPT for four nucleotides at the 5' ends of the oligos.\u00a0 We are going to clone into a vector cut with a special enzyme which cuts OUTSIDE of the recognition sequence so that the overhang generated is different at each site, depending on what sequence is near the recognition site. Our vector has two restriction sites (<em>Bsa<\/em>I) and the overhangs generated are not complementary to each other. Thus the vector cannot re-circularize during ligation.\r\n\r\nTo make the insert, we take equimolar amounts (the same # of molecules) of each oligo and combine them in a PCR tube. We heat to 96C for about 5 minutes and then allow the temperature to decrease very gradually. This allows the complementary parts of the forward and reverse oligos to find each other and to bind. The 5' ends stick out (see below) and the sequences that are sticking out are complementary to the overhangs in the vector.\u00a0 There is more information in the lecture on CRISPR, the bioinformatics assignment in which you design the oligos for this project, the lecture about the project we're working on, as well as your lab handout, later in the semester. In the image below, the squiggly lines at the 5' ends of the annealed oligos are the 4-bp sequences that match the insertion site in the cut vector.\r\n\r\n<img class=\"alignnone size-full wp-image-1082\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo.png\" alt=\"\" width=\"418\" height=\"77\" \/>\r\n\r\n<hr \/>\r\n<p style=\"text-align: left\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-8-investigating-gene-function-3-expression-constructs\/\">Previous (Chapter 8)<\/a><span style=\"float: right\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-10-sanger-sequencing\/\">Next (Chapter 10)<\/a><\/span><\/p>","rendered":"<h1 style=\"text-align: center\">Introduction<\/h1>\n<p>In this section we will look at ways to go beyond detecting when and where a gene is expressed; we will explore ways of reducing or eliminating the gene&#8217;s expression, and using the resulting phenotype to deduce the likely role of the gene product. Understanding when and where the gene is expressed (discussed in <a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7a-analyzing-gene-function1-gene-expression\/#B_When_and_where\">Chapter 7<\/a> and <a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7b-investigating-gene-function-3-expression-constructs\/#A1_what\">Chapter 8<\/a>) will help us know where we should expect to see the phenotype resulting from reduced or no expression of the gene. It is an important first step in elucidating gene function.\u00a0 Note that we could be surprised, however. Sometimes despite the knowledge we have accumulated about a gene, we find it works in an unexpected way &#8211; and produces a phenotype we would not have predicted. This is what is exciting about science- we don&#8217;t really know the answer until the experiments are done.<\/p>\n<p>&nbsp;<\/p>\n<p>We will cover four means of eliminating or reducing gene expression here:\u00a0 insertional mutagenesis, homologous recombination, RNAi, and CRISPR-Cas9. There will be the strongest focus on the last two of these, both of which are borrowed from bacteria. We will consider how they work in nature, protecting cells from infection and then how they are used in analysis of gene function.\u00a0 The approaches used to make the constructs for these applications will also be described.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox shaded\">\n<h2 style=\"margin-top: 0em\">Contents<\/h2>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#Learning_Outcomes\">Learning Outcomes<\/a><br \/>\n<a href=\"#terminology\">Terminology before we begin<\/a><br \/>\n<a href=\"#A_insertional\">A. Insertional Inactivation<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#A1_Ti\">A-1. Ti Plasmids in Plants<\/a><br \/>\n<a href=\"#A2_p_element\">A-2. P elements in fruit flies<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#B_homologous\">B. Homologous Recombination<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#C_RNAi\">C. RNAi<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#C1_history\">C-1. A little history<\/a><br \/>\n<a href=\"#C2_how\">C-2. How it works<\/a><br \/>\n<a href=\"#C3_Making_constructs_for_RNAi\">C-3. Making constructs for RNAi<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 0em\"><a href=\"#D_CRISPR\">D. CRISPR<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 2em\"><a href=\"#D1_preamble\">D-1. Preamble<\/a><br \/>\n<a href=\"#D2_bacteria\">D-2. How bacteria do it<\/a><br \/>\n<a href=\"#D3_how\">D-3. How we do it: making CRISPR constructs<\/a><\/p>\n<p style=\"margin-top: 0em;margin-bottom: 0em;margin-left: 4em\"><a href=\"#D3i_NHEJ\">D3-i. Knock out &#8211; NHEJ repair<\/a><br \/>\n<a href=\"#D3ii_HDR\">D3-ii. Directed modification &#8211; HDR<\/a><br \/>\n<a href=\"#D3iii_Designing_oligonucleotides\">D3-iii. Designing oligonucleotides that will generate an overhang for cloning<\/a><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/back-matter\/recorded-lecture-videos\/#8\">List of lecture videos (excluding supplemental videos)<\/a><\/p>\n<\/div>\n<hr style=\"height: 5px;border-top: solid black\" \/>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<h2 class=\"textbox__title\" style=\"margin-top: 0em;margin-bottom: 0em\"><a id=\"Learning_Outcomes\"><\/a>Learning Outcomes<\/h2>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li>\n<div><span style=\"text-decoration: underline\">Describe<\/span> the methods used to knock out or knock down gene function<\/div>\n<\/li>\n<li><span style=\"text-decoration: underline\">Distinguish<\/span> between forward and reverse genetic approaches<\/li>\n<li>\n<div><span style=\"text-decoration: underline\">Distinguish<\/span> between the techniques \u2013 used in different organisms, different approaches etc.<\/div>\n<\/li>\n<li>\n<div><span style=\"text-decoration: underline\">Describe<\/span> how we make constructs for RNAi and how RNAi works<\/div>\n<\/li>\n<li><span style=\"text-decoration: underline\">Describe<\/span> how to make a CRISPR construct through the method we will use in Bioinformatics Assignment#2<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<hr style=\"height: 5px;border-top: solid black\" \/>\n<h2><a id=\"terminology\"><\/a>Terminology before we begin:<\/h2>\n<p><span style=\"color: #0000ff\"><span style=\"color: #000000\">It is important to learn a bit of terminology before we begin.\u00a0 <span style=\"color: #0000ff\"><strong>Forward genetics<\/strong><\/span> is a process by which we induce many mutations at random and then after we&#8217;ve done all the work of isolating the mutations, we have to still figure out which mutations we want to study, and then do a lot of analysis to find out which gene is mutated<\/span><strong>.\u00a0 Reverse genetics <\/strong><span style=\"color: #000000\">i<\/span><span style=\"color: #000000\">s an approach by which we identify what gene we want to study and target it specifically in some way. We then see what the effect is of altering the expression of the gene in order to determine what its function is.\u00a0 This has only been possible since the sequencing of the genomes <\/span><span style=\"color: #000000\">of many model organisms.\u00a0 Both approaches have value but reverse genetics approaches are more targeted so we know exactly what changes we are making in the gene which helps us interpret the results. Forward genetics can produce mutations that you would never have designed on purpose and that produces an effect that is unexpected or unpredictable. But it is much more work than reverse genetics<\/span><strong>. <\/strong><span style=\"color: #000000\">We will be looking at one forward genetics approach and multiple reverse genetics approaches in this chapter.<\/span><\/span><\/p>\n<p><span style=\"color: #0000ff\"><strong>Loss of function<\/strong><\/span> mutations are mutations that reduce or eliminate a gene&#8217;s function. In this case we may be eliminating the gene itself or preventing it from producing any of its protein product. These are examples of <strong><span style=\"color: #0000ff\">amorphic<\/span><\/strong> mutations; ones that are not producing any of the protein product they normally make.\u00a0 In other cases we might reduce the transcription of the gene but not completely eliminate it. Or a protein could be produced that is missing some amino acids or has some incorrect amino acids and though it is not very active, it is able to perform its function a little bit. These are examples of <strong><span style=\"color: #0000ff\">hypomorphic<\/span><\/strong> mutations, in these mutations the function of the gene is reduced but there is still some protein that somewhat does the job. To really understand the function of a gene we want to see what the phenotype of the amorphic mutation is but we like to use hypomorphic mutations for other purposes- both have their uses in genetics.<\/p>\n<p>&nbsp;<\/p>\n<p>There are also <strong><span style=\"color: #0000ff\">gain of function<\/span><\/strong> mutations in which &#8211; for example &#8211; a gene is more active than usual either because it is being transcribed at a higher rate than usual, or because the protein can&#8217;t be degraded or down-regulated when it is supposed to stop performing its function. There may not be a phenotype associated with this type of <strong><span style=\"color: #0000ff\">hypermorphic<\/span><\/strong> mutation, but there could be a very strong and informative effect. It is very dependent on what the gene&#8217;s product does. <span style=\"color: #0000ff\"><strong>Neomorphic<\/strong> <\/span>mutations involve the mis-regulation of the gene or a change in the protein that causes it to do something different from the wild type situation. In these situations a gene might be expressed in a stage or type of tissue where it is not normally expressed. Or, the protein might be modified somehow so that it interacts with other proteins it would not normally interact with in a cell. Or it might localize to a place where it is not supposed to be. Sometimes these changes are very impactful. A signalling molecule that is active in the wrong tissue can promote cell division when it is not appropriate for the cells to be dividing- this could lead to tumour formation.<\/p>\n<p>&nbsp;<\/p>\n<div id=\"h5p-31\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-31\" class=\"h5p-iframe\" data-content-id=\"31\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Type of mutations: drag the correct term to the space beside each mutation description\"><\/iframe><\/div>\n<\/div>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-1.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-1\" title=\"chapter 8 part 1 introduction and terminology\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/kxfzpby29x8?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<hr \/>\n<h2><a id=\"A_insertional\"><\/a>A. Insertional Mutagenesis:<\/h2>\n<p>One way to knock out a gene&#8217;s function is to make a mutation in it. There are various ways to cause changes in DNA sequence of genes. You can treat the organisms with chemicals or radiation and then look for the phenotypes that result. Then you must do a lot of work to figure out what gene has been mutated to cause the phenotype you see.\u00a0 This is an example of <span style=\"color: #000000\">forward genetics<\/span><strong><span style=\"color: #0000ff\">.<\/span><\/strong> In this approach, mutations are induced (caused) at random and then you have to figure out what gene has been altered. A different way to make a mutation is to insert a segment of DNA into the gene. It is quite unlikely that adding a huge DNA sequence into a gene would leave the gene still functioning perfectly. This is because the gene will be frame-shifted. The gene can still be transcribed, but when the RNA is translated, at the point where the inserted DNA was transcribed, the amino acid sequence of the protein will be incorrect. And most likely there will be a stop codon in the sequence, leading to a truncated (shortened) protein that contains the wrong amino acids.\u00a0 \u00a0This is called <span style=\"color: #0000ff\"><strong>insertional inactivation<\/strong><\/span> and we have systems for using this approach in some of our commonly used model organisms. The insertion of the DNA into the genes to cause mutations is still random, but we can use our knowledge of the DNA we have introduced and the power of PCR to more easily determine which gene has been altered. We will talk about 2 types of insertional inactivation. The principles of these methods apply to other methods in other organisms you might learn about in other courses. (<span style=\"color: #993366\"><strong>Note: For this semester, I am going to leave out the second part of this section, part A-2 about fruit flies<\/strong>)<\/span><\/p>\n<div id=\"h5p-34\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-34\" class=\"h5p-iframe\" data-content-id=\"34\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Insertional mutagenesis 1\"><\/iframe><\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<hr \/>\n<h3><a id=\"A1_Ti\"><\/a>A-1. Ti Plasmids in Plants<\/h3>\n<p>There is a type of soil dwelling bacterium that can infect plants and cause large &#8220;galls&#8221; which are a type of benign tumour. The tissues in these &#8220;galls&#8221; are reprogrammed by the bacterium to produce a protected place for the bacteria to live, and to also produce certain amino acids and other nutrients that the bacteria consume. The bacterium, <em>Agrobacterium tumefaciens,\u00a0<\/em> contains a very large circular plasmid, 140 kb to 235 kb in size, called the Ti plasmid. The plasmid has features you would expect of any plasmid, such as an origin of replication. It also has a region called T-DNA which actually causes the formation of the tumour.\u00a0 During infection of the plant tissue, a copy of this T-DNA is inserted into the genome of the plant (the nuclear genome). In nature, this T-DNA has genes that encode enzymes that produce hormones in the host plant. It is this manipulation of the plant&#8217;s own hormones that leads to the formation of the tumour, called a crown gall.<\/p>\n<p>There is also a region on the T-DNA responsible for synthesis of opines. These are unusual derivatives of sugars or amino acids and they provide nutrients to the bacteria living in the tumour tissue. The Ti plasmid also has genes on it that are needed for using the opines as a source of nutrition. These genes are not transferred to the plant. They are needed only by the bacteria.<\/p>\n<p>The T-DNA has a short sequence (about 25 bp) to either side of it; these are called LB for left border and RB for right border.\u00a0 These sequences are necessary for the transfer of a copy of the T-DNA to the host plant&#8217;s genomic DNA. The Ti plasmid (but not the T-DNA) also has a<span style=\"text-align: initial;font-size: 1em\"> virulence region; this is where the genes necessary to allow the bacterium to infect the host cell are found. There are some other regions on these huge plasmids that are not relevant for this topic. I am adding a simple diagram of the Ti plasmid from wikipedia. Pay attention to the LB and RB sequences, what is in the T-DNA part of the plasmid and what is not. The genes in the T-DNA segment are the ones that are copied and moved into the plant&#8217;s genome while the ones that are not in this region are functional in the bacterium only.<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/upload.wikimedia.org\/wikipedia\/commons\/d\/d1\/Ti_plasmid.svg\" alt=\"Ti plasmid - Wikipedia\" width=\"403\" height=\"305\" \/><\/p>\n<p>As with so many of our genetic engineering techniques, researchers have found something operating in nature and have figured out how to use it in research.\u00a0 The Ti plasmid has been modified for use in the genetic engineering of plants.\u00a0 We&#8217;ll talk later in the semester about some details of how the plasmid is used but for now the main thing to know is that one of the changes made is to remove the genes relating to plant hormone regulation and opine metabolism and to add a gene for antibiotic resistance. The plasmid can be introduced into plants using the antibiotic resistance as a selection mechanism.\u00a0 It is more complex than it sounds, and again, we will talk about the details of how the DNA gets into the plants and the process of selection etc. later in the semester.\u00a0 The result of transforming plant cells on a large scale is that many tens of thousands of lines of plants (perhaps by now, hundreds of thousands!) have been generated, each with a unique mutation in it, caused by an insertion of a large segment of T-DNA.<\/p>\n<p>So far this is a lot like other forward genetics techniques but the work needed to try to figure out which gene has been altered by the mutation is much less than when genes are mutated by chemicals or radiation. This is because we know the exact DNA sequence of the T-DNA. And we can use this knowledge to design a primer to sequence the DNA of plants with a T-DNA insert into a gene. The primer recognizes the T-DNA sequence and is directed outward towards the gene sequence. This is a way of using the sequence we know &#8211; the T-DNA &#8211; to identify the sequence we don&#8217;t know &#8211; the plant gene.<\/p>\n<div id=\"h5p-35\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-35\" class=\"h5p-iframe\" data-content-id=\"35\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"TI plasmid 1\"><\/iframe><\/div>\n<\/div>\n<p>Below is the recorded lecture for this topic (<a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-2.pptx\">click here for the powerpoint slides<\/a>):<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-2\" title=\"Chapter 8 recording 2, T DNA insertions into genes\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/DVbLeKi4Xyk?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<hr \/>\n<h3><a id=\"A2_p_element\"><\/a>A-2. P-elements in fruit flies<\/h3>\n<p><span style=\"color: #993366\"><strong>NOTE: for now, we will leave out this section. In many ways it is very similar to the plant example already explained. I&#8217;ve written it and it will stay in because it might be used in a later offering of the course. But you can ignore section A-2 this time.<\/strong><\/span><\/p>\n<p>In fruit flies, the transposable element called the P-element is used to create mutations and to make transgenic flies for other reasons.\u00a0 The P-element is remarkable: it is a gene which has the ability to move from place to place in the genome. The single gene on a P-element encodes transposase which is the name of the enzyme that cuts the DNA to either side of the element and inserts it into a new place in the genome.<\/p>\n<p>P-elements were introduced into <em>Drosophila melanogaster<\/em> (the fruit fly most used in genetics research) probably no more than about 200 years ago. In the early days of fly research, flies isolated from the wild lacked P-elements and by the 1970 virtually every strain established from a wild population contained plenty of these elements. This is a tremendously rapid evolutionary change from the element being extremely rare in the 1920s to being practically ubiquitous by the 1970s.<\/p>\n<p>It perhaps won&#8217;t surprise you to learn that P-elements have been modified from their wild form for use in fly research.\u00a0 For instance, the P-element has been transformed into a reporter construct, by putting a GFP gene on it, with the appropriate regulatory sequences and some of you who have taken BISC 302W already know quite a bit about all the interesting research being one with P-elements. Here we will just briefly outline their use in forward genetics approaches.<\/p>\n<p>Like the Ti plasmid in plants, the P-element can be introduced into flies and allowed to insert into genes causing mutant phenotypes. In the fly community a concerted effort went into doing genetic screening on a large scale to try to produce a mutation in every single gene of the fly. Each mutation was of course kept as a separate line of flies that all have the same P-element induced mutation.<\/p>\n<p>For genetic screening, the P-elements have had their transposase gene removed and replaced with the <em>white<sup>+<\/sup><\/em> gene. The + means that it is the wild type version of the gene. Fly genes are named backwards so the <em>white<sup>+<\/sup><\/em> gene is required to make the wild type red eye of the fly.<\/p>\n<p>The P-element is introduced into flies by injecting a plasmid that contains the P-element (with its <em>white+<\/em> reporter gene) into the posterior region of a very early embryo. At this stage the embryo is one big cell with many nuclei- the nuclei divide but the cell doesn&#8217;t.\u00a0 The posterior region is where we introduce the DNA because we are hoping it will be incorporated into a nucleus that goes on to form a germ cell. This will then give rise to gametes which may have the P-element somewhere in the genome. The injection is quite finicky- a very thin and very sharp needle is used to make the tiniest hole possible in the embryo. You don&#8217;t want to damage it. We inject the plasmid that contains the P-element and another plasmid that has the transposase gene on it. The transposase enzyme will cut the P-element out of the plasmid and insert it somewhere in the genome of some of the nuclei in the embryo.\u00a0 The flies we use for producing the embryos for injection are all mutant for the <em>white<\/em> gene. They have white eyes.<\/p>\n<p>After the injection process we allow the embryos to recover and develop into adults. All the adults that develop have white eyes. That is because most won&#8217;t have a P-element in any of their cells. And those that do have the P-element incorporated into some of their cells won&#8217;t have it in the eyes, which develop from anterior cells in the embryo.\u00a0 The flies though are mated to either females or males that also have white eyes. Most of the progeny of these crosses have white eyes but some have red or orange or pinkish eyes. These flies developed from an gamete that had the P-element incorporated into the DNA of a germ cell.\u00a0 All of their cells now have the P-element in that position on that chromosome. And the <em>white<sup>+<\/sup><\/em> gene is being transcribed and translated from the P-element. If it is expressed a lot the eye is nearly wild type but if it is expressed just a little the eye might be just pinkish, very pale.<\/p>\n<p>Once we have a P-element containing strain, we can do large scale genetic screens where we cause the element to mobilize randomly around the genome and we then collect a large number of individuals that have P element induced mutations. Each individual is used to make a strain of flies that all have the same mutation as the original fly. A consortium of researchers initiated a project to make a P-element mutation in every single gene of the fly as a resource for the community.\u00a0 There are thousands of strains available to researchers, each of which has a mutation in a single gene. And there are many P-elements, which have different components, for different purposes. For example, some have the <em>lacZ<\/em> gene as a reporter, while others have <em>GFP<\/em>; this is in addition to <em>white<sup>+<\/sup><\/em>.<\/p>\n<p>The value of having a P element induced mutation rather than chemical or radiation induced, is that it is fairly simple to figure out what gene the P-element is inserted into. There are two methods we might use, depending on which P-element is inserted into the gene.<\/p>\n<p>The first method is called <span style=\"color: #0000ff\"><strong>inverse PCR.<\/strong><\/span> It involves isolating the genomic DNA of individuals with the P-element mutation and cutting the DNA with an enzyme that cuts in a known location inside the P-element.\u00a0 The DNA is then ligated in a very large volume with a fairly small amount of DNA. We want the ligation mixture to be very dilute so that when the ligase glues two ends together they are most likely going to be the two ends of the same molecule. This generates a whole bunch of circles of DNA. A very few of these contain part of the P-element and a little bit of the sequence that was right beside the element.\u00a0 You set up a PCR reaction using a bit of the ligation mix and you use two primers that recognize the P-element and are directed outward toward the flanking DNA. A band is produced that can then be cloned into a plasmid, which is then used to transform cells. Colonies are grown overnight and plasmid minipreps are made by alkaline lysis. The DNA is then sent for sequencing. Even if you only have a small amount of DNA sequence it is usually enough to determine exactly where the element is located in the DNA of the fly.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"color: #0000ff\"><strong>Plasmid rescue<\/strong><\/span> is a quicker technique but is only possible if your mutation was caused by the right kind of P-element. Some have been designed that have a bacterial origin of replication and an ampicillin resistance gene on them.\u00a0 In this case you again isolate the genomic DNA of flies with the mutation, and you do a restriction digest of the DNA with a restriction enzyme that cuts in a particular location in the P-element. You also do the very dilute ligation. The usefulness of this type of P-element is that some of the circles that are made in this ligation will have some of the P-element DNA, some of the flanking fruit fly DNA, an amp resistance gene and an origin of replication. In essence you have made a little plasmid containing the DNA you want to sequence (the flanking DNA). So the PCR, cloning, etc. steps can be skipped. Instead you take your ligation mix and transform a small amount into some <em>E. coli<\/em> cells. You plate the transformed cells on amp plates and only those that contain the plasmid will survive. Then you can select colonies and grow some up and make plasmid preps from them, just as above.<\/p>\n<p>&nbsp;<\/p>\n<hr style=\"height: 5px;border-top: solid black\" \/>\n<h2><a id=\"B_homologous\"><\/a>B. Homologous Recombination:<\/h2>\n<p>In some organisms there is no convenient system for insertional mutagenesis. But we can use homologous recombination between the gene on the chromosome of a cell and an introduced construct to swap out the functional gene and replace it with a non-functional copy. In this case we know the exact sequence change that has been made to the gene and if we have removed the entire coding sequence we can be sure we have eliminated the gene function. The term <strong>recombination<\/strong> can refer both to crossing over and independent assortment. In this case we are referring specifically to crossover.<\/p>\n<p>This requires a good understanding of the gene you are working with and knowledge of the sequence. The construct you design will contain segments of the DNA sequence to either side of the gene you want to knock out. But instead of the gene, you have an antibiotic resistance gene between the flanking sequences. Or in you might have a marker gene that produces a visible phenotype in individuals that have had their gene replaced by the marker. The image below will help explain the process. In this method we are taking advantage of the fact that crossing over can occur between two segments of DNA with homologous sequences. When two crossovers occur, the DNA between the two crossovers is &#8220;swapped&#8221;.\u00a0 In this way we can replace the actual gene we are studying, with the sequence we have placed on the construct.\u00a0 The vector, with the wild type copy of the gene &#8220;swapped onto&#8221; it, does not persist in the cells.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-650\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-755x1024.jpg\" alt=\"\" width=\"412\" height=\"559\" srcset=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-755x1024.jpg 755w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-221x300.jpg 221w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-768x1042.jpg 768w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-1132x1536.jpg 1132w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-65x88.jpg 65w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-225x305.jpg 225w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8-350x475.jpg 350w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/image-for-HR-chapter-8.jpg 1308w\" sizes=\"auto, (max-width: 412px) 100vw, 412px\" \/><\/p>\n<p>In the image, the vector is shown, with the antibiotic resistance gene (in purple) flanked by sequences to either side of the gene of interest. At least 2000 base pairs of flanking sequence is needed for this technique to work and many constructs that have been successfully used have quite a bit more- 6000 to 14000 nucleotides.\u00a0 Having a large amount of flanking sequence is needed to increase the chances of crossovers occurring. You need two crossovers to occur quite close to each other and this is a rare event. The process works well in many prokaryotes, in yeast, in mice and flies. But it does not work at all in plants or in human cells, which is unfortunate because if you think about it, this is a technique that could be used not only to replace a wild type (normal) copy of a gene with a different sequence in order to knock out the gene. It could do the reverse too &#8211; replace a mutated copy of a gene that is causing an inherited disease, with the wild type version &#8211; a form of gene therapy.<\/p>\n<p>&nbsp;<\/p>\n<p>However, this is less disappointing than it used to be because we now have the CRISPR system of gene editing, which shows great potential for gene therapy and appears to work well in all organisms studied.\u00a0 The use of CRISPR for gene knockout will be described below and we may touch on it again towards the end of the semester when we talk about the use of genetic engineering in human medicine.\u00a0 The first self test question below is about this figure:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1074\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice.png\" alt=\"\" width=\"312\" height=\"89\" srcset=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice.png 312w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice-300x86.png 300w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice-65x19.png 65w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/HR-in-mice-225x64.png 225w\" sizes=\"auto, (max-width: 312px) 100vw, 312px\" \/><\/p>\n<div id=\"h5p-32\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-32\" class=\"h5p-iframe\" data-content-id=\"32\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Homologous recombination\"><\/iframe><\/div>\n<\/div>\n<div id=\"h5p-33\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-33\" class=\"h5p-iframe\" data-content-id=\"33\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Homologous recombination question 2\"><\/iframe><\/div>\n<\/div>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-3.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-3\" title=\"Chapter 8 part 3 Homolgous recombination\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/3fl3n8c-Ryw?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<hr style=\"height: 5px;border-top: solid black\" \/>\n<h2><a id=\"C_RNAi\"><\/a>C. RNAi:<\/h2>\n<p>RNAi\u00a0 (RNA interference) is yet another process we use in genetic engineering that is borrowed from nature. It is a natural process among many eukaryotes. Once we understood how it works to reduce expression of a gene, we learned how to use it to perform targeted reduction of our genes of interest. And when I say &#8220;we&#8221; I specifically mean other people who figured this out. I am not one of those people.<\/p>\n<h3><a id=\"C1_history\"><\/a>C-1. A little history<\/h3>\n<p>I know this history very informally from the perspective of the fly research community.\u00a0 In the 1990s people were trying to knock out gene function by injecting anti-sense RNA into for example fly embryos, and then looking for phenotypes. The theory was that the antisense RNA would bind to the sense RNA in the cells and would prevent its translation. Thus we would get no functional protein and we could see the results of directly targeting a particular gene for inactivation.\u00a0 It was variably effective; sometimes it seemed to work well and sometimes less well.\u00a0 Then I was told at a meeting that someone figured out that the effect was stronger when both sense and antisense RNA was injected into the embryos. They had made a mistake by introducing both (by not linearizing the plasmid before transcribing the gene; see <span style=\"text-align: initial;font-size: 1em\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-7a-analyzing-gene-function1-gene-expression\/#B1i_making\">Chapter 7<\/a>) and thought the experiment would not work at all. But it did and the effect was considerably stronger than just injecting the antisense RNA alone. This did not at first make much sense when we were talking about it then, but a few years later, the work of Fire and Mello, nematode researchers, showed that introducing double stranded RNA initiated a process that culminated in destruction of the sense RNA in a cell, via a process they called RNA interference. They published their work on this in 1998 and got the Nobel Prize for it in 2006. Of course, it turns out that plant researchers had also been circling around the same topic and had called it post-transcriptional gene silencing (PTGS). The first report of it in plants was published in 1990.\u00a0 So multiple research groups were figuring out the same process at around the same time.<\/span><\/p>\n<p>The point about this method is that first off, somehow the introduction or production of double stranded RNA that corresponded to a particular gene led to the destruction of the normal mRNA from that gene and thus the gene&#8217;s function- the protein product &#8211; was eliminated or greatly reduced in amount. Second, the phenotype that results may tell you something about what the gene is needed for. This is an example of a\u00a0 loss of function mutation. Imagine there is a factory that builds wheelbarrows. There are 12 people that go into the factory every day and each does something to build the wheelbarrow. Everyone does their job. Nobody covers for anyone else. Now imagine that one of the people gets &#8220;inactivated&#8221; somehow- let&#8217;s imagine it is an impromptu vacation rather than something more sinister- and you are watching the wheelbarrows come out of the factory and they are missing the grips on the handles. What do you conclude? You can tell what the job of the absent worker was; their job was to install the part that is missing. This is very simplistic but gives an idea of how a loss of function mutation might suggest the function of a gene.<\/p>\n<p>RNAi allows you to decide which gene to knock out, rather than trying to make random mutations and then find one in the gene you are studying. If you have done RNA <em>in situ<\/em> hybridization and\/or made a reporter construct for your gene you will know where the gene is expressed and so you will know where to look for the phenotypes: the physical or physiological or health related etc. effects of reducing or eliminating the gene&#8217;s function.<\/p>\n<div id=\"h5p-36\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-36\" class=\"h5p-iframe\" data-content-id=\"36\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"RNAi 1\"><\/iframe><\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<hr \/>\n<h3><a id=\"C2_how\"><\/a>C-2. How it works<\/h3>\n<p>RNAi is a process that works in most eukaryotes. It seems to be a defence mechanism against RNA viruses. Double stranded RNA is not normal in a cell so when it is detected, it is a sign that a retrovirus may have infected the cell. A protein called dicer, which acts as a <strong><span style=\"color: #0000ff\">homodimer<\/span><\/strong> (two identical subunits combine to perform the function) binds to double-stranded RNA in a non-sequence-specific way. That means it doesn&#8217;t recognize any particular sequence but just binds to any ds-RNA. Dicer is a type III RNA endonuclease. The size of the dimer dictates where the enzyme cuts the RNA. It cuts to either side of the complex leading to ~22 bp pieces of ds-RNA. The sizes vary slightly among organisms.<\/p>\n<p>Then a complex called RISC (<strong>R<\/strong>NA <strong>I<\/strong>nduced <strong>S<\/strong>ilencing <strong>C<\/strong>omplex) unwinds the ds-RNA pieces and hangs on to the antisense strand of the RNA. It uses this strand to target additional complementary RNA. When it finds a sequence that matches &#8211; the mRNA of the targeted gene- it cuts that RNA as well. The RNA endonuclease that does the cutting of the mRNA is called argonaut, but it is nicknamed &#8220;slicer&#8221;.<\/p>\n<p><em>I am not clear on how the RISC complex hangs on to one strand of the RNA, the guide strand (which is the antisense strand) and lets go of the other strand, the passenger strand (the sense strand).\u00a0 But in any case, it does retain the antisense strand and this is how it is able to target the mRNA sequence of the gene.<\/em><\/p>\n<p>There is a lot more to find out about the mechanism of RNAi if you are interested. It is also a mechanism that many organisms use to regulate their own genes. So some genes are transcribed but then at certain times in development, short anti-sense RNAs called microRNAs (miRNAs) bind to the mRNA and prevent its translation. They also target destruction of the message through dicer and the RISC complex because when they bind to the mRNA it creates ds-RNA, which triggers the RNAi process. It probably seems inefficient but it works.\u00a0 You will (or may have already) learn about it in Bisc 333.<\/p>\n<p>There are human health connections as well. A form of macular degeneration is associated with an age-related down regulation of a form of dicer in the eye. There are sequences called Alu sequences in the human genome, and these are repetitive remnants of a retroviral infection in a human ancestor long ago. These are still expressed sometimes and are kept in check by miRNAs. When dicer is less expressed as the patient gets older, the Alu sequences begin to accumulate because there is a less efficient miRNA process to shut down their expression, and this accumulation leads to degeneration of the retina. I don&#8217;t know exactly how the accumulation of these sequences brings about the retinal degeneration, but it is interesting. Perhaps reintroducing active dicer through gene therapy could help treat this form of blindness.<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<h3><a id=\"C3_Making_constructs_for_RNAi\"><\/a>C-3. Making constructs for RNAi<\/h3>\n<p>Constructs that will be used for RNAi don&#8217;t need to contain the entire sequence of the gene we are targeting, in fact it seems to work better if only part of the gene is used. A few hundred base pairs of sequence is usually enough.\u00a0 A standard length is 300 to 800 base pairs but for a gene I studied years ago, we used only about 80 base pairs because this was the largest section of the gene we could find that was unique, but I&#8217;ve since learned that many people have successfully used smaller sections of the gene they are targeting- even as small as 40 bp. It is probably somewhat related to the gene you are targeting. But whatever the size of your insert, the sequence you use must be unique to the gene you are studying.\u00a0 In the case of the gene I was interested in, there were two genes with very very similar sequence that performed slightly different functions in the flies during development.\u00a0 You might also be studying one member of a gene family and in this case many family members would have quite similar sequences. Sometimes the sequence chosen just has short stretches of sequence that are not unique to the gene we are targeting. You only need 22 nucleotides of such sequence to generate &#8220;off-target&#8221; effects. This means that the RNAi process will not only inactivate mRNA from the intended gene but also for another gene or genes which are not the intended target. You don&#8217;t want the phenotype you see to result from knocking out multiple genes because you will not know this and will assume that the cool phenotype you observe is the result of knocking down or knocking out your intended gene. You can search the sequence you plan to use in your construct for short regions that match different genes than the intended one. If you find such matches, a different section of the gene must be used.<\/p>\n<p>The construct you make must produce double stranded RNA. So the construct needs a promoter to transcribe the DNA that you insert into the vector. You can insert the same DNA sequence twice into the same vector, but in opposite orientation. The RNA polymerase will transcribe the first copy and will continue through the second copy- because this copy is in the opposite orientation, the RNA of the second half of the transcript will be the reverse complement of the first half. Therefore it can fold back on itself to product ds-RNA. We often include a short spacer between the two inserts to make the folding of the RNA a bit easier.\u00a0 Vectors have been designed with two cloning sites that can allow you to automatically put your inserts into the vector in the correct orientation.\u00a0 We will talk about this &#8220;recombinase&#8221; cloning method later in in the semester. We can use restriction enzyme cloning too, but it is a lot of work- we have to clone one insert into the vector at a time and we have to use different restriction enzymes for the two inserts so that we are only affecting one insert at a time. And! We always have to be really careful in planning our cloning procedure to ensure we have the inserts in opposite orientation. Below is an image of just part of such a construct, showing the two inserts which are the exact same sequence but in opposite orientation, and the promoter to one side and terminator to the other. The spacer between is to help the formation of a hairpin- double stranded RNA with a loop at one end. The loop may be degraded after the RNA folds into the hairpin structure.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-655\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-1024x175.jpg\" alt=\"\" width=\"1024\" height=\"175\" srcset=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-1024x175.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-300x51.jpg 300w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-768x131.jpg 768w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-65x11.jpg 65w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-225x38.jpg 225w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct-350x60.jpg 350w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/RNAi-construct.jpg 1053w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><span style=\"text-align: initial;font-size: 1em\">The second approach is much easier. In this case your insert is cloned into the polylinker and there are two copies of the same promoter- one to either side of your insert.\u00a0 Once you&#8217;ve made the transgenic organism, the insert can be transcribed in both directions at the same time, making two complementary RNAs that will bind each other to become ds-RNA. The image below shows the two promoters to either side of the insert which can be in any orientation.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-659\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert.jpg\" alt=\"\" width=\"990\" height=\"222\" srcset=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert.jpg 990w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert-300x67.jpg 300w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert-768x172.jpg 768w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert-65x15.jpg 65w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert-225x50.jpg 225w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/10\/Two-promoters-one-insert-350x78.jpg 350w\" sizes=\"auto, (max-width: 990px) 100vw, 990px\" \/><\/p>\n<p>RNAi can be done in a transient way as is often done using cultured cells. The double stranded RNA is made <em>in vitro<\/em> and then isolated and injected into the cells or into embryos (this is done in fly embryos sometimes).\u00a0 Or a plasmid is introduced that will transcribe the ds-RNA but the plasmid is not stable in the cells, and so will be degraded in a relatively short time. These approaches lead to only a transient assay because you introduce the RNA, and you then have a few hours to see the effect. It could be a change in cell behaviour or in gene expression or some other immediate result of the RNAi.\u00a0 After a few hours the effect diminishes as all the ds-RNA has been cut up and used in the response and\/or the plasmid has been degraded by the cell.<\/p>\n<p>To do a longer term experiment the construct you&#8217;ve made to produce the ds-RNA is introduced into an organism and part of the plasmid is integrated into the genome of the organism. Then a strain or line of that organism that contains the construct is kept.\u00a0 We generally like to use inducible promoters in that case, so that we can turn on the production of ds-RNA at a defined time or place and observe the effects. In some organisms, we can use a promoter called the heat-shock promoter. Heat shock proteins are chaperones that help keep proteins from unfolding during high temperature or other types of stress in a cell. The promoters of these proteins are very strong and they induce a large amount of expression. The RNA polymerases and some other regulatory proteins are sitting on these promoters all the time, ready to start transcription the moment these proteins are needed- which by definition is going to be some type of emergency situation for the cell. So when there is a sudden temperature increase the heat shock genes are transcribed vigorously to produce lots of chaperone proteins and protect the cell from the damaging effects of the heat treatment.\u00a0 When we attach these promoters to other genes they will be highly transcribed under our control- when we heat shock the cells or organisms that carry the construct. In different systems, different inducible promoters may be used, but the point is the same- we can turn on the RNAi at specific times to see the effect of knocking out a gene at a particular time.<\/p>\n<p>Sometimes the RNAi effect is not very strong. In that case you can sometimes introduce two different RNAi constructs into the organism, each targeting a different part of the gene. That can increase the effect. Also, you can introduce a construct that has extra copies of the dicer gene on it. Extra copies of the dicer gene mean more of the dicer protein and the construct should have dicer under the control of the same inducible promoter as your RNAi construct. So, when you induce the ds-RNAi for your gene of interest, at the same time you make a lot of extra dicer protein and this greatly enhances the RNAi response.<\/p>\n<div id=\"h5p-37\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-37\" class=\"h5p-iframe\" data-content-id=\"37\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"RNAi 2\"><\/iframe><\/div>\n<\/div>\n<div id=\"h5p-38\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-38\" class=\"h5p-iframe\" data-content-id=\"38\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"RNAi 3\"><\/iframe><\/div>\n<\/div>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-4.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-4\" title=\"chapter 8 part 4 RNAi\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/csPye3tPTW8?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<hr style=\"height: 5px;border-top: solid black\" \/>\n<h2><a id=\"D_CRISPR\"><\/a>D. CRISPR:<\/h2>\n<h3><a id=\"D1_preamble\"><\/a>D-1. Preamble<\/h3>\n<p>A few days ago the 2020 Nobel Prize for Chemistry was awarded to Jennifer Doudna and Emmanuelle Charpentier for their work on the CRISPR-Cas 9 system.\u00a0 They were studying how bacteria can protect themselves from a second viral infection after surviving the first infection and this curiosity-driven research led to the elucidation of a mechanism by which we could edit genes using the same system. Please read the following short article that describes a bit about the research and the researchers.\u00a0 It also contains a short scrollable description of how the whole process works that is easy to understand and nicely illustrated.\u00a0 Your next bioinformatics project is the production of oligonucleotides to make a CRISPR construct for a gene in the foxtail millet, <em>Setaria viridis<\/em>. <span style=\"background-color: #ffff00\">This will contribute toward a <span style=\"text-decoration: underline\">global research project<\/span> that I will tell you about next week.<\/span><\/p>\n<p><a href=\"https:\/\/theconversation.com\/what-is-crispr-the-gene-editing-technology-that-won-the-chemistry-nobel-prize-147695\" target=\"_blank\" rel=\"noopener noreferrer\">The Conversation: What is CRISPR, the gene editing technology that won the Chemistry Nobel prize?<\/a><\/p>\n<hr \/>\n<h3><a id=\"D2_bacteria\"><\/a>D-2. How bacteria do it<\/h3>\n<p>Bacteria that survive an infection by a bacteriophage (virus that infects bacteria) cut out and keep a small segment of the phage DNA and store it in a region of the bacterial chromosome that contains a set of short repeats. Between the repeats are the DNA records of previous viral infections- segments of viral DNA. The repeats are called <strong>C<\/strong>lustered <strong>R<\/strong>egularly <strong>I<\/strong>nterspaced <strong>S<\/strong>hort <strong>P<\/strong>alindromic <strong>R<\/strong>epeats. This is where the term CRISPR comes from.\u00a0 The DNA from the phage that is stored in the CRISPR sites is not just random sequence; it is very precise: it is 20 nucleotides of sequence that are found immediately upstream of a sequence: NGG.\u00a0 This is called a <strong><span style=\"color: #0000ff\">PAM<\/span><\/strong> sequence, which means <strong>p<\/strong>rotospacer <strong>a<\/strong>djacent\u00a0 <strong>m<\/strong>otif. The NGG sequence is not inserted in the CRISPR loci but it is in the corresponding phage DNA and is important for the bacterial response to infection later.<\/p>\n<p>When bacteria are infected again by the same type of phage &#8211; not the same bacterium but the descendants of the cell that survived the infection &#8211; the CRISPR loci are transcribed to make RNA. The RNA that comes from the previous phage DNA is the <strong><span style=\"color: #0000ff\">cr<\/span><span style=\"color: #0000ff\">RNA<\/span><\/strong><span style=\"color: #0000ff\">.<\/span> Another RNA is also transcribed, which is called the <span style=\"color: #0000ff\"><strong>tra<\/strong><strong>crRNA<\/strong>.<\/span>\u00a0 These are functional RNAs; they are not translated into protein but perform their functions as RNAs and they have regions where the two RNAs are complementary to each other and so can base pair with each other. A third gene is also transcribed and translated to produce the <span style=\"color: #0000ff\"><strong>Cas9<\/strong><\/span> protein. Cas means <strong>C<\/strong>RISPR <strong>As<\/strong>sociated protein 9.\u00a0 There are many of these in different systems but we&#8217;ll stick to Cas9 for BISC 357.\u00a0 (It has had other names in the past: Cas5, Csn1, or Csx12 so if you come across these in your studies or in other courses, you will know that these also refer to the Cas9 protein)<\/p>\n<p>So the bacteria are undergoing an infection and there are two RNAs and one protein produced in response. The tracrRNA has a region that binds to the Cas9 protein and the region of complementarity to the crRNA. It acts as a linker between the two.\u00a0 The crRNA is complementary to one strand of the phage DNA. So it binds to the phage DNA and the tracrRNA forms the link connecting the Cas9 protein to the crRNA and when Cas9 is in position on the phage DNA it cuts the phage DNA in a very precise location- 3 nucleotides upstream of the PAM sequence. It cuts both strands of the DNA to make a &#8220;break&#8221;.\u00a0 The phage gene cannot be expressed and furthermore, the DNA is degraded by the bacterial exonucleases. This is where the PAM sequence is essential. The PAM sequence is unique to each type of Cas protein and is absolutely required for the protein to be able to cut the DNA. So even if the crRNA recognizes the 20 bases of target sequence in the phage genome and the tracrRNA connects the Cas9 protein to the crRNA, Cas9 must bind to the PAM sequence or it will be unable to undergo the conformational shift that activates the endonuclease function.<\/p>\n<p>Bacteria are really quite remarkable.<\/p>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-5.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-5\" title=\"Chapter 8 part 5 CRISPR INTRO\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/LM0HaUEpT6o?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<hr \/>\n<h3><a id=\"D3_how\"><\/a>D-3. How we use it: the principle of gene editing<\/h3>\n<p>When we make constructs for CRISPR we use vectors that have everything we need except for the unique 20 bases of sequence that will target the gene we want to knock out.\u00a0 The gene for Cas9 is on the vector because the organisms we are studying don&#8217;t have this protein. The gene for Cas9 has been modified to contain one or more <strong><span style=\"color: #0000ff\">nuclear localization signals<\/span> <\/strong>(NLSs). The protein must enter the nucleus to cut the target DNA so the NLS is required to direct the protein to the correct location to do its job. Bacteria have no nuclear envelope and so do not require NLSs on their Cas9 protein. The two genes for tracr RNA and crRNA have been fused. The &#8220;overlap&#8221; region is removed and the two parts of the RNA that are needed to direct Cas9 to the target DNA are transcribed as a <strong><span style=\"color: #0000ff\">single guide RNA<\/span><\/strong> (sgRNA). At one end is the 20 nucleotide sequence that is designed by the researcher and is specific for the gene of interest. The part that folds up and binds with Cas9 is the same for every gene, so this is an unvarying part of the construct. Both the Cas9 gene and the sgRNA gene are under the control of strong promoters that work in eukaryotes.<\/p>\n<p>&nbsp;<\/p>\n<p>The constructs are introduced into organisms in a variety of ways, depending on the organism.\u00a0 Once inside the single guide RNA is expressed as is the Cas9 protein.\u00a0 They enter the nucleus and the sgRNA binds the target sequence on the gene of interest. The Cas9 protein interacts with the PAM sequence and changes conformation in order to\u00a0 cut the DNA 3 bases upstream of the NGG sequence.<\/p>\n<p>Once a cut is made in the DNA sequence, several things can happen. An accurate repair system might repair the DNA. If this happens it might be cut again. This may happen repeatedly. But there are repair systems that are less accurate that can &#8220;repair&#8221; the DNA but that will introduce mistakes in the sequence in the process. These are the ones that will inactivate the gene.<\/p>\n<div id=\"h5p-39\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-39\" class=\"h5p-iframe\" data-content-id=\"39\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"CRISPR1\"><\/iframe><\/div>\n<\/div>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-6.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-6\" title=\"Chapter 8 part 6 CRISPR\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/JXF2HKOkYUw?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<h4><a id=\"D3i_NHEJ\"><\/a>D3-i. Knock out &#8211; NHEJ repair<\/h4>\n<p>When the <strong>n<\/strong>on-<strong>h<\/strong>omologous <strong>e<\/strong>nd <strong>j<\/strong>oining (<span style=\"color: #0000ff\"><strong>NHEJ<\/strong><\/span>) repair system acts, it reconnects broken DNA but not in an &#8220;accurate&#8221; way. This type of repair can cause insertion or deletions into the DNA at the site of the break and if multiple breaks occur along a chromosome, the &#8220;wrong&#8221; ends might be connected, to generate inversions or translocations. I always think of this repair system as the &#8220;Emergency! Just glue the pieces back together, don&#8217;t worry about getting it perfect!&#8221; type of system.\u00a0 So in the case that we introduce our construct that targets our gene, in many cells we expect that the NHEJ repair system may repair the breaks that are induced in the gene. Consider that if even one nucleotide is added at the break, the entire gene is frame-shifted. Thus the correct protein won&#8217;t be produced. Or if nucleotides are deleted and they are not in multiples of three we will also get a frameshift mutation. The mutations are called &#8220;indel&#8221; because either insertion or deletion of one or two nucleotides may occur and in both situations, a frameshift results. Frameshifts usually result in truncated proteins- you may recall from your gene annotation assignments, that all reading frames except the correct one contain lots of stop codons.<\/p>\n<p>Then we can look at the phenotype of the gene knockout to see the consequences of inactivating the gene.\u00a0 We will probably also collect DNA from the organism and sequence the gene to determine what the exact change in the sequence was. We may obtain several individuals with mutant phenotypes; they may not have exactly the same genetic change.<\/p>\n<h4><a id=\"D3ii_HDR\"><\/a>D3-ii. Directed modification &#8211; HDR<\/h4>\n<p>Homology dependent repair is accurate and sequence-based. If a break occurs in the DNA, proteins bind it, and generate single stranded sections on the ends of the break that then do a &#8220;homology search&#8221;. The strands generally find the homologue (we are talking about diploid organisms here) and use that homologue to copy the correct sequence into the broken region.<\/p>\n<p>In general this repair system would not result in a mutation of the gene we are targeting. However, if we provide some &#8220;donor&#8221; DNA along with the construct that we introduce into the research organism, we are essentially providing the homology dependent repair systems a piece of DNA to copy into the broken region. In this case we could introduce a reporter gene, or a piece of DNA with a particular type of mutation that we want to study. Suppose we thought a certain set of a few amino acids were the critical ones for a protein&#8217;s function. We could provide a donor DNA sequence that lacks\u00a0 the codons for those amino acids, and the repair system could incorporate that into the gene. We could then check to see whether we were correct about the protein&#8217;s function.<\/p>\n<p><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/e-text-chapter-8-slides-part-7.pptx\">Click here for the powerpoint slides presented in the video below<\/a>.<\/p>\n<p><iframe loading=\"lazy\" id=\"oembed-7\" title=\"Chapter 8 final crispr recording\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/jHSe94TFZyY?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>&nbsp;<\/p>\n<p>There is a former student from 357 who also TAd the course a couple of years ago who is currently using this very approach to study heart disease in a zebra fish model for his PhD research.<\/p>\n<p>Perhaps you&#8217;ve already thought of the most interesting application of this approach: the possibility of cutting a target DNA sequence that is actually causing a disease or condition in a person, and replacing that mutated sequence with the wild type form, in order to treat the condition. This has already been tried with sickle cell anemia, and within the last few years, there were reports of a person who had been treated for sickle cell anemia using CRISPR. In sickle cell anemia the hemoglobin gene has a mutation in it that causes the hemoglobin molecules to bind incorrectly to each other. They don&#8217;t carry oxygen as effectively as regular hemoglobin and they cause a deformation of the red bloods cells that causes the cells to get stuck in small capillaries sometimes. This can cause severe pain and because the oxygen carrying capacity is low, people with this condition have low energy. To treat this condition, stem cells were collected from the woman with the condition. The CRISPR treatment was performed on the cells to inactivate a gene called <em>BCL11A. <\/em>This gene shuts off production of fetal hemoglobin a few months after a baby is born. Inactivating this gene in the adult woman&#8217;s stem cells allowed the fetal hemoglobin\u00a0 gene to turn back on in those cells. The woman underwent a round of chemotherapy, presumably to kill the stem cells in her bone marrow and then the CRISPR-treated stem cells were reintroduced into her body. Producing large amounts of the fetal hemoglobin molecule (which is wild type) prevents the sickling of the blood cells. As a bonus, the fetal form of hemoglobin has higher oxygen carrying capacity than the adult form. The person who was treated seems to be in good health, a couple of years after the treatment.\u00a0 There is a short article about this, from July of 2022 linked below:<\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/www.healthline.com\/health-news\/first-person-treated-for-sickle-cell-disease-with-crispr-is-doing-well\" target=\"_blank\" rel=\"noopener noreferrer\">healthline: First Person Treated for Sickle Cell Disease with CRISPR Is Doing Well<\/a><\/p>\n<h4><em><a id=\"D3iii_Designing_oligonucleotides\"><\/a>D3-iii. Designing oligonucleotides that will generate an overhang for cloning<\/em><\/h4>\n<p>This is not specific to CRISPR, but is an approach we are going to use in our CRISPR bioinformatics exercise, so it is a good place to introduce it.\u00a0 We design oligonucleotides that are complementary to each other, EXCEPT for four nucleotides at the 5&#8242; ends of the oligos.\u00a0 We are going to clone into a vector cut with a special enzyme which cuts OUTSIDE of the recognition sequence so that the overhang generated is different at each site, depending on what sequence is near the recognition site. Our vector has two restriction sites (<em>Bsa<\/em>I) and the overhangs generated are not complementary to each other. Thus the vector cannot re-circularize during ligation.<\/p>\n<p>To make the insert, we take equimolar amounts (the same # of molecules) of each oligo and combine them in a PCR tube. We heat to 96C for about 5 minutes and then allow the temperature to decrease very gradually. This allows the complementary parts of the forward and reverse oligos to find each other and to bind. The 5&#8242; ends stick out (see below) and the sequences that are sticking out are complementary to the overhangs in the vector.\u00a0 There is more information in the lecture on CRISPR, the bioinformatics assignment in which you design the oligos for this project, the lecture about the project we&#8217;re working on, as well as your lab handout, later in the semester. In the image below, the squiggly lines at the 5&#8242; ends of the annealed oligos are the 4-bp sequences that match the insertion site in the cut vector.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1082\" src=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo.png\" alt=\"\" width=\"418\" height=\"77\" srcset=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo.png 418w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo-300x55.png 300w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo-65x12.png 65w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo-225x41.png 225w, https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-content\/uploads\/sites\/1093\/2020\/07\/annealed-oligo-350x64.png 350w\" sizes=\"auto, (max-width: 418px) 100vw, 418px\" \/><\/p>\n<hr \/>\n<p style=\"text-align: left\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-8-investigating-gene-function-3-expression-constructs\/\">Previous (Chapter 8)<\/a><span style=\"float: right\"><a href=\"https:\/\/pressbooks.bccampus.ca\/kathleef\/chapter\/chapter-10-sanger-sequencing\/\">Next (Chapter 10)<\/a><\/span><\/p>\n","protected":false},"author":1046,"menu_order":10,"template":"","meta":{"pb_show_title":"on","pb_short_title":"Knock-out and Knock-down","pb_subtitle":"RNAi and CRISPR-Cas9","pb_authors":[],"pb_section_license":"cc-by"},"chapter-type":[47],"contributor":[],"license":[52],"class_list":["post-70","chapter","type-chapter","status-publish","hentry","chapter-type-standard","license-cc-by"],"part":3,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapters\/70","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/wp\/v2\/users\/1046"}],"version-history":[{"count":26,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapters\/70\/revisions"}],"predecessor-version":[{"id":1521,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapters\/70\/revisions\/1521"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/parts\/3"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapters\/70\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/wp\/v2\/media?parent=70"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/pressbooks\/v2\/chapter-type?post=70"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/wp\/v2\/contributor?post=70"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/kathleef\/wp-json\/wp\/v2\/license?post=70"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}