Genes: Recipes for proteins

A gene is a unit of DNA that codes for a protein or part of a protein. To make a protein, the cell uses a gene’s precise sequence of nitrogenous bases (A, T, C and G) as a guide.

Essential for the survival of all living organisms, proteins come in a wide variety of forms and functions. For example, variations in pigmentation proteins are what give panda bears their unique black-and-white coat. The opsin proteins in the eyes of bees are sensitive to ultraviolet light and flower patterns invisible to the human eye, which is crucial for flower pollination.

The genome can be compared to a cookbook filled with recipes, as it contains many genes that each produce a particular protein.

Alleles: Recipe variants

When one or more of a gene’s nitrogenous bases differ, these variants are called alleles.

For example, eyes can have alleles that contain information to create brown, blue, green or grey pigment. 

Gene expression 

All of an organism’s cells contain identical DNA. However, only the sections that a specific cell needs to function are expressed. This is what makes cells (such as neurons or muscle cells) different based on their roles in the body.

The ways cells package their DNA depending on the sections they need to access is referred to as epigenetics. When a gene is activated, its DNA is transcribed into RNA. This messenger RNA is then translated by ribosomes into a functional protein.

Thanks to this precise regulation, each gene is expressed at the right time and in the right place in the organism.

Did you know?

The size of an organism’s genome does not correlate to its complexity.

For example, the human genome has around 3.2 billion base pairs, while the largest known genome belongs to the Tmesipteris oblanceolata fern, which has 160 billion base pairs!

HumanParis JaponicaTmesipteris oblanceolata Nasuia deltocephalinicola (a bacterium)Arabidopsis Thaliana
3.2 billion base pairs149 billion base pairs160 billion base pairs 112,000 base pairs115 million base pairs
Sources: The Human Genome Project pieced together only 92% of the DNA – now scientists have finally filled in the remaining 8% (theconversation.com), The Chemistry of Arabidopsis thaliana – ScienceDirect, A 160 Gbp fork fern genome shatters size record for eukaryotes: iScience (cell.com)
Learn more

Only 1% of our genome directly codes for protein synthesis. The rest, called the non-coding genome, plays a role in gene regulation. These DNA regulatory regions, such as enhancers, promoters and other sequences, regulate gene expression. In genes, promoters are binding sites located right beside the start site for transcription, which is when DNA is copied into RNA. Enhancers increase the activity of certain genes, sometimes at a distance from the transcription start site.
Proteins called transcription factors direct the timing and intensity of gene activation and deactivation, which is why some cells express certain genes more than others, why not all cells produce the same proteins, and why cells have different functions.
Studying and understanding these regulatory regions is an important part of genomics research.
Source: The Human Genome Project pieced together only 92% of the DNA – now scientists have finally filled in the remaining 8% (theconversation.com)

Protein synthesis

Cells synthesize proteins using DNA as a kind of guide or recipe book. 

Steps in protein production:   

  1. The DNA information to create a protein (the gene) is copied into messenger RNA (mRNA).
  2. The mRNA then leaves the nucleus, enters the cytoplasm and binds to a ribosome.
  3. The ribosome moves to the rough endoplasmic reticulum (RER), where it reads the mRNA sequence and assembles the amino acids into a polypeptide chain.
  4. Once complete, the chain is released into the RER, where it is processed and folded into a protein.
  5. Proteins are carried by transport vesicles from the RER to the Golgi body.

The Golgi body processes the proteins before sending them to the cell surface or into the cytoplasm. The Golgi body also adds chemical tags to classify the new proteins according to their final destination.

Proteins that will be secreted out of the cell or embedded into the cell membrane are packaged into transport vesicles that bud from the Golgi body. These vesicles fuse with the cell membrane, and the protein is either released from the cell or delivered into the membrane.

Protein structure

Proteins are made of amino acids that bind in a specific order to create polypeptide chains.

When a protein is being made, its polypeptide chain must fold up into a specific three-dimensional shape so that the protein can function properly and fulfill its role. An error in the protein’s amino acid sequence or shape can impact its function and cause a problem for the organism.

In eukaryotic cells, the first step in protein production involves locating the target gene, i.e. the DNA‑sequence “recipe” that codes for the desired protein.

Once the gene is located, the DNA sequence is copied into a molecule called messenger RNA (mRNA). RNA is similar to DNA but has only a single strand and has uracil (U) nitrogenous bases instead of thymine (T) nitrogenous bases. The process to create the DNA copy is called transcription, which is orchestrated by a number of proteins.

Unlike DNA, mRNA can leave the cell nucleus and travel to the ribosomes, which are structures in the cytoplasm that translate information from mRNA.

Codons: Three-letter codes

Codons are sequences of three nitrogenous bases that make up DNA that code for a particular amino acid. The order of the nitrogenous bases in each codon is very important. For example, C-A-C codes for the amino acid histidine, whereas C-C-A codes for the amino acid proline. As a comparison, the words “pare” and “pear” in a recipe book may have the same letters, but their different order means something important for how our recipe will turn out. 

Ribosomes are organelles that “read” mRNA one codon after another to make the backbone for the protein (the polypeptide chain). Ribosomes are like cooks who read and interpret recipes to make proteins.

With each codon, an amino acid is added to the polypeptide chain until all the codons have been read and the ribosome reaches a stop codon on the mRNA. Stop codons mark the end of the coding sequence and signal to the ribosomes to stop synthesizing the protein and release the newly created polypeptide chain.

At the end of the translation process, the polypeptide chain may be further modified; for example, it may be folded into its functional three-dimensional structure or receive specific chemical groups. These are called  post-translational modifications.

Alternative splicing is when a single gene codes for different proteins.

After DNA is transcribed into mRNA, the mRNA is spliced so that some of its sections (introns and exons) can be added or removed.

Exons are the coding sections of mRNA that determine the amino acid sequence of the final protein. Introns are non-coding sections that are not involved in forming the amino acid sequence.

Through this mechanism, the same gene can create different mRNA molecules that will translate into different proteins.

To illustrate splicing, let’s take the word EPICUREAN. If we remove some of its letters, we can form the words EPIC, CURE, PAN, RAN and so on. The same thing happens when genes are spliced. Some of the information is kept while other parts are removed to produce different copies of mRNA, which in turn produces different proteins.

A protein’s three-dimensional shape is what determines its function.

There are four different stages of folding that produce the protein’s final shape:

  • Primary structure: A protein is initially translated into a long chain of amino acids called a polypeptide. In this form, the protein is not yet functional.
  • Secondary structure: Next, the polypeptide chain folds in on itself. Hydrogen atoms in the chain interact with each other to form α-helices or β-sheets that give the protein stability.
  • Tertiary structure: The protein then takes on its final three-dimensional structure through interactions between its amino acids. It becomes functional in this form.
  • Quaternary structure: Some proteins have multiple polypeptides. A quaternary structure is the arrangement of these polypeptides.

More complex proteins consist of multiple linked polypeptides that are coded by different genes.

An example is hemoglobin, a protein in red blood cells that transports oxygen and has four subunits, each of which has a specific amino acid sequence that comes from different genes. To synthesize hemoglobin, multiple genes must simultaneously transcribe the protein’s different subunits, which are then assembled to form the final protein.

Proteins: The workers of the cell

Proteins are essential to the functioning of living organisms. They are highly varied and perform a wide array of biological functions.  

Protein function

Proteins perform so many functions in living organisms, including as enzymes or as the main component of some hormones. Some proteins are used for cell-to-cell communication, while others transport molecules or play a role in defending the organism.

EnzymesEnzymes trigger chemical reactions in living organisms and create the right conditions for these reactions to take place.  Lactase is an enzyme that lets us digest lactose.
HormonesHormones are chemical messengers that send information to different parts of the body.Insulin is a protein hormone that plays a role in glucose regulation.
Signalling proteinsThese proteins help with communication between cells, by binding to the cell membrane and sending signals to the cell, for example.Membrane receptors allow cells to receive signals. 
Contractile proteinsThese proteins are responsible for muscle contraction.Proteins such as actin and myosin help muscles contract.
Transport proteinsThese proteins allow molecules to travel from point A to point B.Hemoglobin is a protein that transports oxygen.
Support proteinsSupport proteins provide mechanical support for cells.Proteins that form the cell cytoskeleton.
Defence proteinsThese proteins fight off external pathogens and promote wound healing.Antibodies are proteins that recognize pathogens.
Storage proteinsThese proteins store up nutrient reserves.This is thought to be the role of ovalbumin, the main protein in egg white.

Proteins and phenotype

An organism’s set of observable traits, such as the colour of a plant’s flowers, is called its phenotype.  These traits depend on the proteins that produce physical characteristics and the genes that code for them.

  • Variations in pigmentation proteins are what give panda bears their unique black-and-white coat.
  • A snail’s shell is hard because of the shell’s protein matrix.
  • The proteins produced by some mushrooms are secondary metabolites with medicinal or toxic properties.
  • Yaks have a mutation that increases their production of hemoglobin (a protein), which improves their ability to transport oxygen at a high altitude.
  • The opsin proteins in the eyes of bees are sensitive to ultraviolet light and flower patterns invisible to the human eye, which is crucial for flower pollination.
  • Some insects develop resistance to pesticides by producing enzymes (proteins) that neutralize these chemicals.
  • Certain fish that live in icy waters produce antifreeze proteins that prevent ice crystals from forming in their blood and let them survive in extreme conditions.

Bookings

If you have any questions, comments, or feedback, we would love to hear from you.

Please contact us at the following email adress : education@genomequebec.com

Careers in Genomics

Working in genomics

àGenomics offers a range of career opportunities that require different levels of education. 

Biomedical Laboratory Technician*

Required level of education: College

Biomedical laboratory technicians conduct control tests on product samples based on specifications, protocols and standard operating procedures. They provide technical support to various departments: analytical, control or bio-analytical. Technicians use a variety of laboratory techniques to carry out chemical, biochemical or genetic tests.

Examples of typical duties:

  • Conducting physical, chemical, biological, biochemical or microbiological testing on samples of raw materials or finished products.
  • Performing analytical problem solving.
  • Producing documents on the results of sample testing.

Biochemist

Required level of education: University

Biochemists study and analyze chemical reactions and biological processes that occur at the molecular level of living organisms in order to enhance scientific knowledge. They also seek out real-world applications for research in areas such as medicine, pharmaceuticals, genetics, agriculture, industry and even biotechnology. 

Examples of typical duties:

  • Studying the chemical processes involved in the various functions of organisms, such as digestion, energy conversion in living matter, growth and aging.
  • Isolating and characterizing enzymes, hormones or genes and identifying their impact on the human body.
  • Using genetic engineering techniques.
  • Developing tests and new drugs.
  • Producing reports and recommendations on research results.

Biologist 

Required level of education: University

Biologists study living beings. They strive to understand how cells work. They focus on areas such as DNA replication and animal or plant cells in order to discover new therapeutic substances. They also study the chemical reactions in biological entities. They are fascinated by the interactions between active substances and living organisms. They are also interested in microorganisms, such as viruses, fungi and bacteria.

Examples of typical duties:

  • Studying manifestations of life in living organisms.
  • Conducting experiments on the growth, heredity and reproduction of plants and animals.
  • Studying the repercussions of human activities on the environment.
  • Studying the relationships among individuals (plants and animals) and their environment.

Microbiologist

Required level of education: University

Microbiologists study the structure, functions, ecology, biotechnology and genetics of microorganisms (viruses, bacteria, yeasts, fungi, algae) by conducting experiments and research to enhance scientific knowledge and develop practical applications for society and industry.

Examples of typical duties:

  • Taking samples from living tissue.
  • Isolating, identifying and harvesting specimens in the lab.
  • Studying the action of microorganisms on living tissue (infectious diseases) and examining how they propagate as infectious agents.
  • Studying microorganisms that decompose organic matter and fertilize soil.
  • Controlling the safety of food and water.
  • Controlling the quality of pharmaceuticals, drugs, cosmetics, pesticides, etc.

Many other professions require the use of genomics  

College level:

  • Analytical chemistry technician
  • Chemical process technician
  • Inspector
  • Crime scene technician

University level:

  • Biophysicist
  • Chemist
  • Coroner
  • Biotechnology engineer

To learn more about the various careers related to genomics, talk to your school guidance counsellor or consult the programs of study offered by the schools in your area. 

*Source : www.reperes.qc.ca (French only)

DNA day
DNA modification

Mutations: Driving biodiversity

Mutations are permanent changes to one or more nitrogenous bases in a DNA sequence. These genetic changes may be harmful or beneficial or may have no consequences for the organism. 

Mutations in the DNA sequence always occur randomly. However, beneficial mutations that give the organism a better chance of survival are more likely to be passed on to the next generation. This is how species evolve and adapt to changes in their environment. 

Mutations affecting a single nitrogenous base are very common. They’re called point mutations and they come in three categories:

Point mutationThis common type of change involves a single pair of nitrogenous bases
SubstitutionOne nitrogenous base is replaced with another
InsertionAn additional nitrogen base is inserted into the DNA sequence
DeletionA base is deleted from the DNA sequence
  • Substitutions can lead to a change in amino acid in the encoded protein.
  • Insertions and deletions create frameshifts by changing how codons are read, which disrupts the amino acid sequence and often produces a non-functional protein.

Mutations are also described according to the effect they produce.

Silent mutationsThese mutations cause no change in the final protein, as the amino acid in question can be encoded by different codons.  
Nonsense mutationThese mutations create a premature stop codon. The resulting proteins are incomplete and generally non-functional.
Missense mutationsThese mutations create a different amino acid in the final protein.

Other mutation typesThese mutation types involve more than one pair of nitrogenous bases
Tandem repeat mutations These mutations increase or decrease the number of repeats of a specific sequence of nitrogenous bases.
Chromosomal rearrangementsThese changes in the structure of chromosomes can disrupt normal gene function by putting genes under the control of incorrect regulators or changing their expression.
  • Breakdown in natural DNA repair mechanisms.
  • Exposure to mutagens such as toxic chemicals (e.g., in cigarettes), viruses and UV radiation from the sun.
  • Errors introduced during the DNA replication or genetic recombination that occurs during sex cell formation.

Cancer

Cancer is a disease characterized by the uncontrolled growth of abnormal cells in the body. The genetic mutations behind cancer change the mechanisms that control the cell cycle and apoptosis (programmed cell death) and allow cells to grow out of control and never die. These cells constantly divide and form tumours, which can then invade and harm the body’s tissues.

Understanding mutation mechanisms is therefore a vital part of cancer research. Identifying the specific types of mutations in cancer cells and understanding their causes lets researchers develop targeted methods to treat different types of cancer.

The study of genetic mutations has important implications for many other fields, such as evolution, developmental biology and research into hereditary diseases.

Mutation and evolution

Mutations are important from an evolutionary standpoint because they give rise to genetic diversity. Every mutation creates a new, unique version of a gene. If a mutation occurs in a sex cell, it can be passed on to the organism’s offspring.

When beneficial mutations give an organism an advantage in its specific environment, the organism is more likely to survive and reproduce. The frequency of a beneficial mutation that increases in a population over multiple generations is known as natural selection, a key concept in Charles Darwin’s theory of evolution.

Natural selection also tends to weed out mutations that hinder the survival of an organism, as individuals carrying these mutations will be less likely to reproduce and pass them on to their children.

Generation length is an also important factor in an organism’s ability to adapt to a changing environment. Organisms with shorter life cycles can potentially evolve faster, as they have more opportunities to generate offspring that carry beneficial mutations.

Evolution is therefore driven by the tension between mutations, which create genetic diversity, and natural selection, which acts as a “filter” that promotes beneficial mutations and hinders non-beneficial ones. This ongoing process of mutation and natural selection is how populations adapt, how they evolve over the long term, and how new species emerge.

Biotechnology  

What is biotechnology?

Biotechnology is the study of living organisms to develop technologies that are useful to society. There are applications for biotechnology in many fields, such as medicine, agriculture, engineering and the environment.

Transgenic organisms are created through genetic engineering, a technique that allows scientists to insert the genes of one species into the genome of another to modify certain traits. A transgenic organism therefore carries a gene from another species in its genome. For example, agricultural crops that require fewer pesticides can be engineered using a gene for resistance to certain pathogens.

CRISPR-Cas9 is used as “molecular scissors” to add, remove or change target genes. Scientists adapted this gene-editing technique from an immune defence mechanism in bacteria and archaea as a way to cut DNA using Cas9 proteins. Guide RNA sequences accompany and direct the Cas9 proteins to specific genes. Once CRISPR-Cas9 binds to its target, it cuts the DNA, which triggers the cell’s natural repair mechanisms, which modify or replace the existing gene.

Some body tissues, such as those in the spinal cord, don’t repair themselves if damaged. In such cases, stem cells can be reprogrammed to trigger the repair and regeneration of the affected tissues.

Embryonic stem cells are cells from an early‑stage embryo that differentiate into all types of cells in the body (muscle cells, epithelial cells, etc.). Although their regenerative potential is immense, their use raises ethical questions since the cells are extracted from embryos.

Scientists can also use cell reprogramming techniques to transform cells that have already differentiated—such as skin cells—into stem cells that can change into other cell types. These cells are called induced pluripotent stem cells.

Cloning is a technique to create an organism that is genetically identical to another. A famous example is Dolly, a sheep that was cloned in 1996. Learn more about Dolly here: DOLLY – Britannica

Ethical issues

Biotechnology opens up a world of possibilities. However, it also raises ethical questions about human responsibility and respect for life.

DNA: The Code of Life!

What is DNA?

DNA, which stands for deoxyribonucleic acid, is a molecule found in the cells of all living organisms that contains the genetic information an organism needs to develop, grow and function.

The complete set of genetic material that makes up an organism’s DNA is called its genome. Genomes vary between species, as well as within species and between individuals.

A genome is like a big recipe book. Genes are the recipes in the book that are used to produce a specific component that the organism needs to function.

DNA structure

The double helix code

DNA is a very long molecule with two strands shaped like a twisted ladder. This shape is what gives it the name double helix.  

The DNA strands are made of alternating sugar (deoxyribose) and phosphate molecules. Each sugar molecule bonds to one of four nitrogenous bases called adenine (A), thymine (T), cytosine (C) and guanine (G). When these bases form pairs, A always goes with T and C always goes with G. These pairs link the two strands together to form the “rungs” of the ladder.

How is DNA decoded?

The term genetic code refers to the order that the nitrogenous bases follow each other on a strand of DNA, an example being A-C-C-A-T-T-C-G-C-T. Genetic sequencing lets us decipher this code. These letters can be thought of as an alphabet forming words in a recipe that help us understand how organisms are put together.

A genome is made up of both coding and non-coding DNA. Coding DNA contains information that creates a protein. This type accounts for about 2% of the genome. The non-coding parts include promoters, stop codons and enhancers that regulate the expression of certain genes. The role of some non-coding DNA is still not fully understood.

Where is DNA located?

All of an organism‘s cells, except its gametes, contain a complete copy of its genetic material.

Animal and plant cells are called eukaryotic, which means they have a nucleus that stores DNA. Organisms, such as many microorganisms, that do not have a nucleus in their cells are called prokaryotic. In these cells, the DNA floats in the cytoplasm and clusters into a mass called the nucleoid.

In eukaryotic cells, the DNA—which contains all of an organism’s genetic information—stays in the nucleus, where it is protected and compacted.

To get the genetic information out of the nucleus, the DNA is copied into RNA. Messenger RNA then leaves the nucleus to direct protein synthesis in the cytoplasm, while other types of RNA, such as ribosomal RNA and transfer RNA, actively help different parts of the cell to function.

In eukaryotic cells, DNA is also found in mitochondria and chloroplasts.

Mitochondria are organelles that produce energy (ATP). This type of DNA is different from nuclear DNA. It has a circular shape and is transmitted from one generation to the next through the mother’s egg cells. This means that an entire maternal line will have the same mitochondrial DNA! (Note: Although rare, mitochondrial DNA can be passed on from the father. Studies are ongoing to understand how this happens.)

Chloroplasts are organelles that conduct photosynthesis. For a long time, it was thought that chloroplast DNA was also circular, but studies show that it is more often linear. Chloroplast DNA is passed on through egg cells or pollen and allows the chloroplasts to synthesize some of their proteins.

Although viruses are not made of cells, they do contain genetic material in the form of DNA or RNA. To replicate and spread, they need to hijack the mechanisms and resources of the cell they invade, called the host cell.

A DNA virus will enter the nucleus and deliver its viral genome into the host cell’s DNA. The virus then takes control of the cell’s machinery to produce viral proteins.

RNA viruses take over the ribosomes of infected cells directly to produce viral proteins.

DNA compaction: Big information in a small package

DNA is extremely long, and each human cell contains about 2 metres of this molecule! To fit the entirety of its genetic information into the tiny nucleus of a cell, DNA is compacted and coiled into chromatin.

Compaction also regulates access to DNA and controls gene expression. This is because compacted regions cannot be reached by RNA polymerases and therefore can’t be transcribed into RNA.

Chromatin is formed through multiple stages of compaction:

1. Proteins called histones form a spherical structure called the nucleosome. The DNA winds around this structure like a spool.

2. These nucleosomes become compacted to form chromatin, or strands of coiled DNA that look like beads on a necklace.  

When cells divide, chromatin reorganizes itself into an even more compact form called chromosomes

Humans have 23 pairs of chromosomes, with each pair assigned a number from 1 to 23. The first 22 pairs of chromosomes are called autosomes. The 23rd pair are the sex chromosomes, and these come in two types in humans: X and Y. The 23rd chromosome pair may consist of:

  • two identical chromosomes (XX)
  • two different chromosomes (XY)

The sex chromosomes do not always come in pairs, as an individual may have:

  • a single X chromosome

Or

  • three chromosomes (XXX, XXY or XYY)

An image of the size, shape, number and structure of an individual’s chromosomes can be produced with karyotyping, which involves taking a snapshot of chromosomes during cell division when the chromosomes are most visible.

Karyotyping is frequently done by medical geneticists to diagnose genetic disorders.

Chromosomes are passed on by the parents during sexual reproduction. Reproductive cells, called gametes, have one pair of chromosomes each. During fertilization, the gametes merge to form a zygote, which then inherits a complete set of chromosomes.

Each parent therefore passes on half of their chromosomes to their offspring. 

Educational space

Welcome to the Educational space

This platform is aimed primarily at secondary school students and science and technology teachers. It presents the basic concepts of genetics, as well as introducing the more advanced notions of genomics. Génome Québec also offers free classroom activities to help students put their knowledge into practice.

A reliable and relevant resource

The platform was developed with teachers and educational consultants to meet their needs, as well as the requirements of the Québec Ministry of Education’s Science and Technology program. In addition, all content and activities were developed with the support of scientific teams.

Flight 450

Documents associated

Flight450 minilab in classroom – Destination DNA

Are you a science and technology teacher, a lab technician or an education consultant? Are you looking for an activity on genetics that will engage your students?

The Flight450 minilab for the classroom – Destination DNA, developed by the Commission scolaire de Laval, in partnership with Génome Québec, the McGill University and Génome Québec Innovation Centre and the Commission scolaire de la Seigneurie-des-Mille-Îles, provides students with the opportunity to use scientific investigation while handling real DNA.

The Flight450 minilab for the classroom – Destination DNA, developed by the Commission scolaire de Laval, in partnership with Génome Québec, the McGill University and Génome Québec Innovation Centre and the Commission scolaire de la Seigneurie-des-Mille-Îles, provides students with the opportunity to use scientific investigation while handling real DNA.

The Flight450 minilab for the classroom – Destination DNA, developed by the Commission scolaire de Laval, in partnership with Génome Québec, the McGill University and Génome Québec Innovation Centre and the Commission scolaire de la Seigneurie-des-Mille-Îles, provides students with the opportunity to use scientific investigation while handling real DNA.

Detailed and user-friendly teaching materials (teacher’s guide, student handbook, videos, PowerPoint presentation) are made available to support you throughout the experience. By the end of this process, you and your students will have uncovered the secrets of analyzing DNA results!

Reserve your kit today and take your students on a science adventure they won’t soon forget!

During this activity, your students will be able to:

  • Replicate real fragments of human DNA
  • Use a polymerase chain reaction (PCR)
  • Make DNA migrate in agarose gel electrophoresis
  • Analyze the results

Testimonials

“I loved this experiment. The electrophoresis is a very interesting technique that was fun to do.”
— ANNICK ROCHELEAU, LAB TECHNICIAN, ÉCOLE SECONDAIRE ARMAND-CORBEIL (TERREBONNE)

“Great experience that gave students the chance to apply their theoretical knowledge and bridge the gap with careers in science.”
— YERO BA, MATH AND SVT TEACHER, COLLÈGE STANILAS (QUÉBEC)

“The material was so well organized and the quality of the documents was great: it had everything! The activity was interesting and original. Students really liked it. Thank you very much!”
— KARINE LESSARD, TEACHER, POLYVALENTE CHANOINE-ARMAND-RACICOT
(SAINT-JEAN-SUR-RICHELIEU)

From gene to protein

Genes: Recipes for proteins

A gene is a unit of DNA that codes for a protein or part of a protein. To make a protein, the cell uses a gene’s precise sequence of nitrogenous bases (A, T, C and G) as a guide.

Essential for the survival of all living organisms, proteins come in a wide variety of forms and functions. For example, variations in pigmentation proteins are what give panda bears their unique black-and-white coat. The opsin proteins in the eyes of bees are sensitive to ultraviolet light and flower patterns invisible to the human eye, which is crucial for flower pollination.

The genome can be compared to a cookbook filled with recipes, as it contains many genes that each produce a particular protein.

Alleles: Recipe variants

When one or more of a gene’s nitrogenous bases differ, these variants are called alleles.

For example, eyes can have alleles that contain information to create brown, blue, green or grey pigment. 

Gene expression 

All of an organism’s cells contain identical DNA. However, only the sections that a specific cell needs to function are expressed. This is what makes cells (such as neurons or muscle cells) different based on their roles in the body.

The ways cells package their DNA depending on the sections they need to access is referred to as epigenetics. When a gene is activated, its DNA is transcribed into RNA. This messenger RNA is then translated by ribosomes into a functional protein.

Thanks to this precise regulation, each gene is expressed at the right time and in the right place in the organism.

Did you know?

The size of an organism’s genome does not correlate to its complexity.

For example, the human genome has around 3.2 billion base pairs, while the largest known genome belongs to the Tmesipteris oblanceolata fern, which has 160 billion base pairs!

HumanParis JaponicaTmesipteris oblanceolata Nasuia deltocephalinicola (a bacterium)Arabidopsis Thaliana
3.2 billion base pairs149 billion base pairs160 billion base pairs 112,000 base pairs115 million base pairs
Sources: The Human Genome Project pieced together only 92% of the DNA – now scientists have finally filled in the remaining 8% (theconversation.com), The Chemistry of Arabidopsis thaliana – ScienceDirect, A 160 Gbp fork fern genome shatters size record for eukaryotes: iScience (cell.com)
Learn more

Only 1% of our genome directly codes for protein synthesis. The rest, called the non-coding genome, plays a role in gene regulation. These DNA regulatory regions, such as enhancers, promoters and other sequences, regulate gene expression. In genes, promoters are binding sites located right beside the start site for transcription, which is when DNA is copied into RNA. Enhancers increase the activity of certain genes, sometimes at a distance from the transcription start site.
Proteins called transcription factors direct the timing and intensity of gene activation and deactivation, which is why some cells express certain genes more than others, why not all cells produce the same proteins, and why cells have different functions.
Studying and understanding these regulatory regions is an important part of genomics research.
Source: The Human Genome Project pieced together only 92% of the DNA – now scientists have finally filled in the remaining 8% (theconversation.com)

Protein synthesis

Cells synthesize proteins using DNA as a kind of guide or recipe book. 

Steps in protein production:   

  1. The DNA information to create a protein (the gene) is copied into messenger RNA (mRNA).
  2. The mRNA then leaves the nucleus, enters the cytoplasm and binds to a ribosome.
  3. The ribosome moves to the rough endoplasmic reticulum (RER), where it reads the mRNA sequence and assembles the amino acids into a polypeptide chain.
  4. Once complete, the chain is released into the RER, where it is processed and folded into a protein.
  5. Proteins are carried by transport vesicles from the RER to the Golgi body.

The Golgi body processes the proteins before sending them to the cell surface or into the cytoplasm. The Golgi body also adds chemical tags to classify the new proteins according to their final destination.

Proteins that will be secreted out of the cell or embedded into the cell membrane are packaged into transport vesicles that bud from the Golgi body. These vesicles fuse with the cell membrane, and the protein is either released from the cell or delivered into the membrane.

Protein structure

Proteins are made of amino acids that bind in a specific order to create polypeptide chains.

When a protein is being made, its polypeptide chain must fold up into a specific three-dimensional shape so that the protein can function properly and fulfill its role. An error in the protein’s amino acid sequence or shape can impact its function and cause a problem for the organism.

In eukaryotic cells, the first step in protein production involves locating the target gene, i.e. the DNA‑sequence “recipe” that codes for the desired protein.

Once the gene is located, the DNA sequence is copied into a molecule called messenger RNA (mRNA). RNA is similar to DNA but has only a single strand and has uracil (U) nitrogenous bases instead of thymine (T) nitrogenous bases. The process to create the DNA copy is called transcription, which is orchestrated by a number of proteins.

Unlike DNA, mRNA can leave the cell nucleus and travel to the ribosomes, which are structures in the cytoplasm that translate information from mRNA.

Codons: Three-letter codes

Codons are sequences of three nitrogenous bases that make up DNA that code for a particular amino acid. The order of the nitrogenous bases in each codon is very important. For example, C-A-C codes for the amino acid histidine, whereas C-C-A codes for the amino acid proline. As a comparison, the words “pare” and “pear” in a recipe book may have the same letters, but their different order means something important for how our recipe will turn out. 

Ribosomes are organelles that “read” mRNA one codon after another to make the backbone for the protein (the polypeptide chain). Ribosomes are like cooks who read and interpret recipes to make proteins.

With each codon, an amino acid is added to the polypeptide chain until all the codons have been read and the ribosome reaches a stop codon on the mRNA. Stop codons mark the end of the coding sequence and signal to the ribosomes to stop synthesizing the protein and release the newly created polypeptide chain.

At the end of the translation process, the polypeptide chain may be further modified; for example, it may be folded into its functional three-dimensional structure or receive specific chemical groups. These are called  post-translational modifications.

Alternative splicing is when a single gene codes for different proteins.

After DNA is transcribed into mRNA, the mRNA is spliced so that some of its sections (introns and exons) can be added or removed.

Exons are the coding sections of mRNA that determine the amino acid sequence of the final protein. Introns are non-coding sections that are not involved in forming the amino acid sequence.

Through this mechanism, the same gene can create different mRNA molecules that will translate into different proteins.

To illustrate splicing, let’s take the word EPICUREAN. If we remove some of its letters, we can form the words EPIC, CURE, PAN, RAN and so on. The same thing happens when genes are spliced. Some of the information is kept while other parts are removed to produce different copies of mRNA, which in turn produces different proteins.

A protein’s three-dimensional shape is what determines its function.

There are four different stages of folding that produce the protein’s final shape:

  • Primary structure: A protein is initially translated into a long chain of amino acids called a polypeptide. In this form, the protein is not yet functional.
  • Secondary structure: Next, the polypeptide chain folds in on itself. Hydrogen atoms in the chain interact with each other to form α-helices or β-sheets that give the protein stability.
  • Tertiary structure: The protein then takes on its final three-dimensional structure through interactions between its amino acids. It becomes functional in this form.
  • Quaternary structure: Some proteins have multiple polypeptides. A quaternary structure is the arrangement of these polypeptides.

More complex proteins consist of multiple linked polypeptides that are coded by different genes.

An example is hemoglobin, a protein in red blood cells that transports oxygen and has four subunits, each of which has a specific amino acid sequence that comes from different genes. To synthesize hemoglobin, multiple genes must simultaneously transcribe the protein’s different subunits, which are then assembled to form the final protein.

Proteins: The workers of the cell

Proteins are essential to the functioning of living organisms. They are highly varied and perform a wide array of biological functions.  

Protein function

Proteins perform so many functions in living organisms, including as enzymes or as the main component of some hormones. Some proteins are used for cell-to-cell communication, while others transport molecules or play a role in defending the organism.

EnzymesEnzymes trigger chemical reactions in living organisms and create the right conditions for these reactions to take place.  Lactase is an enzyme that lets us digest lactose.
HormonesHormones are chemical messengers that send information to different parts of the body.Insulin is a protein hormone that plays a role in glucose regulation.
Signalling proteinsThese proteins help with communication between cells, by binding to the cell membrane and sending signals to the cell, for example.Membrane receptors allow cells to receive signals. 
Contractile proteinsThese proteins are responsible for muscle contraction.Proteins such as actin and myosin help muscles contract.
Transport proteinsThese proteins allow molecules to travel from point A to point B.Hemoglobin is a protein that transports oxygen.
Support proteinsSupport proteins provide mechanical support for cells.Proteins that form the cell cytoskeleton.
Defence proteinsThese proteins fight off external pathogens and promote wound healing.Antibodies are proteins that recognize pathogens.
Storage proteinsThese proteins store up nutrient reserves.This is thought to be the role of ovalbumin, the main protein in egg white.

Proteins and phenotype

An organism’s set of observable traits, such as the colour of a plant’s flowers, is called its phenotype.  These traits depend on the proteins that produce physical characteristics and the genes that code for them.

  • Variations in pigmentation proteins are what give panda bears their unique black-and-white coat.
  • A snail’s shell is hard because of the shell’s protein matrix.
  • The proteins produced by some mushrooms are secondary metabolites with medicinal or toxic properties.
  • Yaks have a mutation that increases their production of hemoglobin (a protein), which improves their ability to transport oxygen at a high altitude.
  • The opsin proteins in the eyes of bees are sensitive to ultraviolet light and flower patterns invisible to the human eye, which is crucial for flower pollination.
  • Some insects develop resistance to pesticides by producing enzymes (proteins) that neutralize these chemicals.
  • Certain fish that live in icy waters produce antifreeze proteins that prevent ice crystals from forming in their blood and let them survive in extreme conditions.

In search of DNA

In collaboration with UQAM’s Coeur des sciences, “In search of DNA” is a participatory conference for high-school students in both Cycle 1 and Cycle 2.

Virtually invite a Ph.D. student into your classroom to transform one period (approx. 60 minutes) into a “Sprint de science”!

Immerse yourself in the world of genomics research and discover how to decode DNA collected from the environment. Through various activities, try to answer the research question put to you. You’ll need to use all your knowledge and make connections between the cell, DNA, ecosystems and living organisms.

To register, consult the fact sheet and links to the Québec Education Program (QEP), visit the Coeur des sciences de l’UQAM website (French only).

.fwpl-layout, .fwpl-row { display: grid; } .fwpl-layout.el-980rpe { grid-template-columns: repeat(1, 1fr); grid-gap: 10px; } .fwpl-row.el-i0l7y8 { grid-template-columns: 1fr; }