Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

worksheet 2 biology assignment, Assignments of Biology

genetic data, genome sequences, and different types of mutations

Typology: Assignments

2019/2020

Uploaded on 01/31/2024

jackie-garza
jackie-garza 🇺🇸

1 document

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Worksheet #2 Name:
1. Briefly describe the two competing theories regarding the origin of Homo sapiens and its distributional
expansion from its origin in Africa.
-The two competing theories regarding the origin of Homo sapiens and the distributional expansion from
Africa are the Multi-regional hypothesis and the Out of Africa 2 model. The multi-regional hypothesis
focuses on a steady evolutionary process that occurred in different regions. It states that early hominid
ancestors migrated out of Africa and that modern human evolution too place in various parts of the
world. The Out of Africa model argues that once anatomically modern humans evolved, they travelled
out of Africa to explore, colonize, and rapidly replace the archaic Homo population.
2. What does the genetic data say about these two hypotheses?
-The genetic data backs up the Out of Africa hypothesis because of the continent’s genetic diversity. This
high level of genetic variety shows that the human population must have evolved there first and then
later spread out.
3. Genetic analyses of the human species originally relied primarily on data from the mitochondrial
genome and the Y-chromosome. What feature of these two portions of the genome make them useful
for population studies such as the recent history of humans?
-Mitochondrial DNA and Y chromosome are mostly used to study the human population history because
of several reasons: Mitochondrial DNA is exclusively transferred from mother to children because
mitochondrial DNA present in sperms usually gets lost at the time of fertilization. So as to study
maternal ancestry researchers use mitochondrial DNA. Even there are different diseases linked to
maternal mitochondria which are inherited from mother to child. Y chromosome is the only
chromosome present in the cell which do not take part in the recombination or exchange their DNA with
another chromosome, unlike others. Therefore, each and every genetic information is transferred from
father to son, this what make Y chromosome so much special than any other chromosomes. Y
chromosome is only present in males, so to study the paternal ancestry, Y chromosome is sequenced and
analyzed by using different software. Different phylogenetic trees and cladograms are studied so as to
carry out the ancestry.
4. One common misconception about methods for identifying genes in genome sequence is by locating all
the start codons (ATG) and stop codons (TGA, TAG, TAA). Briefly explain why this strategy is not
helpful in the identification of genes?
- Each set of nucleotides corresponds to specific amino acid. It can be start or stop codons. If we try
locating both start and stop codons some genomes or amino acid will miss or additionally added. for
example, AUG code for methionine but also act as start codon. Gene's molecular address marks the
gene. It will describe the Gene's exact and accurate position on chromosome, location of that gene
sequence. protein-coding genes are required for identifying the same.
5. How can we identify genes in unmapped genome sequence?
- From the unmapped reads we generated de novo assemblies, after which the generated sequence contigs
were aligned to the NCBI non-redundant nucleotide database using BLAST, identifying the closest
known matching sequence.
pf3

Partial preview of the text

Download worksheet 2 biology assignment and more Assignments Biology in PDF only on Docsity!

Worksheet #2 Name:

  1. Briefly describe the two competing theories regarding the origin of Homo sapiens and its distributional expansion from its origin in Africa.
  • The two competing theories regarding the origin of Homo sapiens and the distributional expansion from Africa are the Multi-regional hypothesis and the Out of Africa 2 model. The multi-regional hypothesis focuses on a steady evolutionary process that occurred in different regions. It states that early hominid ancestors migrated out of Africa and that modern human evolution too place in various parts of the world. The Out of Africa model argues that once anatomically modern humans evolved, they travelled out of Africa to explore, colonize, and rapidly replace the archaic Homo population.
  1. What does the genetic data say about these two hypotheses?
  • The genetic data backs up the Out of Africa hypothesis because of the continent’s genetic diversity. This high level of genetic variety shows that the human population must have evolved there first and then later spread out.
  1. Genetic analyses of the human species originally relied primarily on data from the mitochondrial genome and the Y-chromosome. What feature of these two portions of the genome make them useful for population studies such as the recent history of humans?
  • Mitochondrial DNA and Y chromosome are mostly used to study the human population history because of several reasons: Mitochondrial DNA is exclusively transferred from mother to children because mitochondrial DNA present in sperms usually gets lost at the time of fertilization. So as to study maternal ancestry researchers use mitochondrial DNA. Even there are different diseases linked to maternal mitochondria which are inherited from mother to child. Y chromosome is the only chromosome present in the cell which do not take part in the recombination or exchange their DNA with another chromosome, unlike others. Therefore, each and every genetic information is transferred from father to son, this what make Y chromosome so much special than any other chromosomes. Y chromosome is only present in males, so to study the paternal ancestry, Y chromosome is sequenced and analyzed by using different software. Different phylogenetic trees and cladograms are studied so as to carry out the ancestry.
  1. One common misconception about methods for identifying genes in genome sequence is by locating all the start codons (ATG) and stop codons (TGA, TAG, TAA). Briefly explain why this strategy is not helpful in the identification of genes?
  • Each set of nucleotides corresponds to specific amino acid. It can be start or stop codons. If we try locating both start and stop codons some genomes or amino acid will miss or additionally added. for example, AUG code for methionine but also act as start codon. Gene's molecular address marks the gene. It will describe the Gene's exact and accurate position on chromosome, location of that gene sequence. protein-coding genes are required for identifying the same.
  1. How can we identify genes in unmapped genome sequence?
  • From the unmapped reads we generated de novo assemblies, after which the generated sequence contigs were aligned to the NCBI non-redundant nucleotide database using BLAST, identifying the closest known matching sequence.
  1. The sequence below is a randomly excised (cut out without considering reading frame) portion of the human gene CNTNAP2. Explain how you could find the correct reading frame simply by examining the sequence. (NOTE: You don’t need to find the correct reading frame, rather I want to know the strategy for finding the correct reading frame)
  • To identify a reading frame you locate a sequence corresponding to a start codon in order to determine the reading frame, the ATG (sense strand). You read the sequence in base triplets until a stop codon is reached (TGA, TAG, or TAA). Each reading frame divides the sequence into a series of threes. There are three possible reading frames in any sequence, depending on the starting point. CGCGCCGGCTGCGGGGCAGCGCTCCTGCTGTGGATTGTCAGCAGCTGCCTCTGCAGAGCCTGGACGGCTCCCTCCACGTCCCAAAAATGTGATGAGCCACTTGTCTCTGGACTCCCCCATGTGGCTTTCAGCAGCTCCTCCTCCATCTCTGGTAGCTATTCTCCCGGCTATGCCAAGATAAACAAGAGAGGAGGTGCTGGGGGATGGTCTCCATCAGACAGCGACCATTATCAATGGCTTCAGGTTG ACTTTGGCAATCGGAAGCAGATCAGTGCCATTGCAACCCAAGGAAGGTATAGCAGCTCAGATTGGGTGACCCAATACCGGATGCTCTACAGCGACACAGGGAGAAACTGGAAACCCTATCATCAAGATGGGAATATCTGGGCATTTCCCGGAAACATTAACTCTGACGGTGTGGTCCGGCACGAATTACAGCATCCGATTATTGCCCGCTATGTGCGCATAGTGCCTCTGGATTGGAATGGAGAA GGTCGCATTGGACTCAGAATTGAAGTTTATGGCTGTTCTTACTGGGCTGATGTTATCAACTTTGATGGCCATGTTGTATTACCATATAGATTCAGAAACAAGAAGATGAAAACACTGAAAGATGTCATTGCCTTGAACTTTAAGACGTCTGAAAGTGAAGGAGTAATCCTGCACGGAGAAGGACAGCAAGGAGATTACATTACCTTGGAACTGAAAAAAGCCAAGCTGGTCCTCAGTTTAAACTTA GGAAGCAACCAGCTTGGCCCCATATATGGCCACACATCAGTGATGACAGGAAGTTTGCTGGATGACCACCACTGGCACTCTGTGGTCATTGAGCGCCAGGGGCGGAGCATTAACCTCACTCTGGACAGGAGCATGCAGCACTTCCGTACCAATGGAGAGTTTGACTACCTGGACTTGGACTATGAGATAACCTTTGGAGGCATCCCTTTCTCTGGCAAGCCCAGCTCCAGCAGTAGAAAGAATTT CAAAGGCTGCATGGAAAGCATCAACTACAATGGCGTCAACATTACTGATCTTGCCAGAAGGAAGAAATTAGAGCCCTCAAATGTGGGAAATTTGAGCTTTTCTTGTGTGGAACCCTATACGGTGCCTGTCTTTTTCAACGCTACAAGTTACCTGGAGGTGCCCGGACGGCTTAACCAGGACCTGTTCTCAGTCAGTTTCCAGTTTAGGACATGGAACCCCAATGGTCTCCTGGTCTTCAGTCACTTT GCGGATAATTTGGGCAATGTGGAGATTGACCTCACTGAAAGCAAAGTGGGTGTTCACATCAACATCACACAGACCAAGATGAGCCAAATCGATATTTCCTCAGGTTCTGGGTTGAATGATGGACAGTGGCACGAGGTTCGCTTCCTAGCCAAGGAAAATTTTGCTATTCTCACCATCGATGGAGATGAAGCATCAGCAGTTCGAACTAATAGTCCCCTTCAAGTTAAAACTGGCGAGAAGTACTTT TTTGGAGGTTTTCTGAACCAGATGAATAACTCAAGTCACTCTGTCCTTCAGCCTTCATTCCAAGGATGCATGCAGCTCATTCAAGTGGACGATCAACTTGTAAATTTATACGAAGTGGCACAAAGGAAGCCGGGAAGTTTCGCGAATGTCAGCATTGACATGTGTGCGATCATAGACAGATGTGTGCCCAATCACTGTGAGCATGGTGGAAAGTGCTCGCAAACATGGGACAGCTTCAAATGCACT TGTGATGAGACAGGATACAGTGGGGCCACCTGCCACAACTCTATCTACGAGCCTTCCTGTGAAGCCTACAAACACCTAGGACAGACATCAAATTATTACTGGATAGATCCTGATGGCAGCGGACCTCTGGGGCCTCTGAAAGTTTACTGCAACATGACAGAGGACAAAGTGTGGACCATAGTGTCTCATGACTTGCAGATGCAGACGCCTGTGGTCGGCTACAACCCAGAAAAATACTCAGTGAC ACAGCTCGTTTACAGCGCCTCCATGGACCAGATAAGTGCCATCACTGACAGTGCCGAGTACTGCGAGCAGTATGTCTCCTATTTCTGCAAGATGTCAA
  1. If you have aligned orthologous gene sequences from an important, conserved gene; there is a different way you can determine reading frame. a. The following sequences of orthologous genes are aligned but the reading frame is unknown. What is the reading frame for these sequences? (i.e., Does the first complete codon begin at the first, second, or third nucleotide?)
  • The first complete codon begins at the second nucleotide. b. Now that you have established the correct reading frame, use the chart below and the one-letter amino acid abbreviations, translate the first eight complete codons of this gene for each species. AA1 AA2 AA3 AA4 AA5 AA6 AA7 AA Human P Y T V P V F F Lemur P Y T V P V F F Mouse P Y T V L V F F Bat P Y T V P V F F c. What do you notice about the differences between the amino acid (protein) sequences compared to the differences between nucleotide (DNA) sequences?
  • There is little difference at the amino acid level, despite some mutations at DNA level. However, at the end, we do see a lot more divergence. d. Synonymous mutations are most likely to occur at what position in the reading frame?
  • Nucleotide changes in the third position in the open reading frame often lead to synonymous mutations.