Objective: Get a sense of how genomics, the study of the genome in its entirety, needs to think about how to go about its research.   Geonomic DNA is broken up into fragments. The 5’ and 3’ ends of each fragment (a “read”) are sequenced. The sequenced reads are assembled together into contiguous sequences (“contigs”) based on sequence similarity.   The idea is to sequence enough random fragments so that every nucleotide in the genome is represented on some read. The number of such fragments needed is called the coverage, c.   The coverage c can be calculated by the formula RL/G, where R is the number of reads sequenced, L is the average length of a read and G is the total length of the genome. The units of length are bases (b) or base pairs (bp).   Consider a genome whose length is 1000 bp. “Shotgun” sequencing techniques are applied to the genome, resulting in 20 reads, with an average length of 50 bp. A very important point is that, even though 20 x 50 = 1000, there is no guarantee that ALL 1000 bp of the genome are represented in the fragments.    If the fragments are truly random and uniform, then the probability that a particular nucleotide is not sequenced on some fragment is given by e–c. Therefore, the probability that all of the nucleotides are sequenced is 1 – e–c.   Given the values given above, what is the probability that all of the nucleotides are sequenced in that genome?

Human Heredity: Principles and Issues (MindTap Course List)
11th Edition
ISBN:9781305251052
Author:Michael Cummings
Publisher:Michael Cummings
Chapter15: Genomes And Genomics
Section: Chapter Questions
Problem 6QP: Which of the following best describes the process of DNA sequencing? a. DNA is separated on a gel,...
icon
Related questions
Question
Objective: Get a sense of how genomics, the study of the genome in its entirety,
needs to think about how to go about its research.
 
Geonomic DNA is broken up into fragments. The 5’ and 3’ ends of each fragment
(a “read”) are sequenced. The sequenced reads are assembled together into
contiguous sequences (“contigs”) based on sequence similarity.
 
The idea is to sequence enough random fragments so that every nucleotide in the
genome is represented on some read. The number of such fragments needed is
called the coverage, c.
 
The coverage c can be calculated by the formula RL/G, where R is the number of
reads sequenced, L is the average length of a read and G is the total length of the
genome. The units of length are bases (b) or base pairs (bp).
 
Consider a genome whose length is 1000 bp. “Shotgun” sequencing techniques
are applied to the genome, resulting in 20 reads, with an average length of 50 bp.
A very important point is that, even though 20 x 50 = 1000, there is no guarantee
that ALL 1000 bp of the genome are represented in the fragments. 
 
If the fragments are truly random and uniform, then the probability that a
particular nucleotide is not sequenced on some fragment is given by e–c.
Therefore, the probability that all of the nucleotides are sequenced is 1 – e–c.
 
Given the values given above, what is the probability that all of the
nucleotides are sequenced in that genome?
 
Expert Solution
steps

Step by step

Solved in 3 steps with 1 images

Blurred answer
Knowledge Booster
Genome annotation
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, biology and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Human Heredity: Principles and Issues (MindTap Co…
Human Heredity: Principles and Issues (MindTap Co…
Biology
ISBN:
9781305251052
Author:
Michael Cummings
Publisher:
Cengage Learning
Biology (MindTap Course List)
Biology (MindTap Course List)
Biology
ISBN:
9781337392938
Author:
Eldra Solomon, Charles Martin, Diana W. Martin, Linda R. Berg
Publisher:
Cengage Learning