Assignment_2_ANS

.pdf

School

University of British Columbia *

*We aren’t endorsed by this school

Course

475

Subject

Biology

Date

Jan 9, 2024

Type

pdf

Pages

2

Uploaded by CorporalCloverCaterpillar16

Report
MICB 475 Assignment 2 Complete the following worksheet and hand in to Canvas as a PDF. Using the Parkinson’s Mouse dataset, complete the following: AFTER DEMULTIPLEXING: Data/information needed Your answer Total number of reads 253,874 Total number of samples 48 Range of sequencing depth 4237-16327 (represents the sample with lowest library size and highest) Maximum read length (bp) 150 nts Were all the reads the same length? yes Truncation length selected 0-150 (no truncation is ideal but if they decided to trim it might be on the right by 10 or so nts) Explain why you selected the above truncation length: The quality score was high throughout the 150 nts. The median quality didn’t really decrease even at the end of each read. You can potentially trim ~10bp at the right or left end considering there is a bit of a decrease at those ends but it would be incorrect to trim any more than that. When trimming, consider that if you do NOT trim, you retain but base information which can be better for downstream analysis but can result in more discarded reads after denoising due to the quality. If you trim too much, you risk losing base information but can retain more reads after denoising. AFTER DENOISING/CLUSTERING: These answers are based on the use of DADA2 and no trimming. Data/information needed Your answer Total number of retained 196,029 Total number of ASVs 287 Total number of samples 48 Range of sequencing depth 347 - 4996 Did the number of samples change? If it did, why do you think it did? No, it didn’t change. If it had, it’s likely that all reads in that sample were low quality reads. Name: ANS
MICB 475 Using the interactive plot in the table.qzv file, select the “donor_status” metadata category and answer the following questions: 1. How many categories exist under the donor_status metadata category? 2 2. What is the sample size for each category? 24 each 3. What is the maximum sampling depth you can go to maintain at least 10 samples per category? 4454, when you are using the slider make sure you don’t just find where it has 10 samples each but also change the sampling depth one by one to find the actual exact threshold where one sample goes down to n=9 4. How many features (ie. reads) and samples do you retain at the sampling depth you selected for question 3 above? 89,080 reads; 20 samples
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help