Show simple item record

dc.contributor.authorSimons, Kristin J.
dc.contributor.authorMcClean, Phillip E.
dc.contributor.authorOsorno, Juan M.
dc.contributor.authorOladzad, Atena
dc.contributor.authorPasche, Julie S.
dc.contributor.authorLamppa, Robin
dc.description.abstractThe dataset consists of genotyping and common bacterial blight phenotyping information from genotypes within the NDSU Dry Bean Breeding Program. The Middle American data set consists of 713 genotypes and the Andean dataset consists of 139 genotypes. Both Middle American and Andean lines were phenotyped with common bacterial blight at both the unifoliate and trifoliate stages and the medians recorded. DNA was isolated from each line and sequenced using a single-end Illumina platform. Sequences were quality trimmed using SICKLE and then aligned to the Phaseolus vulgaris v2.1 reference sequence (DOE-JGI and USDA-NIFA, http://phytozome.jgi.doe.gov), indexed and sorted using BWA-MEMB and SAMtools. Read groups including library ID, platform and platform unit were added to each alignment within the BAM files using Picard (http://broadinstitute.github.io/picard/). Unifiedgenotyper from GATK3.6 (DePristo et al. 2011) was used to call variants with quality scores above 10. Quality scores between 10 and 30 were marked as low quality. Variants with a read depth of less than two were filtered using GATK3.6 variantfiltration and subsequently replaced as missing data. Low quality variants were removed via hard filtering when variants contained more than 25% missing data (50% in the MA SNP data set), more than one nucleotide, more than two alleles, or the minor allele was less than 5% in the Andean dataset(<1% in the MA SNP dataset). Genotypes with more than 90% missing data were removed. SNPs with less than 25% in the Andean dataset (50% in MA SNP dataset) of missing data were imputed in fastPHASE. The output file was converted to a hmp file for distribution. The dataset was used for identifying genomic regions associated with resistance to common bacterial blight in dry beans and can be mined for other SNPs of interest.en_US
dc.description.abstractThese datasets are used in a study that can be found in the NDSU repository at https://hdl.handle.net/10365/32840en_US
dc.title2015 NDSU Bean Breeding Program Genotyping Snapshot en_US
dc.typeDataseten_US
dc.date.accessioned2020-10-21T21:39:05Z
dc.date.available2020-10-21T21:39:05Z
dc.identifier.urihttps://hdl.handle.net/10365/31610
dc.subjectbeanen_US
dc.subjectbreedingen_US
dc.subjecthapmapen_US
dc.subjectCBBen_US
dc.subjectgenotypingen_US
dc.subjectMiddle Americanen_US
dc.subjectAndeanen_US
dc.subject.lcshBeans -- Breeding.en_US
dc.subject.lcshBeans -- Genetics.en_US
dc.subject.lcshCommon bean blight.en_US
dc.description.sponsorshipUSDA Agricultural Marketing Service grant 15-SCBGP-ND-0026en_US
dc.language.isoen_USen_US
dc.relation.isreferencedbyhttps://hdl.handle.net/10365/32840
ndsu.collegeAgriculture, Food Systems and Natural Resources
ndsu.departmentPlant Sciences


Files in this item

Thumbnail
Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record