- The goals of the project are to identify introgression in barley accessions with a wild phenotype
- There is strong evidence that some of the WBDC accessions have experienced introgression
- Evidence comes from marker-based analysis of similarity to cultivated barley and from phenotypes that aren't fully "wild" such as changes in the degree of shattering.
- Given the many domestication-related genes in barley, chromosome-level genetic assignment could identify regions where domestication related alleles have or have not been introgressed into wild barley.
- Fang et al. 2014 report analysis of the BOPA SNP set in the WBDC collection.
- The full genotyped dataset is available as raw .tsv with Alchemy genotype calls are in the DRUM
- The WBDC has also been genotyped with GBS as reported in Sallam et al. 2017
- The full, filtered dataset is at this path:
/panfs/jay/groups/9/morrellp/shared/Projects/WBDC_inversions/Ahmad_GBS_morex_v3 - The file with 314 individuals is
final_filtered_treatmissing_biallilic_all_chrs_mafgt0.0016.recode.vcfbut it may be necessary to further filter variants with high heterozygosity or other issues.- Additional filtering may be needed to match filtering in the file with only 307 individuals:
WBDC_GBS_noIntro_307_mafgt0.0016.recode.tags.vcf.gz
- Additional filtering may be needed to match filtering in the file with only 307 individuals:
- The full, filtered dataset is at this path:
- The iSelect 9K dataset from the USDA Core Collection includes most, but not all the BOPA SNPs. A large set of cultivated and landrace lines were genotyped and reported by Muñoz-Amatriaín et al. 2014
- The dataset as reported by Poets et al. 2015 is available on Figshare. There is a GitHub repository that may be useful for processing this dataset.
The fully processed exome capture data set including the introgressed individuals will be in this directory once filtered:
/panfs/jay/groups/9/morrellp/shared/Datasets/Alignments/introgression_project/all_dom_and_wild/Filtered- Admixture has a haploid mode that should be useful for comparing WBDC accessions to domesticated samples from the USDA Core Collection.
- It may be necessary to phase the genotyping data for some analyses that follow, especially genetic assignment of chromosomes or chromosome painting.
- SHAPEIT4 is a current phasing tool that appears to be relatively easy to use.
- Plink identity by state analysis can identify nearly identical regions. This may still require Plink 1.9.
- MOSAIC is current tool for chromosome painting. It requires phased data.
- The genes most associated with domestication in barley involve loss of shattering, loss of seed dormancy, and row number. At least one gene has been cloned for each of these phenotypes. We may need to identify Morex_v3 positions.
- The file is [here][13].
[13]: