-
Notifications
You must be signed in to change notification settings - Fork 14
Description
Hi there,
I noticed differences in the VCF outputs between v1.5 and v1.5.3 when processing the same locus on the same sample, and I’m looking for some clarity on how to interpret the REF and ALT fields in v1.5.3. In v1.5, the REF and ALT fields seem to provide the full repeat sequences. However, in v1.5.3 the fields have changed and I am wondering whether I can get the similar sequences output in v1.5.3. I would prefer to continue using v1.5.3, since v1.5 occasionally throws an IndexError: list index out of range.
Additionally, I observed substantial differences in the reported results between v1.5 and v1.5.3. Could these discrepancies be due to the updated GMM clustering post-processing introduced in v1.5.2?
In v1.5.3,
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT .
chr1 1435798 . T . PASS RUL_REF=10 GT:DP:AD 0:15:13
In v1.5,
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT .
chr1 1435798 . GGCGCGGAGCGGCGCGGAGC GGCGCTCGGCCTGGCCCTGAGCTTGCGGCTGGCGCTGGCGCGGAGCGGCGCGAGCGCGGTGAGTGCGGCGGGCGGCCGGGCCGGG . PASS END=1435818;MOTIF=GGCGCGGAGC;COPIES=2.0 GT:DP:AL:ALR:AC:ACR:AD:ALT_MOTIF 1/1:15:17.9/17.9:2-48/2-48:3.0/3.0:0.3-8.0/0.3-8.0:14/14:GGCGCT(10);GGCCG(1);GCGGCGG(1)
Any suggestions would be appreciated.
Best,
Hsin